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Introduction 


The  34th  annual  Sanibel  Symposium,  organized  by  the  faculty  and  staff  of  the 
Quantum  Theory  Project  of  the  University  of  Florida,  was  held  on  February  12- 
19,  1994  at  the  Marriott  at  Sawgrass  Resort,  Ponte  Vedra  Beach,  Florida.  More 
than  300  participants  gathered  for  8  days  of  lectures  and  informal  discussions. 

The  format  of  the  symposium  adopted  for  the  past  few  years  was  followed  again 
this  year  with  a  compact  8-day  schedule  with  an  integrated  program  of  quantum 
biology,  quantum  chemistry,  and  condensed  matter  physics.  The  topics  of  the  ses¬ 
sions  covered  by  these  proceedings  include  Propagator  and  Resolvent  Theory,  Per¬ 
turbation  Theory,  Density  Functional  Theory  for  Molecules,  High  Performance 
Parallel  Computing  Algorithms,  Weakly  Bound  Molecular  Systems,  Theoretical 
Inorganic  Chemistry,  Temporal  Phenomena,  Quantum  Dynamics  at  Interphases, 
Energy  Depositions,  Non-Linear  Optical  Phenomena,  and  Novel  Condensed 
Systems. 

The  articles  have  been  subjected  to  the  ordinary  refereeing  procedures  of  the 
International  Journal  of  Quantum  Chemistry .  The  articles  presented  in  the  sessions 
on  quantum  biology  and  associated  poster  sessions  are  published  in  a  separate 
volume  of  the  International  Journal  of  Quantum  Chemistry. 

The  organizers  acknowledge  the  following  sponsors  for  their  support  of  the  1 994 
Sanibel  Symposium: 

•  The  Office  of  Naval  Research  through  Grant  NOOO 14-93- 1-0343.  “This  work 
relates  to  Department  of  Navy  Grant  NOOO  14-93- 1-0343  issued  by  the  Office 
of  Naval  Research.  The  United  States  Government  has  a  royalty-free  license 
throughout  the  world  in  all  copyrightable  material  contained  herein.” 

•  U.S.  Army  Research  Office  (Physics)/CRDEC  and  U.S.  Army  Edgewood 
RD&E  Center  through  Grant  DAAH04-94-G-0015.  “The  views,  opinions,  and/ 
or  findings  contained  in  this  report  are  those  of  the  author(s)  and  should  not 
be  construed  as  an  official  Department  of  the  Army  position,  policy,  or  decision, 
unless  so  designated  by  other  documentation.” 

•  U.S.  Department  of  Energy  through  Grant  DE-FG05-94ER61785. 

•  International  Science  Foundation. 

•  CAChe. 

•  IBM. 

•  Silicon  Graphics. 

•  Sun  Microsystems. 

•  The  University  of  Florida. 
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Very  special  thanks  go  to  the  staff  of  the  Quantum  Theory  Project  of  the  University 
of  Florida  for  handling  the  numerous  administrative,  clerical,  and  practical  details. 
The  organizers  are  proud  to  recognize  the  contributions  of  Mrs.  Judy  Parker,  Ms. 
Leann  Golemo,  Mrs.  Karen  Yanke,  Ms.  Sandra  Weakland,  Mr.  Sullivan  Beck,  Dr. 
Agustm  Diz,  and  Dr.  Erik  Deumens.  All  the  graduate  students  of  the  Quantum 
Theory  Project,  who  served  as  “gofers”  are  gratefully  recognized  for  their  contri¬ 
butions  to  the  1994  Sanibel  Symposium. 


N.  Y.  Ohrn 
J.  R.  Sabin 
M.  C.  Zerner 


On  the  Occasion  of 
Yngve  Ohrn’s  60th  Birthday 

The  34th  Sanibel  Meeting  was  distinguished  by  recognizing  the  60th  birthday  of 
one  of  its  principal,  long-time  (33  years)  organizers,  Yngve  Ohm.  Two  sessions  at 
the  meeting  entitled  “Ode  on  a  Swedish  Ohm”  and  “Ode  on  a  Swedish  Ohm: 
Second  Stanza”  emphasized  some  of  the  research  advances  whose  roots  lie  in  Yngve’s 
group.  (See  the  representative  selection  of  papers  below.)  You  might  have  expected 
these  sessions  to  deal  with  “Proper-gators,”  as  Yngve  and  Jan  Linderberg  were 
primarily  responsible  for  establishing  these  methods  as  one  of  the  three  principal 
approaches  to  the  correlation  problem  (along  with  MBPT/CC  and  Cl),  and  they 
did  to  some  extent;  but  reflecting  Yngve’s  group’s  continuing,  original  contributions, 
they  also  featured  their  new  combined  electron-nuclear  dynamics  approach  that 
uses  coherent  states,  but  no  Born-Oppenheimer  approximation  or  potential  energy 
surfaces,  i.e.,  the  “END”  method!  In  these  sessions,  in  many  poster  presentations, 
and  now,  in  these  Proceedings,  we  have  heard  from  Yngve’s  students  and  postdocs, 
and  as  a  credit  to  him  as  a  teacher,  they  were  inspired  to  establish  their  original 
research  paths.  Yet  their  contributions  are  characterized  by  the  same  rigor  and 
attention  to  detail  that  they  initially  learned  in  the  Ohm  group.  (A  list  of  his  students 
and  postdoctoral  associates  is  appended.) 

No  doubt  building  upon  administrative  skills  learned  as  a  chief  organizer  of  the 
Sanibel  Meeting,  Yngve  spent  five  years  distinguishing  himself  as  an  unusually 
successful  department  chairman.  One  of  his  accomplishments  was  turning  one 
senior  position  in  QTP  into  two,  for  which  two  of  us  are  especially  grateful!  A  short 
list  of  other  accomplishments  include  marrying  Ann,  a  tremendous  asset;  mem¬ 
berships  in  the  Swedish,  Finnish,  and  Danish  academies;  editing  the  International 
Journal  of  Quantum  Chemistry;  being  the  director  of  QTP  for  more  than  10  years; 
and,  while  in  high  school,  being  regional  champion  in  the  100  meter  dash,  high 
jump,  and  discus,  while  finishing  third  in  Sweden  as  a  member  of  the  4X100  relay 
team!  He  also  played  left  field  for  the  QTP  softball  team  that  retired  undefeated! 

Yngve,  many  happy  returns  from  your  many  friends. 

Rodney  J.  Bartlett 
Erik  Deumens 
Per-Olov  Lowdin 
David  A.  Micha 
Hendrik  J.  Monkhorst 
John  R.  Sabin 
Samuel  B.  Trickey 
Michael  C.  Zerner 
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Abstract 

Developments,  through  some  35  years,  in  the  theory  of  electronic  structure  and  reaction  dynamics 
are  reviewed  from  the  perspective  available  to  a  close  collaborator  to  Yngve  Ohrn.  ©  1994  John  Wiley  & 
Sons,  Inc. 


Ode 

A  scholar  by  name  Yngve  Ohrn 
approaches  a  rather  sharp  turn 
going  full  pace, 
he's  really  an  ace, 
it  seems  he'll  never  adjourn. 


Three  dozen  years  ago,  when  I  was  working  quietly  at  my  desk  at  the  Quantum 
Chemistry  Group  at  Uppsala  University,  Yngve  Ohrn  presented  himself  at  my  door 
as  having  been  hired  by  Per-Olov  Lowdin  as  a  new  assistant  at  the  group.  He  came 
with  an  excellent  academic  record,  a  soon  to  be  completed  officer’s  training  in  the 
Signal  Corps  of  the  Swedish  army,  and  he  was  a  sharp  athlete.  It  turned  out,  with 
time,  that  the  two  of  us  had  a  similar  background  from  steel  mill  towns  of  less  than 
urban  settings.  I  have  been  most  fortunate  to  become  a  personal  friend  and  associate 
of  Yngve  Ohrn  and  I  will  attempt  to  give  a  sketch  of  his  contributions  to  the  field 
of  quantum  chemistry. 

Electronic  correlation  was  at  the  top  of  the  agenda  at  Uppsala  in  the  late  fifties, 
Lowdin  had  published  his  comprehensive  review  [1]  and  Lajos  Redei  had  completed 
his  study  on  the  mixed  use  of  superposition  of  configurations  and  correlation  factors 
[2].  Second  quantization  methods  were  gaining  prominence  in  many-particle 
problems  and  I  was  assigned  to  present  the  classical  papers  of  Born,  Jordan,  and 
Fock  [3]  to  the  Quantum  Chemistry  Group.  Yngve  was  directed  towards  the  use 
of  interparticle  coordinates  for  three  electron  systems,  an  arduous  task  with  many 
mathematical  and  numerical  ramifications.  The  paper  [4],  which  came  out  of  this, 
demonstrates  the  analytical  clarity  and  algorithmic  proficiency  that  has  been 
a  hallmark  of  Ohrn’s  writings  ever  since.  It  paved  the  way  for  some  of  the 
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most  accurate  calculations  on  the  states  of  the  Lithium-like  ions  that  have  been  per¬ 
formed  [  5  ] . 

Part  of  the  work  on  the  multidimensional  integrals  required  for  the  use  of  inter¬ 
particle  coordinates  was  completed  during  Yngve  Ohm’s  and  Jan  Nordling’s  tenure 
as  resident  Swedes  at  the  Quantum  Theory  Project  in  1961-63.  Yngve  produced, 
while  at  Florida,  another  piece  of  work,  together  with  Reid,  that  was  indicative 
of  his  talent  and  interests  in  correlation  phenomena.  A  direct  method  for  the 
computation  of  the  natural  orbitals  was  designed  and  implemented  with  good 
results  [6]. 

Ohrn  returned  to  Uppsala  in  the  spring  of  1963,  after  having  ended  his  stay  in 
the  United  States  with  a  period  at  Wisconsin,  in  Hirschfelder’s  group,  where  he 
befriended,  and,  I  am  sure,  impressed  a  number  of  people,  among  them  Egil  Hyl- 
leraas  from  Oslo.  Yngve’s  presence  gave  me  the  opportunity  be  part  of  a  collaborative 
effort  with  him.  Our  first  joint  project  was  spawned  by  Anders  Froman  and  con¬ 
cerned  excited  states  of  the  Cesium  atom  [7].  The  spectroscopist  Kjell  Bockasten 
could  not  describe  the  levels  of  the  2F-series  with  conventional  fitting  procedures 
and  Froman  identified  the  problem  as  arising  from  the  penetration  of  the  excited 
orbital  into  the  core  region  where  the  effective  potential  just  about  supports  a  bound 
/state.  Yngve  and  I  did  the  numerical  work  and  confirmed  this  analysis. 

The  Cesium  problem  made  us  aware  that  we  worked  well  together  and  we  em¬ 
barked  on  another  project  which  was  to  influence  our  lives  strongly.  John  Hubbard, 
who  made  significant  contributions  towards  an  understanding  of  the  correlation 
problem  in  the  free  electron  gas  in  the  middle  of  the  fifties,  came  out  with  his  first 
papers  on  the  correlation  phenomena  in  narrow  energy  bands  and  magnetic  ma¬ 
terials  in  1963.  His  set-up  was  quite  similar  to  the  Pariser-Parr-Pople  model  for 
conjugated  systems  and  we  took  upon  ourselves  to  explore  Hubbard’s  notions  in 
this  field.  We  connected  our  efforts  to  the  most  elegant  work  of  Coulson  and 
Longuet-Higgins  from  the  forties  [8]  and  had  early  gratification  with  regard  to  the 
descriptions  of  excitation  features  of  conjugated  hydrocarbons  [9].  These  papers 
were  part  of  the  dissertation  that  Yngve  Ohm  defended  for  the  degree  Doctor  of 
Philosophy  at  the  University  of  Uppsala  in  May  1966.  Our  esteemed  teacher, 
Per-Olov  Lowdin,  used  the  occasion  to  plant  a  seed  of  further  inspiration  by  ques¬ 
tioning  the  N-representability  of  the  Green  function  decoupling  methods. 

I  will  not  dwell  further  on  the  development  of  propagator  theory,  a  special  section 
of  this  symposium  is  dedicated  to  this,  but  I  ensure  you  that  our  work  on  the 
monograph  Propagators  in  Quantum  Chemistry  [10]  was  a  most  satisfying  expe¬ 
rience.  A  good  deal  of  it  took  place  at  Aarhus  where  Ohrn  was  a  visiting  professor 
in  1970-71. 1  sometimes  refer  to  him  as  my  contraauthor  for  his  kind  and  insistent 
efforts  to  straighten  out  my  algebra  and  syntax. 

Yngve  Ohm  resumed  his  career  at  Florida  in  1 966.  He  initiated  a  series  of  detailed 
and  accurate  investigations  of  diatomic  systems  using  the  methods  of  superposition 
of  configurations.  The  use  of  natural  orbitals  rather  than  conventional  Hartree- 
Fock  based  orbitals  proved  economical  and  led  to  results  that  have  lasting  value 
for  the  interpretation  of  spectral  features.  Uniformly  accurate  potential  curves  were 
obtained  for  neutral,  as  well  as  for  positive  and  negative,  species  [11]. 
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Aarhus  University  has  derived  great  benefits  from  the  work  of  Yngve  Ohrn,  he 
has  been  a  frequent  visitor  and  has  spent  extended  periods  with  us.  He  has  extended 
generous  hospitality  to  a  large  number  of  Danes  at  Gainesville  and  taught  them 
the  stern  lessons  of  hard  work  and  accuracy.  In  addition  he  has  entrusted  us  to  care 
for  several  of  his  graduates.  His  influence  has  been  felt  also  at  Copenhagen  and 
Odense,  and  he  was  elected  as  a  foreign  member  of  the  Royal  Danish  Academy  of 
Sciences  and  Letters  in  recognition  of  his  scientific  standing. 

Yngve  Ohm  served  as  chairman  of  the  Department  of  Chemistry  at  the  University 
of  Florida  in  the  late  seventies  and  early  eighties.  He  led  the  department  with 
integrity  and  scholarly  leadership  to  increased  strength,  most  notably  for  our  com¬ 
munity,  by  bringing  Rodney  Bartlett  and  Michael  Zerner  to  the  faculty.  I  was 
particularly  impressed  with  the  creation  of  an  advisory  panel  for  the  department 
to  which  distinguished  alumni  were  invited.  It  should  also  be  pointed  out  that  his 
efforts  on  the  fiscal  scene  were  successful. 

My  review  of  Yngve  Ohrn’s  academic  career  is  about  to  be  concluded,  not  for 
lack  of  material,  but  rather  a  sense  of  inadequacy  on  my  part.  The  surge  of  scientific 
effort  that  emanates  from  him  has  assumed  a  new  dimension  in  recent  years.  Time- 
dependent  ways  of  addressing  the  vexing  problems  of  chemical  processes  have  been 
introduced  by  Yngve  and  his  collaborators  [12]  through  novel  and  challenging 
techniques.  By  means  of  coherent  states  and  Lie  group  analysis  he  propagates  his 
systems  in  a  manifestly  N-representable  fashion.  We  will  learn  more  about  this 
from  his  own  mouth  shortly. 
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Abstract 

The  content  of  an  ab  initio  time-dependent  theory  of  quantum  molecular  dynamics  of  electrons  and 
atomic  nuclei  is  presented.  Employing  the  time-dependent  variational  principle  and  a  family  of  approx¬ 
imate  state  vectors  yields  a  set  of  dynamical  equations  approximating  the  time-dependent  Schrodinger 
equation.  These  equations  govern  the  time  evolution  of  the  relevant  state  vector  parameters  as  molecu¬ 
lar  orbital  coefficients,  nuclear  positions,  and  momenta.  This  approach  does  not  impose  the  Born- 
Oppenheimer  approximation,  does  not  use  potential  energy  surfaces,  and  takes  into  account  electron- 
nuclear  coupling.  Basic  conservation  laws  are  fully  obeyed.  The  simplest  model  of  the  theory  employs 
a  single  determinantal  state  for  the  electrons  and  classical  nuclei  and  is  implemented  in  the  computer 
code  ENDyne.  Results  from  this  ab-initio  theory  are  reported  for  ion-atom  and  ion-molecule  collisions. 
©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Physical  measurements  on  molecular  systems  involve  the  use  of  a  probe  and 
study  the  change  of  the  system  or  the  probe.  Such  change  implies  an  evolving 
molecular  state,  which  at  least  in  principle  can  be  obtained  as  a  solution  to  the 
time-dependent  Schrodinger  equation.  This  is  the  fundamental  equation  that  gov¬ 
erns  the  dynamics  of  electrons  and  atomic  nuclei,  the  building  blocks  of  molecular 
systems.  Quantum  chemistry  and  molecular  dynamics  often  focus  on  the  stationary 
states,  which  are  solutions  to  the  time-independent  Schrodinger  equation.  Fur¬ 
thermore,  the  coupling  of  electronic  and  nuclear  dynamics  is  forced  to  obey  the 
adiabatic  approximation  thereby  leading  to  the  theoretical  constructs  of  elec¬ 
tronic  adiabatic  states,  corresponding  potential  energy  surfaces,  and  nonadiabatic 
coupling  terms. 

The  stationary  states  form  a  basis  in  which  the  evolving  states  can  be  expressed 
and  interpreted.  However,  their  calculation  could  be  an  unnecessary  and  costly 
detour  in  order  to  obtain  the  evolving  states  that  describe  molecular  events.  Sta¬ 
tionary  electronic  states  and  their  associated  potential  energy  surfaces  have  occupied 
quantum  chemistry  and  molecular  dynamics  since  the  beginning  of  quantitative 
theory.  Yet,  one  knows  of  no  fundamental  reason  why  a  molecular  event,  other 
than  in  exceptional  cases,  should  take  place  on  a  potential  energy  surface.  Possibly, 
a  few  potential  energy  surfaces  with  associated  coupling  terms  would  give  a  rea¬ 
sonable  description.  However,  their  pointwise  determination  is  an  extremely  costly 
process,  the  fitting  of  them  and  the  coupling  terms  to  suitable  analytical  forms 
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frought  with  uncertainties,  and  the  dynamics  on  even  two  or  three  such  surfaces 
only  possible  for  the  simplest  of  systems. 

Therefore,  a  method  that  can  generate  accurate  evolving  states  without  resorting 
to  the  expensive  generation  of  potential  energy  surfaces  would  be  important.  In 
this  article  we  review  some  of  the  accomplishments  of  the  end  (Electron  Nuclear 
Dynamics)  [1]  theory,  which  is  such  an  approach  that  treats  the  full  electronic- 
nuclear  dynamics  with  all  coupling  terms.  This  method  generates  equations  of 
motion  that  approximate  the  time-dependent  Schrodinger  equation.  Their  solution 
involves  time  integration  with  often  small  steps  because  of  the  mixture  of  fast 
(electronic)  and  slow  (nuclear)  time  scales  and  might  be  considered  computationally 
demanding.  However,  when  the  cost  of  generating  accurate  potential  energy  surfaces 
and  their  coupling  terms  is  included  in  the  overall  accounting  of  dynamical  methods 
employing  them,  their  advantage  over  end  in  computational  effort  disappears. 

Time-dependent  theoretical  methods  are  emerging  important  tools  for  the  study 
of  molecular  processes.  Explicit  consideration  of  the  time  parameter  makes  it  pos¬ 
sible  to  follow,  say,  the  detailed  evolution  of  a  reactive  event  from  reactants  to 
products  and  leads  naturally  to  the  theoretical  determination  of  transition  proba¬ 
bilities  and  rates.  Since  the  generation  of  a  quality  surface  is  such  a  daunting  task 
for  a  many-atom  system,  one  often  resorts  to  the  construction  of  empirical  and 
semi-empirical  surfaces  as  the  diatomics  in  molecule  (DIM)  surfaces.  Although 
inexpensive  to  generate,  such  surfaces  are  not  accurate  for  the  study  of  detailed 
dynamics.  If  only  a  single  such  surface  is  used,  as  is  often  the  case,  then  the  dynamics 
of  the  electrons  is  neglected.  Only  by  incorporating  two  or  more  surfaces  and  their 
couplings  is  the  effect  of  electronic  dynamics  included  [2]. 

In  spite  of  these  difficulties  electronic-nuclear  dynamics  are  being  studied  with 
the  use  of  potential  energy  surfaces.  The  surface  hopping  model  [  3  ]  has  seen  some 
success,  but  needs  additional  nondynamical  assumptions  about  the  electronic- 
nuclear  coupling.  Dynamics  on  potential  energy  surfaces  with  conical  intersection 
[4],  and  wave-packet  dynamics  on  vibrationally  coupled  dissociative  potential  en¬ 
ergy  surfaces  [5,6]  have  been  carried  out  with  interesting  results.  Multiconfiguration 
time-dependent  self-consistent  field  approximations  for  dynamics  with  curve  cross¬ 
ings  [  7  ]  have  been  developed  and  applied  with  promising  results.  Direct  integration 
methods  for  the  time-dependent  Schrodinger  equation  [  8  ]  have  been  developed 
and  applied  also  for  dynamics  with  two  surfaces.  However,  the  direct  methods  using 
numerical  grid  techniques  are  so  far  limited  to  dynamical  problems  in  three  or 
fewer  dimensions. 

There  are  approaches,  such  as  the  Close-Coupling  method  [9,10]  used  in  study 
of  atomic  and  molecular  collisions,  where  electronic  dynamics  is  considered.  How¬ 
ever,  these  methods  employ  nondynamical  straight  line  or  Coulomb  nuclear  tra¬ 
jectories  and  ignore  the  details  of  the  nuclear  dynamics.  While  fixed  trajectories 
work  well  for  collisional  phenomena  at  energies  above  1  keV/amu,  they  lead  to 
spurious  dynamics  at  lower  collision  energies. 

Another  body  of  work  has  its  origin  in  the  approach  to  simulating  annealing  by 
Car  and  Parrinello  [11].  This  approach  has  been  generalized  to  study  dynamics  of 
molecular  systems  or  clusters  [12,13,14]  and  uses  a  fictitious  mass  associated  to 
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the  molecular  orbital  coefficients  to  generate  newtonian-like  equations  for  them  in 
time.  The  method  attempts  to  follow  the  lowest  potential  energy  surface  of  the 
system,  without  having  to  perform  an  electronic  optimization  at  each  new  nuclear 
geometry.  Thus  it  is  very  much  like  dynamics  on  a  single  potential  energy  surface, 
but  without  the  cost  of  generating  the  full  surface. 

The  END  (Electron  Nuclear  Dynamics)  theory  of  Ohm,  Deumens,  et  al.  [1,15- 
17]  uses  the  time-dependent  variational  principle  (tdvp)  to  generate  equations 
that  approximate  the  time-dependent  Schrodinger  equation  and  govern  the  time 
evolution  of  electronic  and  nuclear  dynamical  variables  on  a  generalized  phase 
space.  The  choice  of  trial  wave  function  for  the  electrons  and  the  nuclei  determines 
the  level  of  approximation.  Both  electrons  and  nuclei  are  treated  fully  dynamically 
including  coupling  terms.  The  dynamics  of  participating  nuclei  and  electrons  is 
subject  to  the  instantaneous  forces  due  to  the  full  Coulomb  interactions  and  there 
is  no  need  to  generate  potential  energy  surfaces  and  associated  stationary  states. 

A  related  method  by  Runge  and  Micha  [18,19]  uses  the  Frenkel  variational 
principle  to  derive  equations  for  the  electronic  degrees  of  freedom  and  then  use  the 
eikonal  approximation  for  the  nuclei  moving  in  the  potential  given  by  the  electrons 
and  the  nuclear  Coulomb  repulsion.  This  method  has  been  applied  to  proton- 
hydrogen  and  hydrogen-hydrogen  collisions. 

The  simplest  level  of  the  end  theory  employs  a  single  determinant  for  the  electrons 
and  the  classical  limit  for  the  nuclei.  This  level  of  approximation  has  been  imple¬ 
mented  in  a  computer  code  ENDyne  [20].  In  spite  of  its  simplicity  this  level  of 
theory  yields  accurate  results  for  transition  probabilities,  differential  and  total  cross 
sections  for  a  great  variety  of  ion-atom  and  ion-molecule  reactive  collisions.  The 
end  theory  for  a  multiconfigurational  and  for  an  antisymmetrized  geminal  power 
(AGP)  electronic  wave  function  has  also  been  published  [21,22]. 

The  next  section  gives  an  overview  of  the  end  theory  and  the  conservation  theo¬ 
rems;  the  third  discusses  the  effect  of  the  nonadiabatic  coupling  terms.  If  these 
couplings  are  neglected,  the  calculation  tends  to  follow  a  potential  energy  surface, 
in  a  Car-Parinello-like  fashion.  An  example  shows  that  dynamics  without  these 
terms  are  in  disagreement  with  experiment.  When  the  couplings  are  included  agree¬ 
ment  is  excellent. 


END  Theory 

The  details  of  the  end  theory  [16]  and  the  associated  ENDyne  code  [17]  for 
the  simplest  level  of  theory  are  published,  so  it  suffices  here  to  give  a  summary  sur¬ 
vey  to  show  the  fundamental  generality  and  pleasing  structure  of  the  dynamical 
equations. 

The  tdvp  [23]  for  a  full  quantum  approach  requires  stationarity  of  the  action 
with  the  Lagrangian 


L  = 


<*l*> 


(1) 
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where  H  is  the  full  molecular  Hamiltonian  and  ^  the  electronic-nuclear  many- 
body  wave  function.  The  derivative  d/dt  acts  on  the  bra.  The  choice  of  trial  wave 
function  determines  the  details  of  the  dynamical  equations.  Representing  the  nuclei 
by  Gaussian  wave  packets  in  the  limit  of  zero  widths  results  in  a  classical  description 
with  nuclear  masses  M,  positions  R ,  and  momenta  P  equivalent  to  employing  the 
Lagrangian 


+ 


l{d__ 
2  \dt 


HeI 


(2) 


where  the  electronic  Hamiltonian  Hei  includes  the  nuclear-nuclear  repulsion.  The 
end  theory  even  admits  a  mixed  approach  with  some  of  the  nuclei  treated  by 
classical  and  others  by  quantum  mechanics. 

The  full  quantum  tdvp  approach  with  a  completely  general  form  of  trial  function 
yields  the  time-dependent  Schrodinger  equation.  Restricting  the  trial  wave  function 
to  a  particular  form  leads  to  a  set  of  coupled  first-order  differential  equations  [23,16] 
governing  the  time  evolution  of  the,  in  general,  complex  wave-function  parameters. 
The  parameters  assume  the  role  of  dynamical  variables  and  form  a  generalized 
phase  space.  This  phase  space  is  not  flat  but  has  a  nonunit  metric  and  results  in  a 
Hamiltonian  form  of  dynamical  equations.  The  choice  of  wave-function  parameters 
becomes  critical  to  ensure  nonredundancy  and  continuous  trajectories. 

For  the  case  of  a  single  determinantal  TV-electron  state  vector  Thouless  [24]  has 
provided  the  suitable  parametrization.  A  Thouless  determinant  |  z)  is  an  antisym¬ 
metric  product  of  nonorthogonal  spin  orbitals 

K 

Xh  =  <pfi  T  2  Zphfip  ,  (3) 

p~N+ 1 


which  in  end  are  expressed  in  terms  of  an  atomic  orbital  basis  set,  {fa;  i  =  1,  K} , 
centered  on  the  nuclei.  The  molecular  orbital  coefficients  { zph }  and  their  complex 
conjugates  are  the  electronic  dynamical  variables.  This  form  of  wave  function  has 
the  capacity  to  become  any  determinantal  iV-electron  wave  function  in  the  basis  as 
the  coefficients  assume  different  values.  The  dynamical  equations  describing  particles 
with  widely  different  time  scales  are  stiff  and  as  they  are  integrated  in  rather  short 
discrete  time  steps  the  z-coefficients  change  producing  at  each  instant  a  different 
determinantal  function.  This  turns  out  to  be  a  surprisingly  accurate  description  of 
the  electronic  degrees  of  freedom. 

A  Thouless  determinant  is  an  example  of  a  coherent  states  in  the  theory  of  Lie 
groups  [25].  The  z-coefficients  constitute  a  complex,  continuous,  nonredundant 
parametrization  and  this  parameter  space  also  has  a  positive  measure  dii(z)  such 
that  the  resolution  of  the  identity 


/  =  J  \z)(z\dn(z) 


(4) 


exists. 
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They  are  derived  from  the  overlap 

S(z*,R\z,R)  =  (z-,R'\z;R)  (8) 

of  determinants  at  geometries  R  and  Rl. 

The  END  theory  yields  a  dynamics  that  satisfies  the  conservation  laws  of  important 
physical  quantities,  as  total  energy,  total  momentum,  and  total  angular  momentum 
[16,17].  The  important  conservation  of  total  linear  and  angular  momentum  only 
holds  when  the  electron-nuclear  coupling  terms  are  included. 


Electron-Nuclear  Coupling 

The  dynamical  metric  of  Eq.  ( 5 )  couples  the  electronic  and  nuclear  degrees  of 
freedom.  The  third  row  of  the  metric  matrix  has  coupling  terms  that  ensure  the 
conservation  of  total  momentum  [16].  The  theory  must  satisfy  such  conservation 
laws  to  be  able  to  produce  accurate  detailed  information  about  nuclear  and  electronic 
quantities.  One  might  think  that  at  higher  energies  the  momentum  of  the  nuclei 
overwhelms  the  contribution  from  the  electrons  when  properties  are  computed, 
but  that  this  is  not  the  case  is  shown  by  our  results. 

The  first  row  of  the  metric  matrix  couples  the  electronic  degrees  of  freedom  to 
the  nuclear  velocities.  These  coupling  terms  make  up  the  matrix  representation  of 
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the  translation  operator  and  provide  the  transformation  of  the  z-coefficients  as  the 
centers  of  the  basis  functions  move  from  one  instant  to  the  next.  One  could,  alter¬ 
natively,  use  a  basis  with  explicit  electron  translation  factors  [26]  exp(-ivA-r), 
where  vA  is  the  velocity  of  center  A,  but  in  its  full  implementation  this  would 
require  extension  of  current  integral  codes. 

Time-independent  approaches  introduce  nonadiabatic  coupling  by  expanding 
the  full  wave  function  in  terms  of  adiabatic  states  with  each  term  being  a  product 
of  an  eigenstate  of  the  electronic  Hamiltonian  and  a  nuclear  wave  function.  The 
coupling  terms  arise  from  the  nuclear  kinetic  energy  operator  acting  on  the  electronic 
adiabatic  states,  which  are  parametrically  dependent  on  the  nuclear  positions.  Such 
coupling  terms  are  absent  when  the  full  wave  function  is  expanded  in  terms  of 
diabatic  states  with  each  term  being  a  product  of  an  electronic  state  independent 
of  nuclear  coordinates  and  a  nuclear  wave  function.  Similarly,  in  time-dependent 
treatments  coupling  terms  emerge  from  using  electronic  basis  functions  that  depend 
on  nuclear  coordinates,  and  possibly  also  on  nuclear  velocities.  These  coupling 
terms  are  absent  when  the  electronic  basis  is  fixed  in  space,  as  is  the  case  for  numerical 
grids  fixed  in  the  lab  frame.  The  dynamical  metric  matrix  in  Eq.  ( 5 )  would  in  such 
a  case  have  no  coupling  terms. 

The  influence  of  the  nonadiabatic  coupling  terms  in  Eq.  (5)  is  in  many  cases 
essential  in  order  to  describe  even  a  qualitatively  correct  dynamics  [27  ] .  Neglecting 
the  coupling  terms  in  end  generates  dynamical  equations  identical  to  those  used 
by  others  [28].  Other  approaches  that  neglect  electron-nuclear  coupling  include 
dynamics  on  a  single  potential  energy  surface  and  dynamics  using  fictitious  kinetic 
energy  terms  to  force  the  dynamics  close  to  a  surface  as  does  the  method  of  Car 
and  Parrinello  [1 1,13,14].  Part  of  the  coupling  effects  can  be  handled  via  electron 
translation  factors,  as  is  done  by  Micha  and  Runge  [19] ,  but  that  still  accounts  for 
only  part  of  the  coupling.  A  dynamic  treatment  such  as  the  Close-Coupling  method 
[  29,9  ]  uses  electron  translation  factors,  but  limits  the  dynamics  to  fixed  trajectories, 
which  also  neglects  full  electron-nuclear  coupling  terms. 

Total  cross  sections  may  not  be  too  sensitive  to  the  neglect  of  electron-nuclear 
coupling  terms,  but  differential  cross  sections  and  details  such  as  the  presence  or 
absence  of  rainbow  scattering,  state-to-state  transition  probabilities,  branching  of 
reaction  channels,  etc.  cannot  be  predicted  without  accounting  for  the  full  coupling. 

In  Figure  1  the  electron  transfer  probability  at  250  eV  for  the  H+  +  H  system 
as  a  function  of  scattering  angle  is  depicted.  Results  from  two  experiments  [30,31] 
are  compared  to  the  END  using  a  pVDZ  basis.  Experiments  have  a  finite  angular 
resolution.  In  Ref.  [30]  the  resolution  is  reported  to  be  Ad  =  0.4°  while  Ref.  [31] 
reports  A0  =  0.6°.  END  calculations  were  averaged  to  reflect  the  finite  angular 
resolutions  of  the  two  experiments.  This  was  done  by  calculating  probabilities  suf¬ 
ficiently  close  together  (at  some  points  every  0.02°  apart)  and  then  averaging  the 
probabilities  inside  ±A0/2  of  a  given  angle  0.  It  is  interesting  to  note  that  the 
experimental  values  reported  in  Ref.  [30]  are  actually  more  consistent  with  an 
angular  resolution  of  1°  rather  than  0.4°.  The  results  by  Helbig  and  Everhart  are 
in  excellent  agreement  with  END  for  the  maxima,  but  the  minimum  value  is  too 
high.  This  is  most  likely  due  to  some  experimental  problem  since  the  oscillations 
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Scattering  Angle  (degrees) 


Figure  1 .  Transfer  probability  versus  scattering  angle  for  p  +  H  collision  at  250  eV.  END 
using  pVDZ  basis  and  three  averaged  results  using  an  angular  resolution  ±A0  given  in 
parenthesis.  Experiments:  Exp  I:  Ref.  [30],  Exp  2:  Ref  [31]. 


should  be  symmetric  about  a  probability  of  0.5  [  30] .  Other  sources  of  error  in  the 
experimental  results  are  the  determination  of  the  dissociation  fraction  (H/H2  in 
the  collision  region )  and  the  calibration  necessary  between  the  different  processes 
(elastic,  charge  transfer).  Another  problem  is  that  as  the  scattering  angle  grows, 
the  number  of  scattered  particles  falls  very  steeply.  This  actually  shifts  the  minima 
and  maxima  towards  larger  angles,  as  seen  in  the  results  for  Houver  et  al.  at  angles 
larger  than  4°.  This  is  because  more  particles  are  measured  coming  from  the  -  A0 
than  +A 6  range  about  a  given  angle  0.  With  these  considerations,  one  can  conclude 
that  the  agreement  between  theory  and  experiment  is  excellent. 

In  Figure  2  the  end  results  for  the  same  transfer  probability  (using  perfect  res¬ 
olution)  is  compared  to  the  results  that  completely  neglect  the  electron-nuclear 
coupling  and  one  for  which  the  couplings  in  the  electronic  part  only  (i.e.,  the  first 
row  of  the  metric  of  Eq.  ( 5 )  is  retained.  The  latter  means  that  there  is  no  conservation 
of  total  momentum.  It  is  clear  that  the  neglect  of  the  coupling  terms  leads  to  wrong 
behavior  with  increasing  scattering  angle;  either  incorrect  number  of  oscillations 
or  shifted  positions  of  the  peaks. 

Reactive  Collisions 

ENDyne  calculations  of  total  and  differential  cross  sections  of  ion-atom  collisions 
such  as  proton /hydrogen,  proton /helium,  proton /lithium,  alpha/helium  [16,32- 
34]  with  rather  modest  Gaussian  basis  sets  have  shown  excellent  agreements  with 
an  abundance  of  experimental  data. 

For  instance,  proton /  helium  collisions  have  been  studied  [32]  in  great  detail  for 
a  variety  of  collision  energies  from  500  to  5,000  eV.  The  classical  differential  cross 
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Figure  2.  Transfer  probability  versus  scattering  angle  for  p  +  H  collision  at  250  eV,  using 
pVDZ  basis.  Solid  line  same  as  in  Figure  1.  Dotted  line:  without  any  electron-nuclear 
coupling.  Dashed  line:  neglecting  momentum  conserving  couplings. 


sections  for  electron  transfer  and  elastic  processes  are  calculated  using  a  pVDZ 
basis  set  with  excellent  agreement  with  experiment  for  rainbow  angles.  The  differ¬ 
ential  elastic  cross  sections  with  a  simple  semiclassical  correction  are  indistinguish¬ 
able  from  the  experimental  results. 

Ongoing  work  for  small  collisional  systems  at  energies  from  a  few  to  a  fraction 
of  an  eV  yields  similarly  encouraging  results.  Since  END  imposes  no  restrictions  on 
the  dynamics,  beyond  that  of  basis  sets  and  wave-function  form,  basically  all  chan¬ 
nels  accessible  from  given  initial  conditions  can  be  studied.  The  analysis  of  processes 
involving  ionization  with  one  or  more  electron  departing  are  just  being  started. 
Processes  involving  ion-molecule  collisions  with  bond  breaking  and  bond  formation 
are  readily  accessible  with  end. 

Proton-hydrogen  molecule  collisions  at  30  eV  have  been  the  subject  of  detailed 
experimental  investigation  [35].  There  have  been  several  attempts  to  explain  the 
experiments  using  semi-classical  [35,36]  and  quantum  descriptions  of  the  nuclei 
[37,38].  In  all  these  approaches  dim  surfaces  were  used.  Florescu  et  al.  modified 
the  surfaces  to  achieve  quantitative  agreement  with  experiment  [36].  Preliminary 
results  from  end  show  that  the  rainbow  angle  is  calculated  more  accurately  than 
by  using  dim  surfaces  and  the  approximations  inherent  in  the  Trajectory  Surface 
Hopping  (TSH)  [35]  and  infinite  order  sudden  approximation  (IOSA)  [37,38] 
approaches. 

Experiment  puts  the  primary  rainbow  angle  for  charge  transfer  in  the  H+  +  H2 
collision  at  30  eV  between  6°  and  7°.  The  TSH  and  quantal  IOSA  approach  methods 
place  it  at  10°  [35,37,38].  Preliminary  end  results  place  the  primary  rainbow  to 
be  near  8.5°,  with  new  calculations  lowering  this  result.  The  primary  rainbow  angle 
for  elastic  and  vibrationally  excited  collisions  is  also  observed  to  be  between  6°  and 
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7°.  The  other  approaches  show  a  shift  of  approximately  2.5°  to  larger  angles,  while 
preliminary  END  results  show  a  value  of  7.5°  for  the  rainbow  angle. 

Table  I  shows  the  different  channels  for  the  proton  on  H2  collision  as  identified 
by  ENDyne  (no  differentiation  among  different  vibrational  states  is  made  here).  It 
is  clear  from  these  results  that  the  assumptions  made  in  the  iosa  and  the  use  of  a 
dim  surface  for  this  system  are  somewhat  questionable.  The  orientation  of  the 
system  is  important  in  defining  the  outcome  of  a  collision.  It  is  more  important 
for  small  impact  parameters  than  for  large  ones.  Furthermore,  dissociation  is  not 
accounted  for  in  the  IOSA  approach. 

The  total  breakup  channels  are,  of  course,  not  accessible  by  the  close-coupling 
theory  or  any  other  approach  using  the  so-called  impulse  approximation  that  does 
not  permit  the  nuclear  arrangement  of  the  molecule  to  adjust  during  the  collision. 
Some  channels  for  the  smallest  impact  parameters  correspond  to  nuclear  transfer 
and  should,  strictly  speaking  not  be  included  in  the  electron  transfer  cross  section. 
It  is  almost  impossible,  however,  to  separate  experimentally  these  processes  from 
the  transfer,  since  the  outgoing  proton  or  hydrogen  atom  have  almost  the  same 
energy  as  the  incoming  proton. 

Therefore  it  is  essential  to  have  a  fully  dynamical  theory  accounting  for  electron 
nuclear  coupling  as  the  end  in  order  to  get  all  accessible  product  channels  and  to 
interpret  the  experiment  properly.  Artificial  restrictions  on  the  electron-nuclear 
dynamics  by  prescribed  trajectories,  by  frozen  nuclear  configurations,  by  forcing 
the  dynamics  to  take  place  on  a  potential  energy  surface,  or  on  several  surfaces 
with  ad  hoc  hopping  introduce  nonphysical  dynamics  and  errors  that  are  hard  to 
control. 

Another  system  under  study  is  the  low  energy  collisions  of  Hj  with  H2.  This 
system  exhibits  much  of  the  complexity  of  a  general  molecular  collisional  system 
and  there  are  no  accurate  potential  energy  surfaces  available.  For  the  few  theoretical 
dynamics  studies  that  have  been  performed  on  this  system  [39,40]  approximate 
potential  energy  surfaces  are  used,  and  the  agreement  with  experiment  is  wanting. 
Preliminary  results  for  END  using  a  Is,  2s  basis  for  collinear  initial  conditions  with 
a  single  impact  parameter  (i.e.,  assuming  spherically  symmetric  molecular  species) 

Table  I.  Product  channels  for  various  orientations  and  their  impact  parameter  ranges  for  H+  +  H2 
collisions  at  30eV.  0  is  the  angle  formed  between  the  initial  velocity  vector  of  the  proton  (defined  as  the 
z  axis)  and  the  H2  axis.  The  initial  orientation  of  the  H2  molecule  with  respect  to  the  x-z  plane  is  0.  The 
impact  parameter  b  is  zero  at  the  CM  of  H2  and  increases  along  the  x  axis.  A  range  of  probabilities  is 
shown  for  the  charge  transfer  channel. 


Charge  transfer 

(O,  4.) 

Dissociation 

Proton  exchange 

(probability) 

(0, 0) 

— 

0.0  <  b  <  0.3 

0.3  <  b  (0.4-0.0) 

(45,  0) 

0.65  <b<  1.2 

0.0  <  b  <  0.65 

1.2  <b  (0.24-0.0) 

(90,  0) 

0.0  <  b  <  0.6 

1.1  <b  <  1.35 

0.6  <b<  1.1 

1.4  <  b  (0.26-0.0) 

(90,  45) 

0.0  <  b  <  0.4 

— 

0.4  <  b  (0.2-0.0) 
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show  good  agreement  with  experimental  total  cross  section  for  the  formation  of 
H3  +  H.  Further  studies  are  being  carried  out  beyond  these  simple  assumptions 
for  the  initial  conditions  including  all  channels. 
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Abstract 

Electron  propagator  calculations  have  been  executed  with  a  semi-direct  algorithm  that  generates  only 
a  subset  of  transformed  electron  repulsion  integrals  and  that  takes  advantage  of  Abelian  point  group 
symmetry.  Diagonal  self-energy  expressions  that  are  advantageous  for  large  molecules  are  employed. 
Illustrative  calculations  with  basis  sets  in  excess  of  200  functions  include  evaluations  of  the  ionization 
energies  of  Cy~  and  Zn(C5H5)2.  In  the  former  application,  a  bound  dianion  obtains  for  a  D3h  structure. 
In  the  latter,  many  final  states  of  the  same  symmetry  are  calculated  without  difficulty.  ©  1994  John  Wiley 
&  Sons,  Inc. 


Introduction 

The  scope  of  applications  for  electron  propagator  techniques  [1-7]  continues  to 
expand  in  several  directions.  Open  shell  species  can  be  examined  with  multicon- 
figurational,  self-consistent  field  reference  states  [  8  ]  and  with  renormalized  corre¬ 
lation  corrections  to  unrestricted  Hartree-Fock  reference  states  [9,10].  Evaluation 
of  final  state  properties  and  total  energies  has  been  facilitated  by  calculation  of 
electron  binding  energy  gradients  and  by  reference  state  energy  gradients  corre¬ 
sponding  to  contour  integral  expressions  for  total  energies  [11,12].  The  interpretative 
advantages  of  the  one-electron  picture  provided  by  electron  propagator  theory  have 
provided  insights  into  the  structure  and  bonding  of  a  variety  of  molecules  [13]. 
Direct  and  semi-direct  methods  for  Hartree-Fock  and  correlated  calculations  have 
enabled  applications  to  increasingly  larger  systems  [14]  and  can  be  adapted  to 
propagator  methods.  In  this  contribution,  semi-direct  algorithms  for  calculating 
self-energy  corrections  to  uncorrelated  results  are  employed  for  the  evaluation  of 
ionization  energies  of  C7-  and  Zn(C5H5)2.  This  offering,  in  honor  of  Professor 
Yngve  Ohrn’s  60th  birthday,  owes  a  debt  to  his  pioneering  work  in  propagator 
methods  of  quantum  chemistry. 


Methods 

Nondiagonal  self-energy  calculations  at  the  second  order  and  3+  levels  [  15]  are 
carried  out  for  certain  test  calculations  on  C?-.  These  methods  are  integrated  into 
the  Gaussian  program  suite  [  16]  in  a  modified  version  of  ept90  [17]. 
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Extensive  program  revisions  have  been  applied  to  diagonal  self-energy  calculations 
in  third  order  and  in  the  OVGF  [18]  approximation.  Abelian  point  group  symmetry 
is  used  in  the  calculation  of  second  and  third  order  self-energy  diagrams.  A  semi- 
direct  procedure,  where  transformed  electron  repulsion  integrals  are  retained  only 
if  they  contain  at  least  one  occupied  orbital  index,  has  been  employed.  The  most 
difficult  portion  of  the  calculation  pertains  to  the  evaluation  of  the  third  order  self¬ 
energy  diagram  with  a  four-particle  ladder  vertex.  For  the  pth  diagonal  element  of 
the  self-energy  matrix,  this  diagram  is 

y  ( pi\\ab)(ab\\cd)(cd\\pi ) 

4  iabcd  ( E  +  £‘  ~  £a~  Cb)(E  +  ei  ~  ec  "  *d) 

where  i  is  an  occupied  index  and  a,  b,  c,  d  are  virtual  indices.  It  is  calculated 
without  storage  of  the  block  of  transformed  integrals  with  four  virtual  indices  [  19] . 
Electron  repulsion  integrals  in  the  atomic  basis  are  needed  at  a  certain  intermediate 
point  in  this  calculation,  which  is  similar  to  the  evaluation  of  the  four-particle 
ladder  term  in  third  order  many-body  perturbation  theory  (mbpt)  [20].  While 
previous  versions  of  our  programs  have  assumed  the  availability  of  electron  repulsion 
integrals  in  the  atomic  basis,  the  present  version  instead  generates  these  integrals 
as  they  are  needed.  The  number  of  arithmetic  operations  in  this  step  is  proportional 
to  the  number  of  occupied  orbitals  times  the  fourth  power  of  the  number  of  virtual 
orbitals.  This  scaling  factor  is  the  same  as  that  required  for  partial  integral  trans¬ 
formations  that  have  been  implemented  in  the  Gaussian  suite  for  post-SCF  calcu¬ 
lations.  Use  of  this  semi-direct  algorithm  allows  execution  of  the  applications  dis¬ 
cussed  below  on  an  IBM  RISC  6000,  model  550  workstation  with  64  Mb  main 
memory  and  2  Gb  external  disk.  Diagonal  self-energy  calculations  now  can  be 
performed  routinely  for  basis  sets  with  200-300  basis  functions. 

Ionization  Energies  of  C2~ 

A  recent  experimental  report  on  carbon  cluster  dianions  has  indicated  that  the 
smallest  bound  species  has  seven  carbon  atoms  [21].  Geometry  optimizations  on 
linear  species  have  been  performed  for  C7,  C? ,  and  C2“  at  the  restricted  Hartree- 
Fock  level  with  a  double  £  plus  polarization  basis  [22].  At  the  latter  geometry, 
electron  propagator  theory  (ept)  ionization  energy  calculations  are  executed  with 
a  uhf  reference  state  for  the  3 2^  dianion.  The  3s2pld,  3s3pld,  and  3s2p2d  basis 
sets  of  another  work  [23]  are  employed.  Nondiagonal  second  order  and  nondiagonal 
ept(3+)  [15]  self-energies  generate  the  dianion  ionization  energies  in  Table  I. 
Correlation  effects  decrease  the  ionization  energies  to  the  22^  final  state  of  the 
anion.  Discrepancies  between  second  order  and  ept(3+)  results  are  small;  it  is 
likely  that  the  correlation  treatment  is  not  qualitatively  incorrect.  The  second  basis 
differs  from  the  first  by  the  addition  of  diffuse  p  functions,  but  the  EPT(  3+ )  ionization 
energy  is  still  far  from  being  bound.  Use  of  a  double  polarization  set  in  the  third 
basis  has  but  a  minor  effect  on  the  propagator  results.  Even  with  the  addition  of 
more  diffuse  and  polarization  functions,  it  is  likely  that  the  dianion  is  not  bound 
in  this  linear  structure  at  the  second  order  or  3+  levels. 
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Table  I.  D^h  Cf  Ionization  energy  (eV). 


Basis 

Koopmans 

EPT  (2) 

EPT  (3+) 

3s2pid 

-0.459 

-1.790 

-1.789 

3s3pld 

-0.005 

-1.432 

-1.265 

3s2p2d 

-0.378 

-1.656 

-1.603 

A  more  promising  species  exhibits  D3A  symmetry  and  has  been  examined  with 
Hartree-Fock  and  configuration  interaction  techniques  [24].  Three  spokes  of  two 
carbons  each  radiate  from  a  central  carbon  in  this  planar  structure.  Geometry 
optimizations  at  the  second  order  mbpt  level  with  the  3s2pld  basis  produce  a 
distance  of  1 .422  A  between  the  central  carbon  and  its  neighbors,  while  the  distance 
between  the  latter  carbons  and  their  outer  neighbors  is  1.284  A.  Examination  of 
molecular  orbitals  indicates  that  the  dianion  can  be  considered  isoelectronic  with 
a  D3/,  C(CN)3  structure,  where  the  cationic,  three-coordinate  carbon  at  the  center 
is  covalently  bound  to  three  cyanide  groups.  Diagonal  self-energy  calculations  are 
performed  at  a  geometry  optimized  with  second  order  mbpt  and  the  3s2pld  basis. 
These  calculations  have  been  performed  on  three  final  states  with  the  3s3pld  basis. 
At  the  Koopmans  level,  the  dianion  is  bound  with  respect  to  the  anion  at  this 
geometry.  Correlation  effects  destabilize  the  dianion  with  respect  to  the  anion,  but 
the  former  species  is  still  bound  by  0.4  eV.  The  quality  of  the  basis  is  improved  by 
replacing  the  double  correlation-consistent  [25]  component  of  the  3s3pld  basis 
with  its  triple  £  counterpart.  The  results  are  summarized  in  Table  II.  The  dianion 
is  clearly  bound  with  respect  to  the  anion.  Pole  strengths  (P)  are  close  to  0.9, 
indicative  of  the  validity  of  the  perturbative  and  diagonal  self-energy  arguments 
underlying  these  calculations.  Additional  correlation  effects  would  probably  diminish 
the  predicted  ionization  energy  of  the  dianion,  but  augmented  basis  sets  are  likely 
to  have  the  opposite  effect.  An  estimate  of  0.6  eV  for  the  vertical  ionization  energy 
of  the  dianion  strongly  suggests  that  the  D3*  structure  is  responsible  for  the  stability 
of  the  heptatomic  species. 


Table  II.  D3h  Cf  Ionization  energies  (eV). 


Final  state 

Koopmans 

3rd  order 

OVGF 

P 

[3s3pld]  Basis  Set:  1 19  AOs 

2A'2 

0.97 

0.76 

0.40 

.88 

2E" 

1.07 

0.87 

0.62 

.88 

2E' 

1.93 

1.57 

1.12 

.87 

[4s4p2d  1  f]  Basis  Set:  231  AOs 

2A'2 

1.03 

0.92 

0.55 

.87 

2E " 

1.11 

1.04 

0.79 

.87 

iE> 

1.98 

1.74 

1.28 

.87 
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Ionization  Energies  of  Zn(C5H5)2 

Metallocenes  are  among  the  most  important  structural  motifs  in  organometallic 
chemistry.  Their  photoelectron  spectra  have  been  extensively  studied  [26].  As  a 
test  of  the  semidirect  algorithms,  a  preliminary  study  was  undertaken  for  Zn(C5H5)2, 
a  closed-shell  species  with  ad10  electron  configuration  on  the  metal  center.  Basis 
sets  were  taken  from  a  previous  study  on  Zn(CH3  )2  [  27  ] .  Geometry  optimizations 
at  the  second  order  mbpt  level  were  performed  without  f  functions  on  Zn.  These 
functions  are  included  in  the  propagator  calculations;  a  total  of  220  contracted 
functions  is  generated  in  this  manner.  At  the  optimized  geometry,  the  Zn-ring 
centroid  distance  is  1.951  A,  the  C-ring  centroid  distance  is  1.222  A,  the  C  H 
distance  is  1.092  A,  and  the  ring  centroid-C  —  H  angle  is  90.7°.  Table  III  displays 
results  for  various  states.  The  order  of  states  from  the  uncorrelated  calculations  is 
preserved  in  the  propagator  calculations.  Pole  strengths  are  0.8  or  0.9  and  indicate 
the  qualitative  validity  of  the  Koopmans  picture  of  the  final  states.  Newton’s  method 
for  finding  the  zeroes  of  a  function  is  used  in  the  pole  searches.  Each  of  these 
searches  converges  within  three  iterations;  there  are  no  difficulties  in  obtaining 
excited  final  states  of  a  given  symmetry  type. 

Conclusions 

Electron  propagator  calculations  on  large  molecules  have  been  facilitated  through 
semi-direct  algorithms  where  only  transformed  electron  repulsion  integrals  with  at 
least  one  occupied  index  are  explicitly  calculated.  Diagonal  approximations  to  the 
self-energy  matrix  have  been  employed.  For  C?  ,  the  D3/,  structure  is  bound  with 
respect  to  the  corresponding  monoanion.  In  Zn(C5H5)2,  it  is  easy  to  obtain  many 
cationic,  final  states  of  a  given  symmetry  through  application  of  ordinary  pole 
search  methods. 


Table  III.  D5h  Zn(C5H5)2  Ionization  energies  (eV). 


Final  state 

Koopmans 

3rd  order 

OVGF 

P 

2E\ 

7.66 

7.59 

7.34 

.90 

2E\ 

9.30 

9.14 

8.88 

.90 

2A'i 

14.08 

12.82 

12.42 

.82 

2E'{ 

14.11 

13.10 

12.54 

.90 

2E'2 

14.13 

13.10 

12.56 

.90 

2E\ 

14.77 

13.59 

13.12 

.90 

2E\ 

14.90 

13.72 

13.26 

.90 

2A\ 

15.39 

14.01 

13.63 

.78 

2A\ 

18.80 

17.32 

16.70 

.86 

2A$ 

18.96 

17.47 

16.92 

.85 

2E'2 

19.90 

18.09 

17.49 

.84 

2E\ 

19.94 

18.12 

17.56 

.84 

ELECTRON  PROPAGATOR  CALCULATIONS 


27 


Acknowledgments 

This  work  was  partially  supported  by  the  National  Science  Foundation  un¬ 
der  Grants  che-9101777  and  CHE-9321434  and  by  the  Petroleum  Research  Fund 
under  Grant  245 12-AC6. 


Bibliography 

[11  J.  Linderberg  and  Y.  Ohrn,  Propagators  in  Quantum  Chemistry  (Academic  Press,  New  York, 
1973). 

[2]  B.  T.  Pickup  and  O.  Goscinski,  Mol.  Phys.  26,  1013  (1973). 

[3]  L.  S.  Cederbaum  and  W.  Domcke,  Adv.  Chem.  Phys.  26,  206  ( 1977). 

[4]  J.  Simons,  Theor.  Chem.  Adv.  Persp.  3,  1  (1978). 

[5]  M.  F.  Herman,  K.  F.  Freed,  and  D.  Yeager,  Adv.  Chem.  Phys.  48,  1  (1981). 

[6]  Y.  Ohrn  and  G.  Born,  Adv.  Quantum  Chem.  13,  1  ( 1981 ). 

[7]  W.  von  Niessen,  J.  Schirmer,  and  L.  S.  Cederbaum,  Comput.  Phys.  Rep.  1,  57  ( 1984). 

[8]  J.  A.  Nichols,  D.  L.  Yeager,  and  P.  Jorgensen,  J.  Chem.  Phys.  80,  293  ( 1984);  J.  T.  Golab  and 
D.  L.  Yeager,  J.  Chem.  Phys.  87,  2925  ( 1987). 

[9]  J.  V.  Ortiz,  Int.  J.  Quantum  Chem.,  Quantum  Chem.  Symp.  25,  35  (1991). 

[10]  J.  V.  Ortiz,  Chem.  Phys.  Lett.  199,  530  (1992). 

[11]  J.  Cioslowski  and  J.  V.  Ortiz,  J.  Chem.  Phys.  96,  8379  (1992). 

[  12]  J.  V.  Ortiz,  Int.  J.  Quantum  Chem.,  Quantum  Chem.  Symp.  26,  1  (1992). 

[13]  See,  for  example,  the  following  references:  J.  V.  Ortiz,  J.  Chem.  Phys.  92 , 6728  ( 1990) ;  J.  V.  Ortiz, 
Chem.  Phys.  Lett.  169,  1 16  ( 1990);  J.  V.  Ortiz,  J.  Am.  Chem.  Soc.  113,  1 102  ( 1991 );  J.  V.  Ortiz, 
J.  Chem.  Phys.  94,  6064  (1991);  J.  V.  Ortiz,  Polyhedron  10,  1285  (1991);  J.  V.  Ortiz,  J.  Am. 
Chem.  Soc.  113,  3593  ( 1991 );  J.  V.  Ortiz  and  J.  W.  Mintmire,  J.  Phys.  Chem.  95,  8609  ( 1991 ); 
J.  V.  Ortiz,  Macromolecules  26,  2989  ( 1993);  J.  V.  Ortiz  and  C.  M.  Rohlfing,  Macromolecules 
26,7282  (1993). 

[14]  P.  R.  Taylor,  Int.  J.  Quantum  Chem.  31 , 521  (1987);  S.  Saebo  and  J.  Almldf,  Chem.  Phys.  Lett. 
154,  83  (1989);  M.  Head-Gordon,  J.  A.  Pople  and  M.  J.  Frisch,  Chem.  Phys.  Lett.  153,  503 
( 1988);  J.  Almlof,  K.  Faegri,  and  K.  Korsell,  J.  Comp.  Chem.  3,  385  ( 1982);  R.  Ahlrichs,  M.  Bar, 
M.  Haser,  H.  Horn,  and  C.  Kolmel,  Chem.  Phys.  Lett.  162,  165  ( 1989);  S.  Saebo  and  J.  Almldf, 
Chem.  Phys.  Lett.  154,  83  (1989);  M.  Feyereisen,  J.  Nichols,  J.  Oddershede,  and  J.  Simons,  J. 
Chem.  Phys.  96,2978  (1992). 

[15]  J.  V.  Ortiz,  J.  Chem.  Phys.  99,  6716  ( 1993). 

[16]  Gaussian  90,  Revision  F,  M.  J.  Frisch,  M.  Head-Gordon,  G.  W.  Trucks,  J.  B.  Foresman,  H.  B. 
Schlegel,  K.  Ragavachari,  M.  Robb,  J.  S.  Binkley,  C.  Gonzalez,  D.  J.  Defrees,  D.  J.  Fox,  R.  A. 
Whiteside,  R.  Seeger,  C.  F.  Melius,  J.  Baker,  R.  L.  Martin,  L.  R.  Kahn,  J.  J.  P.  Stewart,  S.  Topiol, 
and  J.  A.  Pople,  Gaussian,  Inc.,  Pittsburgh,  PA,  1990. 

[17]  J.  V.  Ortiz,  EPT90,  an  ab  initio  electron  propagator  program;  J.  V.  Ortiz,  Int.  J.  Quantum  Chem., 
Quantum  Chem.  Symp.  23,  321  (1989). 

[18]  L.  S.  Cederbaum,  W.  Domcke,  and  W.  von  Niessen,  J.  Phys.  B  10,  2963  (1977). 

[19]  J.  A.  Pople,  J.  S.  Binkley,  and  R.  Seeger,  Int.  J.  Quantum  Chem.,  Quantum  Chem.  Symp.  10,  1 
(1976). 

[20]  R.  J.  Bartlett,  Ann.  Rev.  Phys.  Chem.  32,  359  ( 1981 ),  and  references  therein. 

[21  ]  S.  N.  Schauer,  P.  Williams,  and  R.  N.  Compton,  Phys.  Rev.  Lett.  65,  625  ( 1990). 

[22]  J.  Watts  and  R.  J.  Bartlett,  J.  Chem.  Phys.  97,  3445  (1992). 

[23]  V.  G.  Zakrzewski  and  J.  V.  Ortiz,  J.  Chem.  Phys.  100,  6614  (1994). 

[24]  T.  Sommerfeld,  M.  K.  Scheller,  and  L.  S.  Cederbaum,  Chem.  Phys.  Lett.  209,  216  ( 1993). 

[25]  T.  H.  Dunning,  J.  Chem.  Phys.  90,  1007  (1989). 

[26]  J.  C.  Green,  Struct.  Bonding  (Berlin)  43,  37  ( 1981 ). 

[27]  V.  G.  Zakrzewski  and  J.  V.  Ortiz,  J.  Chem.  Phys.  100,  6508  (1994). 


Received  June  2,  1994 


Treatment  of  Molecular  Resonances  Using  the 
Bi-Orthogonal  Dilated  Electron  Propagator 
with  Application  to  the  2IIg  Shape 
Resonance  in  c-N2  Scattering 

MILAN  N.  MEDIKERI*  and  MANOJ  K.  MISHRA 

Department  of  Chemistry,  Indian  Institute  of  Technology,  Powai,  Bombay  400  076,  India 


Abstract 

Preliminary  results  from  the  first  application  of  the  bi-orthogonal  dilated  electron  propagator  to  the 
treatment  of  molecular  resonances  are  presented  for  the  energy  and  the  width  of  the  2ng  shape  resonance 
in  e-N2  scattering.  The  corresponding  resonant  Feynman-Dyson  amplitudes  (fdas)  are  plotted  to  get  a 
quantitative  affirmation  of  the  topology  of  the  lowest  unoccupied  molecular  orbital  (lumo)  for  N2.  It  is 
shown  that  a  plot  of  the  resonant  fdas  offer  new  insights  into  the  role  of  correlation  in  the  formation 
and  decay  of  molecular  shape  resonances.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

It  is  a  great  pleasure  to  be  able  to  contribute  to  the  special  session  on  Propagator 
and  Resolvent  theory  honoring  Professor  Yngve  Ohm.  One  of  us  (M.  K.  M.)  did 
his  Ph.D.  under  him  and  fondly  recollects  his  gentle  guidance,  meticulous  teaching, 
and  indulgent  support.  Yngve  Ohm  has  been  instrumental  in  establishing  the  elec¬ 
tron  propagator  method  as  a  popular  and  potent  tool  for  molecular  structure  cal¬ 
culations  [1,2]  and  references  to  his  work  are  bound  to  be  strewn  throughout  this 
volume.  The  present  work  is  an  outgrowth  of  the  work  M.  K.  M.  did  for  his  dis¬ 
sertation,  and  we  are  pleased  to  dedicate  the  results  from  this  first  application  of 
the  bi-orthogonal  dilated  electron  propagator  to  the  treatment  of  molecular  reso¬ 
nances  to  Yngve. 

The  biorthogonal  dilated  electron  propagator  technique  [3-5]  has  been  employed 
extensively  for  direct  calculation  of  atomic  Auger  and  shape  resonance  energies 
and  widths  [3-1 1].  A  review  of  the  different  approaches  to  the  construction  of  the 
complex  scaled  electron  propagator  is  offered  by  Mishra  [3].  More  recently,  the 
Feynman-Dyson  amplitudes  (fdas)  of  the  bi-orthogonal  dilated  electron  propagator 
have  been  used  to  investigate  the  orbital  picture  of  shape  resonance  formation  and 
decay  [11].  It  has  been  argued  [11]  that  since  the  resonances  are  believed  to  result 
from  a  temporary  trapping  of  the  impinging  electron  in  one  of  the  unoccupied 
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orbitals  of  the  target  and  unoccupied  (virtual)  orbital,  energies  of  the  Fock  operator 
can  be  completely  altered  even  with  a  minor  modification  of  the  primitive  basis; 
the  conventional  SCF  procedures  cannot  be  used  for  identifying  the  resonant  orbital. 
However,  the  fda(s)  associated  with  the  resonant  pole(s)  of  the  biorthogonal  dilated 
electron  propagator  based  on  an  underlying  bi-variational  SCF  [12-15]  may  be 
rigorously  and  unequivocally  associated  with  the  resonant  orbital  and  have  been 
successfully  utilized  for  a  quantitative  affirmation  of  the  orbital  picture  of  metastable 
anion  formation  in  atoms  [11]. 

The  lowest  unoccupied  molecular  orbitals  (lumos)  are  central  to  phenomeno¬ 
logical  theories  of  chemical  reactivity.  Although  qualitative  correlation  of  lumos 
with  metastable  anionic  resonances  is  available  in  abundance  [16,17],  a  quantitative 
affirmation  once  again  would  require  an  unequivocal  mechanism  for  the  identifi¬ 
cation  of  the  unoccupied  resonant  orbital  of  the  molecular  target  in  which  the 
impinging  electron  will  get  trapped.  Such  a  characterization  can  be  made  by  iden¬ 
tifying  the  resonant  pole(s)  of  the  dilated  electron  propagator  with  the  resonance 
resulting  from  the  temporary  trapping  of  the  impinging  electron  in  the  LUMO  and 
associating  the  corresponding  fda(s)  with  the  resonant  orbital/LUMO  [1 1].  All  the 
applications  of  the  bi-orthogonal  dilated  electron  propagator,  however,  have  been 
limited  to  the  treatment  of  atomic  resonances  so  far. 

The  2Hg  shape  resonance  in  e-N2  scattering  has  been  the  favorite  test  case  for 
the  study  of  molecular  resonances  [18-26]  and  the  topology  of  the  \irg  lumo  of 
N2  is  a  well-established  textbook  example.  For  these  reasons  we  have  also  chosen 
the  2II g  shape  resonance  in  <?-N2  scattering  for  the  first  application  of  the  bi¬ 
orthogonal  dilated  electron  propagator  to  the  treatment  of  molecular  resonances. 

In  the  next  (second)  section  we  outline  our  method.  In  the  first  part  of  the  third 
section,  we  discuss  the  results  for  energy  and  width  obtained  from  our  second-  and 
zeroth-order  decouplings.  The  fdas  corresponding  to  the  zeroth-  and  second-order 
resonant  poles  correlate  well  with  the  textbook  portrayal  of  the  lirg  N2  lumo,  and 
the  differences  between  the  amplitudes  from  the  two  decouplings  as  also  between 
the  resonant  FDA  on  the  real  line  and  in  the  complex  plane  (for  the  optimal  value 
of  the  dilation  parameter)  are  used  to  elicit  mechanistic  insights  about  the  formation 
and  decay  of  molecular  shape  resonances  in  the  second  part  of  the  third  section. 
A  brief  survey  of  the  main  results  concludes  this  article. 

Method 

The  Dyson  equation  for  the  bi-orthogonal  matrix  electron  propagator  G(rj,  E) 
may  be  expressed  as  [4,5] 

G-\v,  E)  =  Go  E)  -  2(7/,  E) ,  (1) 

where  G0(t),  E)  is  the  zeroth-order  propagator  for  the  uncorrelated  electron  motion 
here  chosen  as  given  by  the  bivariational  SCF  approximation  [12-15]  and  77  =  ae~l6 
is  the  complex  scaling  (dilation)  parameter  (the  method  of  complex  scaling  has 
been  reviewed  in  the  literature  [27a];  there  are  also  some  recent  applications  of 
complex  scaling  [27b]).  The  self-energy  matrix  2(77,  E)  contains  the  relaxation  and 
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correlation  effects  and  may  be  determined  through  perturbative  or  renormalized 
decouplings  [4,5]. 

Solution  of  the  bivariational  SCF  equations  for  the  7V-electron  ground  state  yields 
a  set  of  occupied  and  unoccupied  orbitals  In  terms  of  these  orbitals  the  matrix 

elements  of  Gq1(t},  E)  are 

(Go  \rhE))iJ  =  (E-ci)5ij,  (2) 

where  eh  the  orbital  energy  of  the  zth  orbital,  is  obviously  the  zeroth-order  pole  of 
the  dilated  electron  propagator.  The  corresponding  orbital  is  the  zeroth-order 
fda.  The  propagator  Eq.  ( 1 )  may  be  recast  as 

G-\v,  E)  =  El  -  L  („,  E ),  L  =  £  +  2  ,  (3) 

and  the  poles  of  the  dilated  electron  propagator  are  the  energy-dependent  eigenvalues 
of  L(?7,  E)  given  by 


Hv,  E)Xn{t),  E)  =  <£,,(77,  E)Xn(r),  E) ,  (4) 

which  satisfy  the  relation 

£„(?7,  E)  =  E  (5) 

and  may  be  obtained  by  iterative  diagonalization  [28]  of  L. 

The  Feynman-Dyson  amplitude  Xn  corresponding  to  the  nih.  (N  +  l)-electron 
state  is  given  by 

X„  =  J  3,...,N,N  +  1,  n) 

X  *£(1,  2,  3, ... ,  N)d(\)d{2)d(i)-  •  -d(N) ,  (6) 

where  \Fo  is  the  optimal  single  determinantal  description  of  the  7V-electron  target 
based  on  the  bi-variationally  determined  SCF  orbitals  {&}  and  ^+1  is  the  «th 
(N  +  1)  state  generated  by  the  creation  operator  manifold  {at  A,  at  A  cij  A  akL;j  > 
i}  [4,5].  In  the  bivariationally  obtained  biorthogonal  orbital  basis  Xn  is  a  linear 
combination  [29] 


X„(r)  =  2  CniHr) ,  (7) 

i 

where  the  mixing  of  the  canonical  orbitals  allows  for  the  incorporation  of  correlation 
and  relaxation  effects.  In  the  zeroth  (2  =  0)  and  quasiparticle  approximations  (di¬ 
agonal  2),  there  is  no  mixing.  The  difference  between  perturbative  second  order 
(22)  or  renormalized  diagonal  2^/z-tda  (22M‘tda)  decouplings  manifests  itself 
through  differences  between  the  mixing  coefficients  C„,  from  these  approximations 
[2,5].  Our  recent  experience  with  different  decouplings  [5,10,1 1]  is  that  the  more 
demanding  diagonal  2/7/z-tda  decoupling  does  not  necessarily  lead  to  an  improved 
characterization,  and  our  treatment  here  is  limited  to  the  zeroth-  and  second-order 
decouplings  of  the  dilated  electron  propagator. 

The  basic  computational  difference  between  the  atomic  and  the  molecular  dilated 
electron  propagator  stems  from  the  non-dilatation-analyticity  of  the  Born-Oppen- 
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heimer  Hamiltonian  [30-33].  A  computationally  tractable  solution  is  offered  by 
Moiseyev-Corcoran  method  of  treating  the  nuclear  attraction  integrals  for  the  dilated 
Hamiltonian  [32].  This  is  the  method  adopted  in  an  alternative  approach  to  the 
construction  of  the  molecular  dilated  electron  propagator  [26,34]  based  on  real  SCF 
as  the  zeroth-order  approximation  [28],  and  we  follow  the  same  modified  procedure 
for  the  calculation  of  the  complex  three  center  nuclear  attraction  integrals  in  the 
construction  of  our  molecular  biorthogonal  dilated  electron  propagator. 

Results  and  Discussion 


Energetics 

The  basis  functions  employed  in  our  calculation  are  the  4s, 9p  atom  centered 
CGTOs  employed  in  a  previous  characterization  of  this  resonance  [26] .  The  resonant 
trajectories  from  both  the  zeroth-order  and  the  second-order  decouplings  are  plotted 
in  Figure  1 .  No  attempt  at  basis  set  optimization  or  optimization  of  the  radial  scale 
factor  a  has  been  attempted  in  this  demonstrative  first  application.  However,  the 
fact  that  the  resonance  is  uncovered  without  any  radial  scaling  and  with  very  small 
amount  of  rotation  (0opt  =  0.02  radians)  speaks  well  for  the  effectiveness  of  our 
technique.  The  results  for  the  energy  and  the  width  of  the  2Ug  e-N2  resonance  are 
collected  in  Table  I.  Our  results  compare  reasonably  with  those  from  experiment 
and  other  related  techniques  [24-26].  Surprisingly,  the  zeroth-order  results  offer 
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Figure  1. 


Theta  trajectory  for  the  resonant  root  from  zeroth-order  and  second-order 
decouplings  of  the  dilated  electron  propagator. 
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Table  I.  Energy  and  width  of  the  2Tlg  N2. 


Method/reference 

Energy 

(eV) 

Width 

(eV) 

Experiment  [35,  36] 

2.20 

0.57 

Static  exchange  [18] 

3.70 

1.16 

Static  exchange  .K-matrix  [19] 

2.15 

0.34 

Stabilization  method  [20] 

2.44 

0.32 

7?-matrix  [21] 

3.26 

0.80 

Complex  scf  [22] 

3.19 

0.44 

Stieltjes  imaging  technique  [23] 

2.23a 

0.40 

Many-body  optical  potential  [24] 

2.20 

— 

Many-body  optical  potential  [25] 

2.61 b 

0.58 

2.26 c 

0.42 

2.54d 

0.54 

Second-order  dilated  electron  propagator  with  real  scf  [26] 

2.14 

0.26 

Zeroth-order  dilated  electron  propagator  (this  work) 

2.07 

0.14 

Second-order  dilated  electron  propagator  (this  work) 

1.77 

0.13 

a  Configuration  interaction. 
b  Second  order. 
c  2p-h  TDA. 
d  ADC  (3). 


better  agreement  with  the  experimental  data  both  for  the  energy  and  the  width  than 
those  from  the  second  order.  The  results  presented  here  are,  however,  without  any 
basis-set  or  radial  scale  optimization,  and  any  definitive  conclusion  other  than  the 
effectiveness  and  promise  of  this  technique  is  unwarranted.  In  any  case,  since  these 
molecular  resonances  have  a  rich  vibrational  structure  [16],  further  calculations 
with  internuclear  distances  spanning  the  stretchings  involved  in  at  least  the  first 
few  vibrations  are  essential  for  a  comprehensive  characterization  of  the  energy  and 
the  width. 


Mechanism 

It  is  easily  seen  from  Figure  1  that  the  correlation  and  relaxation  effects  incor¬ 
porated  by  the  second-order  decoupling  stabilize  the  resonance  by  lowering  both 
its  energy  and  the  width.  Mechanistic  insights  emerge  from  an  analysis  of  the  cor¬ 
responding  fdas  on  the  real  line  (Fig.  2)  which  clearly  resemble  the  familiar  Wg 
orbital  of  the  general  chemistry  textbooks.  The  difference  between  the  second-order 
and  the  zeroth-order  FDA  on  the  real  line  (Fig.  2)  makes  obvious  the  mechanism 
for  stabilization  by  showing  that  the  lobes  of  the  second-order  FDA  are  both  less 
positive  and  less  negative  compared  to  those  of  the  zeroth  order,  i.e.,  the  correlation/ 
relaxation  decreases  the  antibonding  nature  of  the  1  irg  orbital  and  thereby  prepares 
it  for  metastable  anion  formation. 
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Difference  between  Second  order  end  Zeroth  order 


Figure  2.  Feynman-Dyson  amplitudes  from  the  zeroth-  and  second-order  dilated  electron 
propagator  corresponding  to  the  resonant  roots  of  Figure  1  on  the  real  line.  The  plot  of 
the  difference  between  the  second-  and  zeroth-order  amplitude  reveals  that  correlation 
stabilizes  the  Urg  orbital  by  reducing  its  antibonding  character. 
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The  metastability  is  facilitated  by  the  accumulation  of  the  amplitude  for  the 
optimal  complex  scaling  parameter  as  shown  in  Figure  3.  Where  there  was  no 
amplitude  in  the  internuclear  region  on  the  real  line  (Fig.  2),  in  Figure  3  we  see 
amplitude  accumulation  in  the  internuclear  region  for  the  optimal  value  of  the 

Real  pert  of  Second  order  Feynman  Dyeon  Amp  I  itude 


Imaginary  part  of  Second  order  Feynman  Dyson  Amp  I  Itude 


Figure  3.  Feynman-Dyson  amplitude  corresponding  to  the  resonant  second-order  pole 
for  optimal  theta.  The  resonant  metastable  anion  formation  is  clearly  indicated  by  the 
amplitude  accumulation  in  the  internuclear  region  absent  on  the  real  line. 
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complex  scaling  parameter.  As  argued  earlier  for  atomic  resonances  [11]  the  resonant 
fdas  once  again  offer  a  quantitative  and  rigorous  affirmation  of  the  orbital  picture 
of  resonances,  and  the  complex  scaling  parameter  is  seen  to  have  a  critical  role  in 
their  unmasking. 


Concluding  Remarks 

The  preliminary  results  from  the  first  application  of  the  biorthogonal  dilated 
electron  propagator  to  a  well-characterized  molecular  resonance  provide  sufficient 
promise  about  its  effectiveness  in  dealing  with  the  energetics  of  the  molecular  res¬ 
onance  formation  and  decay.  An  examination  of  the  resonant  FDAs  offers  mech¬ 
anistic  insights  which  bring  out  the  chemistry  of  the  resonance  formation  in  some 
detail.  Such  a  quantitative  affirmation  of  the  resonance  formation,  to  the  best  of 
our  knowledge,  has  not  been  available  earlier,  and  we  conclude  with  the  sense  of 
excitement  we  feel  about  the  new  insights  that  may  emerge  from  a  more  compre¬ 
hensive  application  to  many  more  systems  like  the  2II  resonance  in  e-CO  scattering 
or  the  resonance  resulting  from  the  trapping  of  electron  in  the  x*  orbital  of  C2H4. 
As  mentioned  earlier,  even  the  present  results  need  to  be  revisited  with  more  ex¬ 
haustive  basis  sets,  optimized  radial  scale  factors,  internuclear  relaxation,  and  other 
decouplings.  An  effort  along  these  lines  is  underway  in  our  group. 
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Abstract 

A  new  approach  for  solving  multielectronic  integrals  in  quantum  chemistry  is  presented.  The  Coulomb 
potential  r\\  is  expanded  in  a  complete  function  set  within  a  finite  spherical  domain  of  radius  R.  This 
expansion  yields  a  series  where  the  and  r2  variables  are  symmetrically  separated.  The  generalization 
to  other  potentials  is  briefly  discussed.  The  Yukawa  potential  exp(-fo2)/r12,  and  the  functions 
exp(-fr?2)  and  exp(-fr12)  are  considered  as  particular  cases.  As  an  example  of  the  potentialities  of  this 
approach,  a  certain  type  of  four-electron  integrals  that  appear  in  the  Hylleraas  ci  method  are  evaluated. 
Results  thus  obtained  are  discussed  in  terms  of  the  approximations  made  and  compared  with  values 
obtained  using  other  procedures  available  in  the  literature.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

In  a  previous  article  [1]  a  novel  approach  to  calculate  the  electronic  integrals 
that  appear  in  an  scf-lcao  calculation  using  any  type  of  sufficiently  fast  decaying 
atomic  orbitals  in  the  basis  set  was  presented.  It  was  based  on  the  expansion  of  the 
binary  products  of  atomic  orbitals  entering  in  those  integrals  in  terms  of  an  adequate 
complete  function  basis  set.  This  complete  function  set  was  built  up  in  a  finite 
domain  defined  by  a  sphere  of  radius  R  whose  value  can  be  chosen  according  to 
the  problem  dealt  with.  The  functions  defining  this  complete  set  are  solutions  of 
the  Helmholtz  equation  with  boundary  conditions  devised  as  to  make  the  Coulomb 
integral  particularly  simple  [1]. 

Bielectronic  integrals  of  an  scf-lcao  calculation,  multielectronic  integrals  that 
appear  in  the  realization  of  Hylleraas  Configuration  Interaction,  hci  [2-4]  as  well 
as  others  which  appear  in  different  models  that  intend  to  take  into  account  electronic 
correlation  [5],  contain  in  their  integrands  atomic  orbital  products  and  functions 
of  the  interelectronic  distances.  These  functions  are  called,  hereafter,  potential 
functions. 

In  this  article  a  different  approach  to  study  similar  problems  is  presented.  Instead 
of  expanding  binary  atomic  orbital  products  in  terms  of  the  mentioned  complete 


International  Journal  of  Quantum  Chemistry:  Quantum  Chemistry  Symposium  28,  39-48  (1994) 

©  1994  John  Wiley  &  Sons,  Inc.  CCC  0360-8832/94/01039-10 


40 


PEREZ  ET  AL. 


function  set  as  in  the  previous  article  [1],  the  potential  function  is  expanded  in 
terms  of  the  same  complete  function  set.  Such  expansions  allow  one  to  “decouple” 
the  potential  functions  f(rl2)  in  symmetrically  separated  integration  variables  ?i 
and  7  2  [6,7],  permitting  the  concrete  realization  of  the  corresponding  multielectronic 
integrals.  It  is  also  shown  that  this  expansion  can  be  performed  along  two  differ¬ 
ent  lines. 

Finally,  how  to  apply  these  ideas  to  implement  the  practical  calculation  of  four- 
electron  integrals  over  gaussian-type  orbitals  that  appear  in  the  hci  method  [4]  is 
shown  and  some  numerical  values  are  presented  which  are  compared  with  those 
obtained  using  other  methods  available  in  the  literature  [3].  The  promising  poten¬ 
tialities  of  this  approach  are  briefly  discussed. 

Symmetrical  Separation  of  the  rl  and  r2  Variables  in  Potential  Functions 

of  the  Form /(fj  2) 


Basis  Set  Functions  Employed  in  the  Present  Work 

In  a  previous  work  [1],  it  has  been  shown  that  a  complete  set  of  functions 
{U,mn(j)},  with  l  =  0,  1,  . . .  m  =  -(/  -  1),  •  •  • ,  /  and  n  =  1,  2,  . . . ,  can  be 
defined  such  that  each  U/m„(r)  is  a  solution  of  the  Helmholtz  equation  inside  a 
sphere  of  radius  R.  and  satisfies  the  following  integral  equation: 


UiUT) 

\f-n 


U/mn(r) 


(1) 


where,  in  spherical  coordinates, 


uur)  =  B„Jl+lf, rue,  <t>) 


with  r  <  R,  Ji+in(x)  is  the  Bessel  function  of  the  first  kind  of  half  integer  order  [8], 
B,„  are  the  corresponding  radial  normalization  constants  which  are  equal  to  2 1/2/ 
[RJl+1/2(ki„R)],  Yim{d,  <t>)  are  the  normalized  spherical  harmonics  [9],  and  kln  satisfy 
an  appropriate  radial  boundary  condition,  i.e.,  Eq.  (1)  evaluated  at  |?|  =  a. 
Essentially,  this  condition  is: 

Ji-mikinR)  =  0  (3) 

i.e.,  for  each  /  =  0,  1, . . .  the  n  =  1,  2, . . .  index  stands  for  the  «th  zero  of //_,/2(x). 
The  functions  given  in  Eq.  (2)  are  suitable  for  the  purposes  of  this  work. 

Symmetrical  Separation  of  Variables  for  the  Coulomb  Potential 

The  complete  basis  set  {Uimn(r)},  can  be  used  for  expanding  any  function  that 
belongs  to  the  L\r  <  R)  Hilbert  Space.  In  particular,  f(rl2)  =  l?i  “  ?2|  1  =  ri2 
for  each  r2  satisfying  ±  r2  and  0  <  ru  r2  <  R,  assumes  the  following  expres¬ 


r i2  —  2  CimnUimn(r i) 

Imn 


sion: 


(4) 
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UU?  i)  /4* 


7 t\UM?2) 


~Tl\  \tt 


where  the  last  expression  results  from  Eq.  (1). 

Equations  (4)  and  (5)  yield  the  symmetric  separation  of  rf \  in  ?!  and  ?2, 


-  ?2I  1  =  4ir  2) 


ufmn(y1)u,mn{7i) 


It  is  worthy  of  mention  that  the  Laplace  expansion  [10]  of  r12': 

rA  =  —  2  (— V(2/  +  l T'YWi,  <h)YU6u  M 


allows  one  to  relate  (r</r>+1)  with  the  sum  over  n  in  Eq.  (6),  as  a  direct  consequence 
of  the  linear  independence  of  spherical  harmonics.  That  is: 


1  \  ric  “  Rln(r<)Rln(r>) 


21+1  r 


where 


Rin(r )  =  BinJi+l/2(kinr)/rl 


The  series  in  Eq.  (8)  is  the  discrete  analogous  of  the  well  known  (see  p.  274  of 
Ref.  [11])  relation: 


1  [rAl+m  r  dk 


l +  1/2 


(rJ\  =  f 

\r>/  Jo 


Ji+ !  /i(kr. >)//+!  /i(kr  < ) 


This  analogy  is  a  direct  consequence  of  the  close  connection  existing  between 
the  present  discrete  approach  and  the  Fourier  Transform  Method  (see  Sec.  2.4  of 
Ref.  [1]). 


A  Brief  Outline  to  Generalize  the  Results  for  Arbitrary  Potentials 

Given  a  function  f(r),  with  0  <  r  <  R,  its  expansion  in  terms  of  the  complete  set 
defined  in  this  section,  is: 


Rr)  =24 


sin(/:0„r) 


where 


=  Br 


ffcoon  =  Bol-f]'12  f  dV  UZo„(7)f(r)  (11) 

'Aw  \7r/QW  Jr<R 
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It  is  important  to  note  that  if  f(r)  is  continuous  in  [0,  R]  then  the  series  of  Eq.  (10) 
uniformly  converges  in  the  interval  [6,  R  —  5]  with  5  arbitrarily  small,  [12,  p.  602]. 

The  next  step  is  to  put  r  =  ri2  in  Eq.  (10),  under  the  assumption  that  r]2  ^  R. 
Making  use  of  the  following  relation  [1 1,  p.  221], 

sin(fe°  —  -  2x2  i  2  FU?iWU?i)  (12) 

r12  /=0  m=-l 


where 


Flmrfji)  -  Y[m(6i,  (f>i)Jl+l/2(konf,i)/r}/2 


Eq.  (10)  transforms  into: 

Arn)  =  2x2  2  ZnFfmn(r{)Flmn{Y2)  (13) 

Imn 


The  procedure  depicted  above  generates  a  separated  expansion  for  the  potential 
flj{2)  in  a  “complicated”  region  defined  by  the  condition  rl2  <  R.  In  particular  it 
is  appropriate  to  be  used  in  the  evaluation  of  integrals  of  the  potential  functions  if 
the  domain  of  integration  is  restricted  to  r\ ,  r2  <  R/2.  The  parameter  R  is  selected 
in  the  way  discussed  in  Appendix  I  of  Ref.  [1],  i.e.,  assuring  that  the  error  due  to 
the  truncation  of  the  space  is  negligible  as  a  consequence  of  the  exponentially 
decaying  property  of  atomic  orbitals. 

From  Eq.  ( 1 1 )  and  Eq.  ( 1 3)  it  is  clear  that  the  Coo„  coefficients  provide  the  sufficient 
information  to  separate  the  variables  r  i  and  r2.  To  illustrate  this  point,  the  Coo* 
coefficients  for  the  Yukawa  potential  and  for  the  functions  exp(-ari2)  and 
exp(“«ri2)  are  explicitly  given  in  what  follows. 

The  C0Qn  coefficient  which  corresponds  to  exp(-fr)/r  is: 


where 


and 


'00* 


dr  r2R0n(r)YMu)  exp (-fr)/r 


,1/7„  /  2  \1/2  k0„  —  exp(— fR)f  sin(k0nR) 

-(4')  Hs)  - - 

_  ,  U& j„(0)  ,  _ ,  vm<n 

—  4ir  ^  _j_  ^2  4”  exp(  £R)Qn 


UM0)  =  (-ir'koJVvR]''2 


Qn  = 


(AX'2  ■-  sin(kp„R) 

°"[koJ  ff2  +  ^„ 


(14) 
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/  8  \1/2  f°° 

O*  =  drrexp(-{r2)  sin(/c0„r)  (19) 

it  can  be  shown  that  |  en  |  is  a  quantity  of  order  exp(— £R2). 

The  three  examples  shown  above  have  a  common  structure.  The  second  terms 
in  Eqs.  (14),  (16),  and  (18)  are  negligible  with  respect  to  the  corresponding  first 
ones  (provided  a  large  R  value  is  selected).  Moreover,  these  first  terms  can  be  ex¬ 
pressed  as  the  product  of  the  basis  function  evaluated  at  r  =  0  and  the  Fourier 
Transform  of  the  corresponding  potential  specialized  in  the  k0n  eigenvalues.  Con¬ 
sequently,  eq.  (13)  can  be  considered  as  the  discrete  analogous  of  the  relation  [13]: 

/(r12)  =  (32ir)1/2  2  YMwt )YlmMmrl ,  r2)  (20) 


where 
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J*oo 

dkk2ji{kr ,  )ji(kr2p(k) 

0 


with  ji(x)  representing  the  spherical  Bessel  functions  of  order  /;  3(k)  is  the  Fourier 
transform  of  J[r).  Comparison  of  Eqs.  (13)  and  (20)  gives: 


„  ,  £  ^  JfridfWi)Ji+idko*r2) 
2w  „?,z" 


(32t r)mH}{ru  r2) 


Alternatively,  the  direct  expansion  referred  to  in  the  last  subsection,  for  an  ar¬ 
bitrary  function  /(r12)  ultimately  is: 


f{?\2)  “2/  2  Aimjin'U imr^r l)U imd? \  ) 

Im  «,«'=  1 

where  comes  from  the  scalar  product: 

(U/Wn{l)UU2)J(rl2))  =  M  mm'Alm,nn' 


The  expansion  obtained  in  Eq.  (13)  differs  from  the  one  given  in  Eq.  (22)  in  the 
following  points:  in  the  former  only  the  {k0n}  and  the  {C0o«}  subsets  are  needed  in 
contrast  to  the  information  required  to  obtain  Eqs.  (22)  and  (23).  Moreover,  for 
an  arbitrary  potential  Eq.  (22)  contains  one  extra  summation  compared  to  Eq. 
(13).  However,  it  must  be  emphasized  that  the  expansion  Eq.  (13)  is  restricted  to 
the  smaller  domain  ,  r2  <  R/2,  as  discussed  above. 

In  particular,  for  the  Coulomb  case,  the  summation  over  ri  in  Eq.  (22)  is  elim¬ 
inated  as  a  consequence  of  the  property  of  the  {(Uimn(r)}  functions  shown  in  Eq. 
(1),  thus  obtaining  the  expansion  Eq.  (6).  At  present,  the  convergence  properties 
of  the  expansions  for  following  each  one  of  the  two  approaches  discussed  above 
are  being  analyzed. 


Evaluation  of  a  Class  of  Integrals  Appearing  in  the  Hylleraas  Cl  Method 

It  is  well  known  [3,4]  that  four-electron  integrals  of  the  type: 

Eabcdefgh  -  j  dV \  dV 2  dV i  dV4  Pab{^ ^Pcdi'r 2) Pef(j 2,) Pgh{~r a)  *  ?  12^23^34  (24) 

are  needed  in  the  HCI  method.  Here,  abcdefgh  are  indices  for  the  bicentric  pxy(r) 
atomic  orbital  product  densities. 

The  standard  approach  to  evaluate  this  eight-center  quantity  is  to  replace  rjj1 
(and  analogously  r,y)  by  the  following  representation  [3,4]: 

rjj 1  =  7T  1/2  J  dxx~m  exp (-xrjj)  (25) 

and  then  to  interchange  the  integration  variables,  permitting  the  spatial  integrals 
to  be  performed  analitically,  provided  that  gaussian  atomic  orbitals  are  used.  The 
remaining  integrals  (three  dimensional  in  general)  are  evaluated  numerically. 
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A  new  method  to  perform  the  evaluation  of  Ea. .  .h  can  be  obtained  using  the 
expansion  of  Eq.  (6).  To  this  end  both  integrals  over  r{  and  r2  must  be  carried  out 
partitioning  the  space  into  two  regions,  one  is  a  sphere  of  radius  R  and  the  other 
is  the  rest  of  the  space.  Therefore,  Eq.  (24)  transforms  into: 

Ea. .  .h  =  dv ,  dv 2  dv3  dV4  pab(h)pcd(r2)PeAr3)pgh(r4) 

J  rurA<  co 
r2,h<R 

seabed*  s-iefgh 

-i  i  /  a  ^ Imn  '-'Imn  .  { 

•  02^23  ^34  +  ea, .  .h(R)  =  4tt  2  - p -  +  Ca- •  -  h(R)  (26) 

Imn 


where 


seabed  _ 
'-'Imn 


J 

J  r,<oo 
r2<R 


dV i  dV2  pab(r i )pcd(r2)r  12 Ufm„(r2) 


(27) 


and  .  .b(R)  is  the  error  of  truncation  of  space.  It  can  be  shown  (for  analogies  see 
Appendix  I  of  ref.  [1]  that  it  is  a  quantity  of  order  exp(-const.  R2)  for  gaussian- 
type  orbitals,  provided  the  pcd  and  /^densities  correspond  to  atomic  orbitals  centered 
inside  the  mentioned  sphere. 

In  practice,  it  is  convenient  to  extend  the  integration  over  r2  in  Eq.  (31)  to  the 
whole  space.  The  new  coefficients  thus  obtained  will  be  denoted  by  Tff.  Although 
this  introduces  a  new  source  of  error,  it  greatly  simplifies  their  computation  [1].  In 
what  follows,  this  source  of  error  together  with  ea. .  .h(R)  and  the  truncation  of  the 
series  in  Eq.  (26),  will  be  tested  in  a  simple  but  important  example. 

The  case  to  be  explicitly  considered  is  such  that: 


Pab(r)  =  expf- 


a(3 


a  +  f3 


\A 


-  2|2j  exp|- 


(«  +  $ 


r  — 


aA  +  (3B 


a  + 


(28) 


where  A  and  B  are  fix  vectors  that  localize  two  1 S  gaussian  orbitals  with  orbital 
exponents  a  and  ft  respectively.  In  this  example,  aA  +  ftEj=  yC  +  5D  =  •  •  •  = 
0  with  a  =  =  y  =  •  •  •  =  0.5  a.u.  and  \A  —  B\  =  \C  —  D\  =  3.0349  a.u. 

The  standard  approach  yields  the  following  bidimensional  integral: 


=  KabKcdKefKgh2Tr9/2  f  dx  f  dy(x  y)~ 1/2 
Jo  Jo 

•  -fr  (lf(x)f(y)]-'{f(x)g(y)  +  f(y)gix)]-'11)  (29) 


where  f(x)  =  1  +  2x,  g(x)  =  1  +  x  and  Ki}  is  the  constant  factor  in  Eq.  (28).  The 
numerical  integration  of  Eq.  (29)  was  carried  out  using  Gauss-Tchebicheff  quad¬ 
rature  [1 1,  p.  42],  after  the  change  of  variable  x  =  (1  +  t)/{  1  -  t)  (analogously  for 
y).  The  result,  with  the  same  number  of  integration  points  in  x  and  y,  is  displayed 
in  Table  I.  It  is  worthy  of  mention  its  monotonic  behavior  and  the  10-7  a.u.  reached 
precision  with  only  20  points. 
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Table  I.  Comparison  of  the  values  obtained  for  the  integral  Eq.  (24)  as  obtained  using  the  numerical 
integration  Eq.  (29)  and  the  expansion  Eq.  (26)  performed  using  the  approximate  coefficients  defined  in 
Eq.  (30).  See  text  for  the  description  of  the  atomic  orbitals  involved. 


Integration 

Result 

Method  Eq.  (29) 

points 

(a.u.) 

20  X20 

0.179555632 

40  X  40 

0.179555674 

60  X60 

0.179555676 

100  X  100 

0.179555676 

Method  Eq.  (26) 

Radius  of  sphere 

Result 

(Int.  points:  40) 

(a.u.) 

Number  of  terms 

(a.u.) 

4 

5 

0.179555500 

10 

0.179555679 

15 

0.179555679 

7 

5 

0.177401483 

10 

0.179555459 

15 

0.179555680 

20 

0.179555680 

10 

5 

0.164561488 

10 

0.179499393 

15 

0.179555521 

20 

0.179555680 

25 

0.179555680 

The  new  method  requires  the  evaluation  of  the  Tfmn  coefficients,  given  by: 

Ttcn  =  f  dVldV2pad7l)pcd(r2)rl2UrU72) 

Jri,r2<  oo 

f00  dx  d  f 

=  — >r“l/2  Jo  -^2  —  J  dv ,  dv2pab(7 t)pcd(72)  exp (-xr2l2)Ufm„(72) 

_  r  F(X)eX'{-^k)) 

~  &lO&moKathKcdUoOn(Q)  dx  “  A^Xl  +  2x?)'*^ 

where  W{x)  =  (1  +  2x*)/(l  +  Jt2),  F(x)  =  6(1  +  x2)  -  [Hn/2W(x)l  and  kon  =  (n- 

0.5)(t r/R)  with  n  =  1,  2, _ After  the  change  of  variable  x  =  (1  +  0/(1  -  0,  the 

integral  in  Eq.  (30)  was  evaluated  numerically  by  means  of  Gauss-Tchebicheff 
quadrature,  with  a  fixed  number  of  integration  points  which  was  set  equal  to  40. 

Table  I  also  shows  the  results  for  Ea. .  .h  obtained  by  the  systematic  variation  of 
R  and  the  number  of  terms  for  the  partial  sums  of  the  series  in  Eq.  (26).  The 
following  comments  are  pertinent. 
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The  best  partial  sums,  for  each  R,  are  very  close  to  the  values  obtained  with  the 
standard  method  (which  is  the  reference  value  in  this  work).  This  is  an  indicator 
that  all  sources  of  error  would  generate  controllable  quantities.  It  occurs  also  that 
the  greater  the  R  value  the  more  number  of  terms  are  needed  to  achieve  a  given 
precision.  Due  to  the  rapidly  decaying  property  of  gaussian  orbitals,  it  seems  that 
R  =  4  a.u.  is  a  sufficient  value.  The  last  point  to  stress  is  that  the  exp(-&o«/4JF) 
factor  in  Eq.  (30)  ensures  an  optimum  rate  of  convergence  for  the  series;  this  can 
be  demonstrated  by  obtaining  an  upper  bound  to  the  integral,  which  has  the  factor 
exp(— &o„/ 4  U'max  )• 


Concluding  Remarks 

Expansions  of  potential  functions  in  a  complete  function  set  presented  in  this 
article  are  found  to  be  an  interesting  alternative  for  solving  several  types  of  electronic 
integrals  that  appear  in  quantum  chemistry.  It  is  interesting  to  note  that  two  ap¬ 
proaches  were  followed  to  obtain  expansions  within  a  finite  domain  of  potential 
functions  of  type/r^)  where  the  variables  and  r2  are  symmetrically  separated. 
In  one  of  these  approaches  all  eigenvalues  {kln}  are  used  and  only  for  the  Coulomb 
potential  a  three  indices  summation  is  obtained  (compare  Eq.  (22)  with  Eq.  (6)]. 
Within  the  other  approach,  use  is  made  of  the  spherical  symmetry  of j(rn)  to  obtain 
a  three-indices  expansion  in  which  the  variables  are  symmetrically  separated  term 
by  term  [see  Eq.  (13)].  In  this  case,  only  a  subset  of  eigenvalues  {ko„}  is  required. 
However,  the  validity  of  these  expansions  is  restricted  to  regions  smaller  than  the 
sphere  of  radius  R,  and,  therefore,  the  advantage  of  using  only  a  subset  of  eigenvalues 
and  not  all  of  them,  in  some  cases  can  be  superseded  by  the  need  of  including  a 
larger  number  of  terms  to  obtain  an  adequate  accuracy  in  the  final  values.  For  this 
reason  an  adequate  comparison  between  these  two  approaches  would  require  a 
more  detailed  study  for  different  particular  cases.  However,  it  is  important  to  recall 
that  for  the  numerical  values  shown  above  for  some  Hylleraas  ci  integrals  using 
gaussian  orbitals  a  sphere  of  radius  R  =  4  a.u.  seems  to  yield  an  adequate  accuracy 
when  the  expansion  Eq.  (6)  is  used. 
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Abstract 

We  describe  the  time  evolution  of  hydrogenic  orbitals  perturbed  by  a  moving  charge.  Starting  with 
the  equation  for  an  atom  interacting  with  a  charge,  we  use  an  eikonal  representation  of  the  total  wave- 
function,  followed  by  an  eikonal  approximation,  to  derive  coupled  differential  equations  for  the  temporal 
change  of  the  orbitals  and  the  charge’s  trajectory.  The  orbitals  are  represented  by  functions  with  complex 
exponents  changing  with  time,  describing  electronic  density  and  flux  changes.  For  each  orbital,  we  solve 
a  set  of  six  coupled  differential  equations;  two  of  them  are  derived  with  a  time-dependent  variational 
procedure  for  the  real  and  imaginary  parts  of  the  exponents,  and  the  other  four  are  the  Hamilton  equations 
of  the  positions  and  momenta  of  the  moving  charge.  The  molecular  potentials  are  derived  from  the 
exact  expressions  for  the  electronic  energies.  Results  of  calculations  for  Is  and  2s  orbitals  show  large 
variation  of  the  real  exponent  parts  over  time,  with  respect  to  asymptotic  values,  and  that  imaginary 
parts  remain  small.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Time-dependent  molecular  phenomena  can  be  described  in  terms  of  time- 
dependent  molecular  orbitals  written  as  linear  combinations  of  atomic  orbitals  with 
time-dependent  coefficients.  Recent  work  on  ion-atom  collisions  at  low  collision 
velocities  (small  compared  with  electronic  velocities)  show  that  accurate  results  for 
probabilities  of  electron  transfer  can  be  obtained  provided  suitable  atomic  basis 
sets  are  used  [1-7].  It  is  well  known  that  improved  results  can  be  obtained  in  the 
calculation  of  potential  energies,  for  example,  in  Hj  and  H2,  by  allowing  for  variable 
atomic  orbital  exponents  as  nuclear  distances  decrease  [  8  ] .  A  similar  situation  may 
be  expected  for  dynamical  phenomena,  but  there  is  no  information  about  temporal 
changes  of  orbital  exponents  as  functions  of  the  velocities  of  slow  moving  charges. 

The  present  study  addresses  this  case.  It  starts  with  the  equation  for  the  interaction 
of  an  atom  with  a  charge,  at  a  given  total  energy.  It  then  introduces  an  eikonal 
representation  of  the  total  wavefunction  for  electrons  and  nuclei  [9-11]  and  takes 
the  eikonal  limit  of  short  de  Broglie  wavelengths  for  the  nuclear  motions  to  derive 
coupled  equations  for  the  atomic  orbitals  and  the  nuclear  trajectories.  The  atomic 
orbitals  satisfy  a  time-dependent  Schrodinger  equation  which  can  be  solved  by 
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using  a  time-dependent  variational  principle  (tdvp)  [  12,13]  with  a  suitable  trial 
form  for  the  orbitals. 

Some  of  the  early  applications  of  variational  principles  in  atomic  collisions  made 
use  of  real  exponential  parameters  [  14-16].  In  the  late  ’70s  Kleber  and  co-workers 
[17,18]  started  using  complex  parameters  because  they  describe  some  physical 
features,  like  the  flux  of  the  electron  charge,  that  are  not  present  in  treatments  with 
real  parameters  only. 

In  this  work  we  describe  the  effect  of  a  charge  moving  in  the  plane  of  two  nuclei, 
on  hydrogenic  atomic  orbitals  with  complex  exponents  which  change  in  time.  The 
eikonal  approximation  leads  to  coupling  of  the  Hamiltonian  equations  for  the  tra¬ 
jectories  to  the  differential  equations  for  the  atomic  orbital  parameters  that  are 
derived  from  the  time-dependent  variational  principle.  The  six  coupled  differential 
equations  for  each  orbital  (two  for  the  exponential  parameters  and  four  for  the 
positions  and  momenta)  are  solved  numerically  using  a  Runge-Kutta  fourth-order 
algorithm  [19]. 

We  use  a  simple  model  consisting  of  an  electron  moving  in  the  field  of  two 
nuclear  charges  of  number  Z^,  for  the  target  atom,  and  Z/>,  for  the  projectile.  The 
electronic  orbitals  are  kept  centered  at  the  target  atom,  which  does  not  move,  so 
that  its  orbitals  change  in  time  during  the  interaction  with  the  moving  projectile 
due  to  the  variation  with  time  of  the  target’s  effective  nuclear  charge.  Electron 
transfer  is  not  allowed.  This  model  describes  the  interaction  of  a  low  energy  projectile 
with  oriented  valence  orbitals  in  a  molecule  or  at  a  solid  surface.  By  restricting  the 
orbital  variation  to  its  exponent,  it  is  possible  to  evaluate  the  importance  of  varying 
nuclear  charges  in  dynamical  phenomena,  and  to  develop  a  computational  pro¬ 
cedure  to  parametrize  the  orbitals  as  linear  combinations  of  Gaussian  orbitals.  This 
has  recently  been  done  by  introducing  a  scaling  of  the  electron  radial  variable  with 
the  effective  nuclear  charge,  to  isolate  the  atomic  orbitals  from  their  time  dependence 
and  to  find  Gaussian  combinations  valid  for  all  times  [20]. 


Coupling  of  Trajectories  with  Orbitals 

In  this  section  we  use  the  eikonal  representation  for  the  wavefunction  of  electrons 
and  nuclei  [  1 0  ]  to  derive  the  Hamilton  equations  of  motion  that  couple  the  trajectory 
of  the  projectile  to  the  electronic  motions  in  a  target  atom. 

We  are  interested  in  the  interaction  between  a  projectile  with  a  positive  charge 
Zpe  and  a  target  atom  with  an  electron  around  a  charge  Zp€.  The  time-dependent 
Hamiltonian  of  a  single  electron,  of  mass  m,  at  position  r  moving  around  the  target 
is 


//e.(  R)  = 


ZTe2 

r 


ZPe2 
|r-R|  * 


(1) 


The  vector  R  gives  the  position  of  the  projectile  nucleus  relative  to  the  target  nucleus. 
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The  Hamilton  Equations  of  Motion  for  the  Projectile  Trajectory 

We  assume  that  the  nuclear  motion  is  almost  classical.  For  the  sake  of  convenience 
in  the  introduction  of  the  eikonal  (or  short  de  Broglie  wavelength)  approximation, 
we  represent  the  total  wave  function  for  all  electrons  and  nuclei,  as  [  10,21  ] 

*(r,R)  =  X(r,R)  exp[iS(R)/h]  ,  (2) 

where  by  choice  the  function  S  is  real  and  X  is  complex-valued.  The  phase  of  X 
can  compensate  in  principle  for  deviations  from  the  correct  phase.  The  function  X 
satisfies  the  equation 


1  (h  d  dS\2 
2 M\i  dR  +  dR/ 


+  Hr-E 


|X(R)>  =  0, 


(3) 


where  M  is  the  reduced  mass  of  the  colliding  pair,  HR  is  the  Hamiltonian  for  fixed 
nuclear  positions, 


Hr  —  He i  +  ZPZT/R  ,  (4) 

with  Hd  being  the  electronic  Hamiltonian,  and  E  is  the  total  energy,  for  all  electrons 
and  nuclei.  Here  we  have  used  the  Dirac  bracket  notation  to  omit  the  electron 
variables.  Projecting  the  above  equation  on  (x|,  and  taking  the  real  part  of  the 
result,  we  obtain  the  following  equation  for  the  function  S : 


1 

2  M 


=  E, 


Vqu  =  F+  V'  +  V"  , 


V'  = 

V"  = 


V=  (x|Z/r|x>/(x|x> 


ih  dS 
2M6R 

h>_  \ 
2M2 


dx 

dR 

d2x 

dR2 


X  -  X 


dx 

dR 


X)  +  X 


2^1 


d  X 


dR 


<m> 

I  <X|X>. 


(5) 


Therefore,  the  function  S  satisfies  a  time-independent  Hamilton- Jacobi  equation 
with  a  quantum  potential  that  depends  on  the  electronic  state,  which  can  be  solved 
introducing  trajectories  [22].  Defining  the  momentum 


P  = 


dS 
dR  ’ 


(6) 


it  follows  that  trajectories  can  be  obtained  for  a  fixed  energy  E  introducing  position 
and  momentum  functions  of  the  time  t. 

We  use  the  eikonal  approximation  for  short  wavelengths  X  =  hf  |P|  imposing 
that 
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(*!} 

/<* l*> 

v  ^X\  1 

x*a// 

\  «*/ 

The  eikonal  approximation  allows  us  to  neglect  V  and  V"  so  that  S  is  the  mechanical 
action  given  by 

rR 


S(  R)  =  S(  R 


Jr, 


dR'P(R) 


along  the  trajectories  of  the  nuclear  variables  determined  by  the  effective  (momen¬ 
tum  independent)  potential 

F(R)=  £d(R)  +  ^r, 


£el(R)  =  <x|tfei|x>/<x|x>,  (9) 

where  Ee]  is  the  electronic  energy.  Therefore,  we  can  rewrite  the  Hamilton-Jacobi 
equation  as  Hamiltonian  equations 


dR  =  _dV 
dt  dR’ 


(10) 


and 

=  —  (11) 
dt  M 

The  eikonal  approximation  also  simplifies  the  differential  equation  for  the  preex¬ 
ponential  function  so  that 

+  F(R)1|X(R)>  =  0-  (12) 

M  i  dR 

Since  the  nuclear  motions  are  now  described  by  time-dependent  position  vectors, 
it  is  convenient  to  introduce  time-dependent  electronic  states  by  means  of 

| X[R(/)] )  =  exp[/a(0]  >  (13) 

where  the  phase  can  be  chosen  to  eliminate  a  term  in  t  in  the  differential  equation 
for  X  to  obtain 

-|  +  i/el(oll<KO>  =  0.  (14) 

l  ot 

The  trajectories  and  these  orbitals  evolve  in  time  in  a  self-consistent  way,  and 
their  coupled  differential  equations  must  be  solved  simultaneously.  The  choice  of 
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atomic  orbital  parametrization  replaces  the  equation  for  0  by  equations  for  its  time- 
dependent  parameters.  These  equations  can  be  derived  from  a  variational  procedure 
as  follows. 


Time  Evolution  of  Atomic  Orbitals 

The  Time-Dependent  Variational  Principle 

We  use  a  time-dependent  variational  principle  for  an  atomic  orbital  0  in  terms 
of  an  action  functional  given  by  [  1 3  ] 

-AIM*]  =  P^X[0,0*],  (15) 

with  the  Lagrangian  functional  X  being  expressed  as 

-£[0,0*]  =  0))-<0|tfeil0>  /<0|0>,  (i6) 

where  we  show  the  dependence  on  the  complex  valued  state  0,  whose  complex 
conjugate  0*  appears  in  the  “bra”s.  It  is  convenient  to  separate  from  1 0)  a  time- 
dependent  normalization  factor  N(t), 


0(r,£)  =  N(t)\f/(r,t) ,  (17) 

so  that  (0 1 0)  =  1.  Variation  of  the  action  JL  with  respect  to  N(t)  gives  [23] 


In  the  limit  of  large  internuclear  distances  we  have  that  He \  ~  HT ,  the  electronic 
Hamiltonian  for  the  target  atom.  We  choose  0  so  that  asymptotically  //r0  = 
Xei(oo)0,  which  gives 


N(t)  ~  exp [~(i/h )Xe] (oo )(t  -  f,)]  .  (19) 

We  express  each  one  of  the  orbitals  0(r  ,t),  in  terms  of  a  complex  time-dependent 
exponential  parameter 


HO  -  MO  +  i*(t) .  (20) 

We  therefore  have  two  real  variational  parameters  per  orbital.  The  function  0[r;f(O] 
is  normalized  to  unity  before  the  collision  and  will  stay  normalized  during  the 
collision  because  the  Hamiltonian  He i  is  real. 

Since  0[r;f(O]  depends  on  time  only  through  the  parameters  X  and  k,  we  can 
write 
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1  \  A  — 

8t  8\  +  K  8k  ’ 

with  a  dot  indicating  a  derivative  with  respect  to  time. 
Now  we  can  express  the  Lagrangian  as 

-£[0,</>*]  =  L(A,*,\,/c)  , 


L(x^)  =  f[/^(n(x|  +  ^U(f) 


~{(i'k+lc£)mHn)  -<^)|W(f)>-  (23) 

The  application  of  the  variational  principle  5A  =  0  to  the  Lagrangian  in  Eq. 
(22)  leads  to  the  Euler-Lagrange  equations 


rf/azA  _  aL  =  0 

dt\  8\  8\ 


o. 

dt\  8k  8k 


Time  Dependence  of  Exponents  in  Hydrogenic  Orbitals 

Atomic  Orbitals  and  Energies 

We  use  normalized  complex  time-dependent  hydrogenic  orbitals,  written  in  terms 
of  both  f  and  its  complex  conjugate  for  Is,  2s,  and  2p  orbitals, 


Vh s(r;fis)  - 


1  /fls+  f*V/2 


/jr  \  2 


-I  £>-f" 


fe(r;f2s) 


i  /f2s  +  f2*s\3/2r,  /f2s  +  r£\  ri  _fJ. 


2  VS  \  2 


2  /  2 


\Mr,f2p) 


1  /  -f2pr/2 


4  VS  \  2 


re  r2pr/  cos(0) 


(r,^2p) 


— L_  /^2p  +  f2p\  re  f2pr/2  sin ((f)) 

4VS  V  2  / 


re  f2pr/2  sin(0)  cos(</>) .  (30) 
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The  electronic  energies  for  the  electron  in  the  orbitals  depend  on  R  in  two 
ways:  through  the  dependence  of  the  electronic  Hamiltonian  and  also  through  the 
variation  of  the  exponent  parameters  with  R.  These  dependencies  can  be  obtained 
from 


Ee  ,(R) 


_  <ftR)|J7„(R)|ftR)) 


These  energies  are  given  for  the  Is  orbital  by 

£,S(R)  =  Wls  -  (ZFe2/R)M\lsR) 


for  the  2s  orbital  by 


f\s(x)  =  1  -  ( 1  +  x)  exp(— 2x) ; 


£2s(R)  =  -  (. ZPe2/R)f2s(\2sR ) 


/  3  jc2  x3\ 

f2s(x)  =  1-  ^1  +  4^  +  -^-  +  y]  exP(~*) ; 
and  for  the  2p  orbitals  by 

£2Pi(R)  =  E2Pz,(R)  cos2(0)  +  E2Px,(R)  sin2(0) , 

E2 p,(R)  =  E2py,(R) , 

E2px( R)  =  E2pJR)  cos2(0)  +  E2Pi,(R)  sin2(0) , 
where  0  is  the  angle  between  R  and  the  z-axis,  and 


E2pxW(R)  =  W2p  +  ~  f2Px'(y’)(^2pR)  » 

fiPx,(x)  =  1  — ^  +  f +  —  +  2  +  exp(-x) 


X  VX  X 


E2pJR)  =  W2p  -~f2pJX2pR) 


.  ,  ,  ,  ,12  / 12  12  „  11  3  ,  x3\ 

AJx)  +  —  +  7  +  —  x  +  -x2  +  yj  exp(-x) ,  (42) 


h2/\h  +  K2„,\  ZTe 


2m  \  n‘ 
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We  have  used  the  (x,z)-plane  as  the  collision  plane,  with  the  initial  projectile 
velocity  parallel  to  the  +z-axis,  and  xP(t)  and  zP(t)  the  projectile  coordinates,  so 
that 

R(t)  =  izP(t)2  +  xP(t)*9  cos[0(?)3  ~  zP(t)/R(t) .  (44) 

To  simplify  calculations,  we  have  evaluated  the  matrix  elements  for  p  orbitals 
in  the  body-fixed  frame,  the  (x\z')  coordinate  system  with  the  z'-axis  going  from 
the  target  to  the  projectile  location,  and  rotated  them  to  the  space  fixed  frame,  the 
(x,z)  coordinate  system,  in  which  the  time  propagations  have  been  performed. 


Physical  Meaning  of  the  Exponential  Parameters 

The  real  components  of  the  complex  parameters,  \nh  are  related  to  the  expectation 
value  of  the  radius 


<^MM«0>  =  ^-[3«2-/(/+ 1)].  (45) 

lknl 

This  shows  that  when  \nl  grows  during  the  interaction,  then  the  orbital  shrinks  as 
expected. 

The  imaginary  components  of  the  complex  parameters,  Knh  can  be  understood 
in  connection  with  the  radial  flux  of  the  probability 


(nr-  J nlk) 


h 

2  mi 


1  h 

- *„/  (46) 

n  m 


At  large  distances  the  flux  must  be  zero,  so  that  k  must  be  chosen  to  begin  with 
equal  to  zero.  As  time  increases,  a  positive  k  indicates  a  negative  radial  electron 
flux,  moving  toward  the  nucleus,  while  a  negative  k  indicates  outward  flux. 


Equations  of  Motion  for  the  Orbital  Exponents 
For  the  Is  orbital  the  equations  of  motion  are 

mUs  =  |  h\2lsKls , 


(47) 


and 

^  ,,  x  2  ^ls(0  \  /  yj  Q  ^ 

m/c,s  =  -  hXis - Ais  ,  (4o) 

3  L  ao 

where  a0  is  the  hydrogen  Bohr  radius  and  the  effective  charge  Zls(0  is  given  by 

ZuV)  =  ZT+ZPFlt(\lBR),  (49) 

where 

FisW  =  ( 1  +  2x)  exp(-2x) .  (50) 
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One  should  keep  in  mind  that  what  we  mean  by  effective  charge  Zn[k(t)  is  not  the 
same  as  the  effective  nuclear  charge  in,  for  instance,  Slater-type  orbitals. 

In  a  similar  way  we  find  for  the  2s  orbital  the  following  equations  of  motion: 


m\2s  =  Y2  ftX2s*2s 


mk2s  =  ~  h\2\^A  ~ 


with  the  effective  charge  being 


Z2s{t)  ~  ZT  +  ZPF2s(\2sR) 


where 


/  x2  x3\ 

F2s{x)  =  (l  +X-y  +  yjexp(-x). 


For  the  2p*  orbitals  we  derive  the  equations  of  motion  from  the  average  Lagran- 
gian  L  =  (Lx  +  Ly  +  Lz)/ 3,  where  Lk  is  obtained  from  <f>2pki  to  obtain 


jq  ^^2p^2p 


mk2p  =  “  h\22p 


Z2p(t) 


where  the  effective  charge  Z2p  =  (Z2px  +  Z2py  +  Z2p2)/3  is  obtained  in  the  space- 
fixed  axes  from 

Z2pz(0  =  zr+  ZpZ-^A(\2vR)  +  ZP^^B(X2pR) ,  (57) 

Z2p(t)  =  ZT+  ZPB(\2pR) ,  (58) 


Xp(t)2  Zp(t)2 

Z2Px(t)  =  ZT+Zp-^A(\2pR)  +  Zp~^B(\2pR)  . 


Here  A{x)  and  B(x)  are  defined  by 


A(x)  =  +  ^  +  17  4-  5x  +  |  x2  +  ^x3)  exp(-x) ,  (60) 


U  x"  x 
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and 

B(x)  =  ^  +  —  +  7  +  x]  exp(—  x) ,  (61) 

X  \X  XX  J 

from  which  Z2p  =  Zr  +  ZPF2p(X2pR)  with  F2p  =  (A  +  2B)/3. 


Equations  of  Motion  for  the  Trajectories 

The  potential  gradients  to  be  used  in  the  Hamilton  equations  can  be  obtained 
by  differentiation  of  the  electronic  energies  written  as  functions  of  xP,  zP,  k,  and  X. 
We  show  here  the  results  for  the  Is  and  2s  orbitals.  For  these  orbitals  V  depends 
only  on  R  and  the  gradients  are  given  by 


dV  (dV\  idV  .  dV  . 

dR~[dR  )i  k  +  \  8k  K+  d\X 


R 


(62) 


with  the  time  derivatives  of  the  exponent  parameters  obtained  from  the  previous 
equations,  and  the  time  derivative  of  the  radius  R  obtained  from  the  Hamilton 
equations.  Keeping  the  exponent  parameters  fixed,  the  gradients  give  the  forces 
when  changes  in  the  effective  charges  are  ignored. 

Letting  Q  be  either  xP  or  zP,  we  obtain 


dVls\  _  _  2  Q 


=  ZPe2f-3\\-(l+2\lsR  +  2MsR*) 


Xexp(— 2Xj SR)  - ZPZT —3,  (63) 


— ?  =  Z^2  ^  1  -  1  +  \2sR  +  -\22sR2  +  ~  XtsR‘ 


Xexp (~\2sR)  ~ZPZTf 3.  (64) 


The  above  gradients  are  used  in  the  Hamilton  equations  to  give  the  trajectories. 


Numerical  Results 

Numerical  results  were  obtained  for  a  proton  colliding  with  a  hydrogen  atom  at 
rest,  at  a  range  of  projectile  velocities  and  impact  parameters  typical  of  slow  collisions 
between  ions  and  atoms.  Therefore,  ZP  =  Zp  =  1,  and  M  =  1836  a.u.  A  selection 
of  results  is  presented  in  what  follows.  In  these  examples,  the  velocities  of  the 
projectile  were  vP  =  0.01  and  0.02  a.u.,  corresponding  to  collision  energies  of  nearly 
2.5  and  10.0  eV. 

For  each  orbital  the  four  Hamiltonian  equations  for  a  trajectory  [Eqs.  (10)  and 
(11)],  coupled  to  the  two  equations  of  motion  for  the  orbital  exponents  [Eqs.  (47 ) 
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and  (48)  for  the  Is  orbital,  and  Eqs.  (51)  and  (52)  for  the  2s  orbital]  were  solved 
numerically  using  a  Runge-Kutta  fourth-order  algorithm  [19],  with  the  forces 
from  Eqs.  (63)  and  (64).  We  tested  the  algorithm  with  different  input  values  of 
time  steps  and  starting  distances,  and  the  numerical  results  are  consistent.  We  also 
obtained  the  same  results  using  a  predictor-corrector  algorithm  to  solve  the  equations 
of  motion. 

Figures  1  and  2  show  the  time  evolution  of  X  and  /c,  respectively,  for  the  1  s  orbital, 
at  the  two  velocities  vP  =  0.01  and  0.02  a.u.  and  impact  parameters  of  1.0,  2.0, 
3.0,  and  4.0  a.u.  Figures  3  and  4  show  the  projectile  trajectories  for  the  above 
velocities  and  impact  parameters  and  the  Vls  potential.  Figures  5  and  6  show  X  and 
k  results  for  the  2s  orbital  at  vP  =  0.02  a.u.,  and  impact  parameters  of  4.0,  5.0,  6.0, 
and  7.0  a.u.  Finally  Figure  7  gives  the  trajectories  for  a  target  2s  orbital  at  vP  - 
0.02  au  and  the  mentioned  impact  parameters. 

The  two  sets  of  peaks  in  Figure  1  correspond  to  the  two  velocities;  starting  at  the 
same  distance  along  the  z-axis,  the  interaction  occurs  sooner  at  the  larger  projectile 
velocity,  so  that  the  peaks  to  the  left  are  for  the  larger  velocity,  vP  =  0.02  a.u.,  and 
the  ones  to  the  right  for  the  smaller  value  of  0.0 1  a.u.  The  real  parts  of  the  exponents 
are  found  to  increase  over  time,  reflecting  the  larger  charge  seen  by  the  electron  as 
the  projectile  approaches.  The  effect  is  quite  large,  with  higher  peaks  for  the  larger 
velocity  and  smaller  impact  parameters,  and  wider  peaks  for  the  lower  velocity. 
This  reflects  the  longer  interval  during  which  the  orbital  is  distorted  at  the  smaller 


Figure  1 .  Real  part  X  of  the  orbital  exponent  for  a  proton  colliding  with  a  hydrogen 
atom,  for  the  Is  orbital,  vs.  time,  in  atomic  units,  at  projectile  velocities  vP  =  0.01  a.u. 
(peaks  at  right)  and  0.02  a.u.  (peaks  at  left)  and  impact  parameters  of  1.0,  2.0,  3.0,  and 
4.0  a.u.,  with  the  Vls  potential. 


DA  COSTA  AND  MICHA 


Figure  7.  Projectile  trajectories  for  the  conditions  of  Figure  5,  in  the  ( x,z )  plane. 
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velocity.  The  imaginary  parts  of  the  exponents,  in  Figure  2,  change  sign,  first  in¬ 
creasing  to  positive  values  and  then  taking  negative  values.  This  shows  that  the 
electron  flux  is  inwards  as  the  charge  approaches,  and  outwards  as  it  departs.  Again 
peaks  are  higher  for  the  larger  velocity  and  wider  for  the  smaller  velocity.  The 
numerical  change  is  much  smaller  than  for  the  real  part,  but  is  clearly  present. 

The  trajectories  are  quite  different  for  the  several  impact  parameters  at  a  given 
velocity,  and  show  large  deflections  at  small  impact  parameters.  The  ones  with 
large  scattering  angles  involve  the  more  pronounced  changes  in  the  effective  charge 
and  the  electron  flux.  The  changes  are  much  smaller  when  the  deflection  is  small, 
and  scattering  is  nearly  forward. 

The  situation  is  similar  for  the  2s  orbital.  Here  the  deflection  angles  are  smaller 
because  the  impact  parameters  are  larger.  However,  the  changes  in  effective  charge 
and  flux  are  even  larger  than  for  the  Is  orbital,  because  the  2s  orbital  is  more  spread 
out  and  therefore  more  sensitive  to  the  presence  of  the  projectile  charge.  The  con¬ 
clusion  is  that  effective  charges  are  important  even  at  low  scattering  angles  for 
higher  excited  orbitals. 


Conclusion 

Combining  the  eikonal  approximation  with  a  time-dependent  variational  deri¬ 
vation  of  the  differential  equations  for  the  parameters  of  the  electronic  wavefunc- 
tions,  it  has  been  possible  to  describe  the  time  evolution  of  an  atomic  orbital  distorted 
by  a  moving  charge.  This  depends  not  only  on  the  distance  between  nuclei,  which 
is  a  conformational  effect,  but  also  depends  on  impact  parameters  and  velocities. 
Therefore,  our  treatment  provides  insight  which  goes  beyond  what  we  know  from 
electronic  structure  calculations  for  molecules. 

Generally,  we  have  found  it  necessary  to  work  with  complex  electronic  wave- 
functions,  to  properly  describe  electronic  flux  effects.  The  presence  of  an  imaginary 
part  in  orbital  exponents  means  that  orbitals  acquire  an  oscillatory  nature  in  time 
when  a  projectile  charge  is  present  at  short  distances.  Expectation  values  of  properties 
show  also  a  time  dependence;  in  the  case  of  the  electronic  flux,  this  turns  out  to 
change  sign  with  time  as  the  projectile  approaches  and  then  departs:  flux  goes  in 
the  incoming  part  of  the  trajectory,  and  out  in  the  outgoing  part. 

Results  for  the  evolution  of  the  real  and  imaginary  components  of  the  complex 
parameters  of  Is  and  2s  orbitals  show  large  variations  of  the  real  parts  over  time 
with  respect  to  asymptotic  values,  and  that  imaginary  parts  remain  small. 

The  real  part  of  the  atomic  orbitals  change  extensively  with  time,  with  the  change 
being  larger  for  higher  velocities  and  low  impact  parameters.  Variations  are  more 
pronounced  for  the  excited  atomic  orbital. 
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Abstract 

We  have  carried  out  quantum-mechanical  studies  of  infrared  multiphoton  excitation  and  dissociation 
of  diatomic  molecules  modeled  by  Morse  oscillators.  Two  different  aspects  of  such  processes  are  inves¬ 
tigated:  one  concerns  quantum  dynamics  near  a  nonlinear  resonance  induced  by  an  external  field  and 
calculations  of  the  associated  quasienergies  and  quasi-eigen  functions;  and  the  other  addresses  the  effects 
of  dipole  moment  functions  on  the  above-threshold  dissociation  spectra,  obtained  by  solving  the  time- 
dependent  Schrodinger  equation  directly  using  the  momentum-space  Fourier-Grid  method.  ©  1994  John 
Wiley  &  Sons,  Inc. 


Introduction 

Morse  oscillators  have  often  been  studied  as  a  model  of  molecular  photodisso¬ 
ciation,  for  its  simplicity  and  availability  of  exact  quantum  solutions  [1].  In  this 
article  we  investigate  two  quite  different  aspects  of  its  quantum  solutions:  In  the 
first  part  we  apply  a  recent  approach  of  Holthaus  [2]  to  analyze  the  quantum 
dynamics  near  a  nonlinear  resonance  of  a  driven  molecule,  a  subject  related  to 
quantum  Kolmogorov-Arnold-Moser  (kam)  theorem  [3].  For  a  driven  system, 
Holthaus  procedure  involves  a  local  quantization  of  the  unperturbed  Hamiltonian 
in  action  space  near  a  resonant  level  and  arriving  at  a  Schrodinger  equation,  which 
is  shown  to  be  equivalent  to  a  Mathieu  equation.  Applying  the  Floquet  theorem 
to  the  Mathieu  equation,  Holthaus  shows  that  for  1:1  and  1:2  resonances  quasi¬ 
energies  and  wave  functions  are  easily  calculable,  being  related  to  the  well-known 
characteristic  values  and  Mathieu  functions.  The  results  indicate  that  the  effect  of 
a  sinusoidal  field  is  to  rearrange  the  energy  levels,  similar  to  the  reordering  of  levels 
caused  by  a  DC  field,  that  is,  the  Stark  or  Zeeman  effects.  His  theory  has  been 
applied  to  the  driven  systems  of  a  particle  in  a  box  and  a  particle  in  a  triangular 
well,  yielding  very  interesting  results.  For  instance,  tunneling  between  invariant 
manifolds  and  scarring  of  Floquet  states  by  classical  periodic  orbits  can  be  studied 
analytically. 

In  this  article  we  present  some  preliminary  results  of  applying  Holthaus  theory 
to  the  photo-excitation  of  an  NO  molecule,  modeled  by  a  driven  Morse  oscillator. 
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This  system,  being  noncompact,  is  more  complicated  than  the  above  two  systems. 
Details  of  the  theory,  applied  to  a  Morse  system,  will  be  given  in  the  next  section 
and  results  presented  in  the  third  section. 

In  the  second  part  of  the  article  we  continue  the  investigation  of  the  molecular 
photodissociation  process  by  solving  the  Schrodinger  equation  in  the  momentum 
space  [4]  and  probe  the  roles  of  the  dipole  moment  function  in  the  above-threshold- 
dissociation  (atd)  spectra  of  molecules.  The  numerical  method  that  we  use  will 
be  described  and  results  presented. 

Quantum  Theory  of  Nonlinear  Resonance 

The  Hamiltonian  of  a  Morse  oscillator  with  an  effective  dipole  moment  function 
#efr/u(r)  in  the  presence  of  an  electromagnetic  field  can  be  written  as 

2 

H  =  f  -  +  D(  1  -  -  q,nE(t)n(r) .  ( 1 ) 

2m 

Using  the  dimensionless  variables  defined  in  Ref.  [  5  ] ,  we  can  rewrite  the  Hamil¬ 
tonian  in  the  following  form 

=  H0  —  fi(x)  cos  12/  (2) 


Ho  =  ~~  + 


(1  -e~xy 


where  x  =  a(r  —  r0),  px  is  the  momentum  conjugate  to  x,  /  is  the  action  variable 
of  the  Morse  oscillator,  and  is  the  optical  frequency.  The  effective  field  amplitude 
function,  A(t)  =  (q^E(t)/ a&D),  will  be  assumed  constant  in  the  first  part  of  our 
studies  and  be  given  a  sine-squared  pulse  shape  in  the  second  part. 

Both  the  coordinate  and  momentum  variables  can  be  expressed  in  terms  of  the 
action-angle  variables,  (1,0).  In  particular,  the  coordinate  expression  is  given  by 

,r„,  ,  /I  +  ^2E  cos  6\ 

*</.»)  -  '■(  _  2E  -)  ■  (D 

which  is  periodic  in  8  and  can  thus  be  expanded  into  a  Fourier  cosine  series  given 
below: 


x(I,0)  =  2  W)  cos  nd  , 


where  fn(I)  is  given  by 


foil)  =  In 


1  +  Vl  -  2 E 
2(1  -  2 E) 


and 
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In  the  above  equations,  E  =  H0(I)  is  the  oscillator  energy.  We  then  expand  the 
dipole  moment  function  about  the  equilibrium  interatomic  distance  and  keep  only 
the  linear  term  for  the  nonlinear  resonance  analysis.  By  substituting  the  Fourier 
series  of  x  in  the  total  Hamiltonian  we  obtain 


M  =  E-  2  /„(/)[cos(«0  -  Qt)  +  cos  (nd  +  fl/)] .  (8) 

^  n= 0 


Chirikov’s  resonant  approximation  consists  of  keeping  the  Mh  term  which  satisfies 
the  condition:  Nd  ~  Qt  and  dropping  all  fast  varying  terms  in  the  above  series.  The 
Hamiltonian  around  a  1 :7V  resonance  then  becomes 

JT(IM  =  E(In)  +  0>(In)AI  -  COS (N6  -  Qt) .  (9) 

where  we  have  also  expanded  E  about  the  resonant  action,  IN,  and  the  oscillator 
frequency  at  resonance,  a)(IN ),  is  defined  by 

«(/»)- (10) 

We  note  that  the  expansion  of  E  to  the  second  order  in  A I  is  exact  for  a  Morse 
oscillator. 

Following  Holthaus  we  quantize  the  system  at  this  stage  by  replacing  A I  by 
—iheff(d/d6)  and  obtain  a  time-dependent  Schrodinger  equation,  which  can  be  re¬ 
duced  into  a  Mathieu  equation: 

u"(z)  +  [a  —  2q  cos(2z)]  u(z)  =  0  (11) 

under  the  transformation  2 z  =  NO  —  Ht  and 

Ud,t)  =  fo9t)  =  u(z)e~im/h*« .  ( 12) 

In  the  Mathieu  equation,  a  and  q  are  defined  by 


3TT  IE(In)  ~  W) 


q  ~  n^i/nUn) 


where  is  the  effective  Planck  constant  associated  with  the  dimensionless  variables. 
In  the  case  of  a  1 : 1  resonance,  on  which  we  shall  focus  in  the  next  section,  solutions 
of  the  Mathieu  eigenvalue  equation  are  given  by  the  7r-periodic  Mathieu  functions 
and  the  allowed  eigenvalues  a  correspond  to  the  characteristic  values  a2n(q ), 
M<7)[2,6]. 
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Nonlinear  Resonance  in  no 

In  this  section  we  apply  the  theory  of  the  last  section  to  the  analysis  of  a  1:1 
resonance  in  an  NO  molecule  and  assume  that  the  photon  energy  is  such  that  the 
1 1th  (n  =  10)  vibrational  level  is  in  resonance  with  the  field.  NO  has  55  vibrational 
states  in  the  ground  electronic  state  such  that  fteff  is  about  j$,  or  more  precisely,  the 
inverse  of  55.04.  /j  is  then  equal  to  ( n  +  \  )ftefr  =  0.1908,  thus  w(/i)  =  fl  =  1  — 
/j  =  0.8092  or  1 540.9  cm-1 .  For  this  photon  field  we  present  in  Figure  1  the  phase 
portrait  on  the  Poincare  surface  of  section  at  A  -  0.020,  where  we  see  a  large  1:1 
primary  resonance  zone  centered  close  to  l\  in  the  figure.  We  estimate  from  the 
area  in  phase  space  that  this  resonance  zone  can  support  about  7  bound  states.  The 
1:2  and  1:3  primary  resonance  zones  are  also  clearly  visible  in  Figure  1. 

We  have  calculated  the  quasienergy  spectrum  of  the  Floquet  states  which  are 
quantized  around  the  1 : 1  resonance  region.  The  quasienergies  of  the  lowest  1 3 
states  are  plotted  in  Figure  2  as  a  function  of  the  field  amplitude  A .  It  is  interesting 
to  see  that  degeneracy  at  low  fields  (presumably  brought  about  by  the  resonant 
photon  energy)  is  lifted  up  as  the  perturbed  field  strength  increases  and  at  higher 
fields  the  quasienergy  levels  become  almost  regularly  spaced.  On  the  other  hand, 
the  results  seem  to  indicate  that  for  a  strongly  bound  molecule  like  NO  (D  —  0.239 
a.u.)  one  needs  a  much  more  intense  field  than  A  =  0.05  to  perturb  effectively  the 
10th  states  of  the  NO  molecule. 

We  plot  in  Figure  3  the  probability  density,  \\p(x,  /)  |2,  of  the  a0(q)  and  b2(q) 
Floquet  states  at  A  =  0.02  in  xt-space.  These  two  states  correspond  to  the  two 


6 

Figure  1.  Poincare  section  plot  of  30  classical  trajectories  for  the  driven  Morse  oscillator 
at  A  =  0.02  and  =  0.8092  in  action-angle  space. 
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topmost  levels  in  Figure  2.  Figure  3  shows  that  the  quantum  wave  functions  are 
clearly  influenced  by  the  vortex  tubes  [  7  ]  surrounding  the  classical  periodic  orbits. 
Calculations  aiming  at  comparing  the  above  results  with  the  exact  quantum  solutions 
using  the  method  described  in  later  sections  are  in  progress.  Related  to  the  com¬ 
parison,  we  note  that  since  the  system  is  noncompact,  the  quasienergies  are  complex 
numbers.  The  quasienergy  spectrum  shown  in  Figure  2  represents  only  the  real 
parts  of  the  quasienergies  and  is  meaningful  when  the  imaginary  parts  are  negli¬ 
gibly  small. 

Staggered  Time  Discretization  Algorithm 

In  the  second  part  of  the  article — that  is,  the  rest  of  the  sections — we  concentrate 
on  the  atd  spectra  of  the  HF  molecule  and  see  how  they  are  influenced  by  two 
different  types  of  dipole  moment  functions.  This  has  been  done  by  solving  directly 
the  time-dependent  Schrodinger  equation 

<9i h 

ih  ~  =  H(t)yp  .  (15) 

First,  let  us  decompose  the  state  into  real  and  imaginary  parts,  namely,  \p  =  \ pr  + 
i\pi .  Considering  the  staggered  time  discretization  tn  =  nAt  up  to  the  second  order, 
we  obtain 


Figure  3.  Contour  plots  of  probability  density  distributions  of  Roquet  states  around  the 
1:1  resonance  zone  of  no  at  A  =  0.02  and  fl  =  0.8092.  (a)  The  a0  state  (b)  The  b2  state. 
Time  t  is  scaled  by  the  optical  period,  T. 
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4+1/2)  -  'I'ritn- 1/2)  +  HktTpi^tn) 

Mtn+ 1)  =  *,(*,)  -  Hktypr(tn+\/2)  ■  (16) 

For  a  real  Hamiltonian,  we  rewrite  by  setting  h  =  HAt  the  above  equations  in 
matrix  form 

Un+l  =  (_/2  Y-h^)Un~AUn 

un  =  Anu0  (18) 


where  un  —  (\br(4_1/2),  ^  (/»)).  The  stability  criterion  is  provided  by  the  eigenvalues 
of  A ,  given  by 


(h2  —  2)  ±  V(/z2-  2): 


from  which  we  obtain  |  X  |  =  1  when  A?  <  2/ H .  The  step  size  is  determined  by  the 
energy  eigenvalue  of  largest  absolute  value.  For  a  time-dependent  Hamiltonian,  we 
need  to  find  max,[eigval(//)]  [8].  Once  the  stability  condition  has  been  satisfied, 
the  staggered  leap-frog  time  discretization  is  much  faster  than  the  Burlisch-Stoer 
algorithm  used  in  a  previous  study  [4]. 

For  practical  applications  of  the  staggered  leap-frog  algorithm,  we  often  have 
to  modify  the  physical  Hamiltonian  as  done  in  the  driven  hf  molecule,  de¬ 
scribed  below. 


Modified  Morse  Hamiltonian 

We  investigate  in  the  following  sections  the  multiphoton  dissociation  dynamics 
of  an  HF  molecule  driven  by  a  laser  pulse  with  the  shape  function  E(t)  [in  the 
Hamiltonian  ( 1 )]  given  by 


E(t)  =  Em  sin(art)  •  sin2 


(20) 


The  molecular  constants  ofHFare:  D  =  0.225,  r0  =  1.7329,  <2  =  1.1741,  #eff  =  0.31 
and  m  =  1744.7  (given  in  atomic  units).  In  this  article  we  compare  the  difference 
in  dynamics  of  two  different  types  of  dipole  moment  functions;  in  one  case  the 
dipole  function  is  approximated  by 

li(r)  =  (x+  a)  exp  ,  (21) 

where  a  —  2,  b  =  2  will  be  used  later.  This  form,  introduced  by  Heather  and  Metiu 
[9]  and  used  by  Lu  et  al.  [10],  mimics  qualitatively  the  ab  initio  and  observed 
dipole  function  of  hf.  In  the  second  case,  we  use  the  simplest  possible  form  in 
momentum  space,  that  is, 

n(p)  =  A-p,  (22) 


where  A  is  the  vector  potential  of  the  field. 
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An  efficient  and  accurate  way  to  treat  the  continuum  in  a  photodissociation 
problem  is  to  solve  the  time-dependent  Schrodinger  equation  directly  in  momentum 
space  as  illustrated  in  Ref.  [4].  To  employ  the  staggered  algorithm  in  momentum 
space,  we  artificially  add  a  symmetric  part  of  the  potential  and  n(  r)  in  the  negative 
r  region.  Without  external  field,  the  Hamiltonian  becomes  parity  symmetric.  For 
each  eigenvalue,  we  now  have  doubly  degenerate  eigenfunctions  with  even-  and 
odd-parity.  We  note  that  the  added  negative  r  part  of  the  potential  will  not  affect 
the  physics  of  the  problem,  because  the  potential  barrier  at  r  =  0  is  very  high  relative 
to  the  dissociation  energy,  the  set  of  odd-parity  eigenfunctions  corresponds  to  the 
pseudo-complete  set  of  the  physical  Morse  potential.  We  verify  this  point  by  noting 
that  the  odd-parity  eigenfunctions  obtained  here  are  in  exact  agreement  with  our 
previous  results  [4],  where  no  extension  of  the  potential  was  made. 

Momentum-Space  Fourier-Grid  Hamiltonian  Method 

We  solve  the  time-dependent  Schrodinger  Eq.  ( 16)  using  the  momentum-space 
Fourier-Grid  Hamiltonian  (/?-FGH)  formalism  as  introduced  in  Ref.  [4].  With  the 
present  extention  of  symmetric  Hamiltonian,  the  Schrodinger  equation  reads  as: 

ih  ^  M)  =  +  f  V(k-  k')t(k’)  dk' 

at  2m  J 

-  gtfrE(t)  J  n{k  -  k')yp{k')  dk' ,  (23) 

where 

i  r+oo 

V(k  -  k')  =  —  v(r)e-i{k-k')r  dr,  (24) 

Ztt  J— co 

and 

i  p+oo 

!i{k  -  kr)  =  —  fi(r)e~i{k~k')r  dr.  (25) 

27T  J —co 

With  symmetrized  V(r)  and  ju(r),  their  Fourier  transforms  are  all  real.  In  our 
calculation,  480  grid  points  are  used  in  an  evenly  spaced  grid  and  the  range  of  x  is 
16  a.u.  and  of  momentum  k  is  —94.2  to  +94.2  a.u.  Diagonalization  of  the  unper¬ 
turbed  Schodinger  equation  gives  24  bound  levels  and  216  discretized  pseudocon¬ 
tinuum  levels.  Each  level  is  doubly  degenerate  and  the  physical  basis  is  the  set  of 
240  odd-parity  eigenfunctions.  The  reliability  of  the  time-dependent  p-FGU  cal¬ 
culation  has  been  established  in  [4]  on  the  transition  probabilities  from  the  ground 
state  to  excited  states  |  n  >  of  a  harmonic  oscillator  and  will  not  be  repeated  here. 
We  integrate  Eq.  (24)  by  p-FGH  with  the  staggered  time  discretization  algorithm 
already  discussed. 


Results  on  ATD 

We  study  the  atd  spectrum  of  an  hf  molecule  initially  prepared  in  the  14th 
vibrational  state  driven  by  a  laser  field  of  frequency  12  =  0.0425  ( 1074  nm).  The 
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photon  frequency  is  such  that  absorption  of  a  single  photon  will  reach  beyond  the 
dissociation  limit  and  the  range  of  momentum  grid  used  permits  a  maximal  ab¬ 
sorption  of  nearly  60  photons.  In  our  calculations  we  use  a  pulse  duration  of  100 
optical  cycles  and  a  time  step  of  0.1  a.u.  which  fulfills  the  stability  criterion.  Fur¬ 
thermore,  to  prevent  alias  errors  from  the  grid  boundaries,  a  filter  function  f(k)  = 
( 1  +  exp(  -fli  |  k  -  b\  |  ))_1  with  ax  =  1 .25,  b\  =  85  is  multiplied  to  the  wave  function 
at  each  time  step. 

We  calculate  the  multiphoton  dissociation  spectra  of  a0  =  0.025  (about  100 
terawatt/cm2),  where 

a0  =  Emlmtt2.  (26) 

Figure  4  shows  an  example  of  such  a  calculation  in  which  we  plot  the  projection 
of  the  wave  function  at  the  end  of  a  laser  pulse  onto  unperturbed  eigenstates,  that 
is,  we  plot  the  population  of  each  level  versus  the  eigenenergy.  The  peak  at  4.3  htl 
corresponds  to  the  initial  14th  vibrational  state.  The  higher-energy  peaks  evenly 
spaced  in  htt  represent  the  contributions  from  the  above-threshold-dissociation 
process.  We  have  superposed  results  from  the  two  different  dipole  moment  functions 
[Eqs.  (21)  and  (22)]  in  Figure  4.  It  makes  clear  that  while  dominant  features  are 
essentially  the  same  between  contributions  of  these  two  types  of  dipole  functions, 


Figure  4.  Projection  of  the  final  wave  function  on  the  unperturbed  hf  eigenstates  for 
a0  =  0.025.  hf  is  initially  prepared  in  the  14th  state  with  eigenenergy  4.3  htt.  The  stars 
mark  results  at  eigenenergies,  which  are  connected  by  straight  lines  to  guide  the  eyes. 
Starred  curve  corresponds  to  the  results  of  the  exponential  dipole  moment  function,  and 
the  unstarred  curve  denotes  results  of  A  •  p  function. 
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Figure  5.  Husimi  distribution  of  the  hf  wave  function  at  80 T  for  a0  =  0.025.  hf  is 
initially  prepared  in  the  14th  state  with  eigenenergy  4.3  ft  ft.  The  pulse  duration  is  100T. 
(a)  Results  for  the  exponential  dipole  function  and  (b)  for  the  A  -p  function. 


there  are  distinct  differences.  Clearly  the  A  •  p  function  causes  more  rapid  transitions 
between  eigenstates  than  the  xe~x  function  (called  the  exponential  function  below), 
thus  higher-order  ATD  peaks  are  more  pronounced  for  the  A  •  p  function.  We  can 
examine  the  situation  from  another  point  of  view. 
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P 


Figure  6.  Momentum-space  density  distribution  of  hf  at  80T  for  a0  =  0.025.  HF  is 
initially  prepared  in  the  14th  state  with  eigenenergy  4.3  hQ..  The  pulse  duration  is  100T. 
(a)  Results  for  the  exponential  dipole  function  and  (b)  for  the  A  -p  function. 


In  Figure  5  we  plot  the  Husimi  distributions  of  the  momentum  wave  functions 
at  /  =  80  optical  periods  for  these  two  cases.  Besides  other  details,  they  show  again 
that  the  A  •  p  function  seems  to  spread  out  the  probability  density  more  quickly 
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towards  dissociation.  This,  however,  does  not  correspond  to  wider  momentum 
distributions,  just  some  more  pronounced  peaks  at  low  momenta,  as  revealed  in 
Figure  6. 

These  figures  also  illustrate  the  power  of  the  momentum-space  method  in  solving 
problems  involving  a  continuum.  Our  calculation  reveals  that  the  dissociation 
channel  mainly  corresponds  to  the  low  momentum  part  of  the  wave  packet  and 
the  momentum  distribution  is  usually  confined  throughout  the  entire  dissociation 
process.  In  contrast,  a  wave  packet  in  the  coordinate  space  will  quickly  spread  out 
when  it  reaches  continuum  to  cause  difficulties  in  the  wave  propagation  method. 
Finally,  we  note  that  the  Husimi  and  momentum  distributions  for  these  two  cases 
appear  to  eyes  to  be  very  similar  for  the  short-time  evolution  of  the  pulse.  It  is  only 
when  it  is  close  to  the  end  of  the  pulse  that  the  differences  become  more  apparent. 

Our  results  also  show  that  the  higher-order  atd  peaks  grow  with  laser  intensity, 
which  is  characteristic  of  atomic  above-threshold-ionization  (ati)  as  well.  But  the 
ATD  structure  is  basically  different  from  ATI,  because  the  dissociation  fragments 
are  neutral  atoms,  which  can  not  be  driven  to  oscillations  by  the  laser  field  as  easily 
as  atomic  electrons.  The  important  ponderomotive  shift  which  causes  the  suppres¬ 
sion  of  lower  ati  peaks  is  therefore  absent  in  the  ATD  spectra. 

As  mentioned  at  the  end  of  the  third  section  that  work  is  in  progress  in  applying 
the  p-FGH  method  to  the  no  molecule  so  that  we  can  compare  results  with  those 
from  the  Mathieu  equation  approach. 
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Abstract 

The  linearized  atomic-cell  orbital  (laco)  method  for  local  density  functional  calculations  has  been 
implemented  in  a  program  package  for  energy  bands  and  valence  shell  energies  of  regular  periodic  solids. 
This  method  has  been  used  to  compute  spin-dependent  electrical  resistivity  due  to  scattering  by  displaced 
interface  atoms  in  a  layered  CuCo  superlattice.  The  magnetoresistance  ratio  AR / R  (f  f )  obtained  for  this 
scattering  mechanism  is  25.03.  When  the  computed  interface  resistivity  is  weighted  by  estimated  inter¬ 
penetration  concentration  and  combined  with  bulk  resistivity,  A R/R  reduces  to  the  expected  experimental 
magnitude  for  adjacent  CuCoCu  layers.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Multiple  scattering  theory  (MST)  is  used  in  the  energy-band  formalism  of  Kor- 
ringa,  Kohn,  and  Rostoker  (kkr)  [1,2]  to  construct  an  exact  solution  of  the 
Schrodinger  equation  for  a  spatially  periodic  muffin-tin  potential,  which  is  constant 
in  the  interstitial  region  between  nonoverlapping  spheres.  The  great  advantage  of 
this  KKR  formalism  is  that  the  wave  function  is  constructed  from  functions  com¬ 
puted  separately  within  each  atomic  sphere,  allowing  the  use  of  exact  local  solutions 
of  the  Schrodinger  equation  at  a  given  energy  as  basis  functions  for  the  full  many- 
atom  problem.  This  localization  has  been  exploited  most  fully  in  the  linearized 
muffin-tin  orbital  (lmto)  method  of  Andersen  and  collaborators  [3].  The  lmto 
method  uses  multiple  scattering  theory  to  fit  periodic  boundary  conditions  by  form¬ 
ing  linear  combinations  of  basis  functions  on  all  atoms.  The  mto  basis  functions 
are  evaluated  at  a  set  of  specified  energy  values.  Interpolation  in  this  energy  grid  is 
achieved  by  using  these  basis  functions  in  the  variational  principle  for  the 
Schrodinger  or  Dirac  equation.  One-electron  energies  are  obtained  as  eigenvalues 
of  an  effective  Hamiltonian  matrix  for  each  point  of  different  translational  symmetry 
in  the  Brillouin  zone  of  a  periodic  solid. 

Generalization  of  multiple-scattering  formalism  to  space-filling  local  potentials, 
not  restricted  to  muffin-tin  form,  is  important,  because  such  potentials  are  needed 
in  solids  and  molecules  to  describe  covalent  bonding,  partial  charge  transfer,  and 
locally  nonspherical  fields.  Several  different,  and  to  some  extent  conflicting,  deri¬ 
vations  of  generalized  kkr  equations  have  been  published,  all  based  on  the  integral- 
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equation  formalism  used  originally  by  KKR.  The  derivation  of  Williams  and  Morgan 
[  4  ]  can  be  taken  as  a  point  of  reference.  The  laco  method  is  a  form  of  full-potential 
multiple  scattering  theory  in  which  the  standard  Green-function  formalism  is  con¬ 
verted  to  expressions  involving  Wronskian  surface  integrals  over  the  interfaces  be¬ 
tween  adjacent  space-filling  atomic  cells.  Since  the  LACO  formalism  is  derived  as 
an  extension  of  classical  potential  theory  to  the  Schrodinger  equation,  certain  formal 
results  required  for  the  validity  of  mst  can  be  proven  in  a  more  direct  way  than  is 
possible  in  the  integral-equation  approach  [  5  ] . 

Model  calculations  [6]  have  shown  that  the  suspected  formal  failure  of  mst  for 
space-filling  cells  in  fact  reduces  to  the  very  practical  issue  of  convergence  of  com¬ 
putational  methods  in  which  a  variational  trial  function  cannot  be  made  exactly 
continuous  with  continuous  normal  derivative  across  an  interface  between  adjacent 
cells.  The  LACO  method  replaces  the  usual  Rayleigh-Schrodinger  variational  prin¬ 
ciple  by  that  of  Schlosser  and  Marcus  [  7  ] ,  which  explicitly  includes  a  correction 
term  for  mismatch  across  an  interface.  LACO  differs  from  the  variational  cellular 
method  (  vcm  )  of  Leite  and  collaborators  [  8  ] ,  based  on  this  Schlosser-Marcus  ( SM ) 
variational  principle,  by  organizing  the  computations  in  analogy  to  the  lmto 
method.  If  atomic-cell  orbitals  (acos)  are  defined  as  the  extension  of  local  basis 
functions  across  cell  boundaries  by  fitting  to  energy-derivative  functions  in  other 
cells,  the  correct  linear  combinations  of  these  functions  to  form  periodic  Bloch 
waves  are  determined  by  structure  constants  of  standard  LMTO  form.  At  each  point 
in  the  reduced  Brillouin  zone,  these  ACO  basis  functions  are  used  in  the  SM  variational 
principle.  The  usual  Hamiltonian  matrix  elements  are  modified  by  surface-integral 
terms  due  to  mismatch  of  the  trial  wave  function.  These  Schlosser-Marcus  or  VCM 
terms  provide  a  closure  correction  for  internal  sums  that  occur  in  the  usual  for¬ 
mulation  of  MST  [9] .  Inclusion  of  these  terms  extends  the  accuracy  of  earlier  LACO 
calculations,  reported  in  self-consistent  energy-band  calculations  on  fee  Cu  [10]. 

Giant  Magnetoresistance 

Giant  magnetoresistance  (GMR)  is  observed  in  layered  Fe/Cr  [11],  and  in  other 
materials  in  which  magnetic  layers  are  separated  by  a  nonmagnetic  spacer  metal 
[  1 2, 1 3  ].  If  the  magnetization  of  successive  magnetic  layers  is  originally  not  parallel, 
electrical  resistivity  is  greatly  reduced  when  parallel  alignment  is  forced  by  an  applied 
magnetic  field.  Application  of  GMR  to  magnetic  recording  devices  may  greatly  in¬ 
crease  data  densities  compared  with  existing  technology.  The  present  work  is  in¬ 
tended  to  clarify  the  underlying  mechanism  of  GMR. 

GMR  has  been  studied  [14]  by  a  spin-dependent  generalization  of  the  Fuchs- 
Sondheimer  theory  of  thin  film  conductivity  [15].  Hood  and  Falicov  [14]  have 
recently  extended  the  theory  to  include  potential  wells  that  depend  on  spin  align¬ 
ment,  with  a  more  complete  description  of  spin-dependent  interface  scattering. 
This  parameterized  theory  is  in  reasonable  accord  with  experiment,  except  that  use 
of  experimental  mean  free  paths  for  bulk  scattering  tends  to  underestimate  the 
magnetoresistance.  The  Fuchs-Sondheimer  theory  treats  interface  scattering  dif¬ 
ferently  from  bulk  scattering.  Several  authors  [16-18]  have  used  quantum  theory 
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based  on  the  Kubo  formalism,  gmr  occurs  when  the  mean  free  path  in  a  magnetized 
material  is  strongly  spin-dependent,  primarily  due  to  spin-dependent  interface  scat¬ 
tering,  but  also  from  spin-dependent  bulk  scattering.  All  of  these  prior  theoretical 
studies  depend  on  parameterization  and  on  simplified  models  of  the  electronic 
energy  band  structure. 

The  present  work  reports  first-principles  calculations  of  the  electronic  structure 
and  electrical  conductivity  of  a  superlattice  model  of  spin-polarized  CuCo.  The 
translational  unit  cell  is  (CuCoCu)(CuCoCu)  on  a  001  tetragonal  lattice  whose 
atoms  have  the  geometry  of  fee  Cu.  Transport  theory  is  simplified  here  in  several 
ways  in  order  to  make  calculations  tractable,  but  consequences  of  the  spin-dependent 
energy  band  structure  are  worked  out  in  detail.  The  essential  approximations  are 
that  the  interface  penetration  scattering  mechanism  is  treated  in  the  weak-scattering 
limit  of  isolated  impurities,  and  scattering-in  terms  (vertex  corrections  in  Kubo 
theory)  are  neglected  in  the  Boltzmann  equation.  This  latter  approximation  was 
used  by  Butler  and  Stocks  in  calculations  on  substitutional  alloys,  later  extended 
using  Kubo  theory  by  Butler  [19].  The  approximation  appears  to  be  justified  for 
transition  elements  when  s,d-to-s,d  transitions  are  important,  and  has  recently  been 
used  for  gmr  calculations  [20] .  The  formalism  developed  by  these  authors  can  be 
applied  to  extend  the  present  calculations  to  fully  quantitative  results,  but  this  was 
not  considered  essential  for  the  present  exploration  of  basic  scattering  mechanisms. 
Calculation  of  interface  scattering  beyond  the  present  weak-scattering  limit  requires 
treating  interpenetrating  atomic  layers  as  a  binary  alloy. 

If  there  is  no  thermal  gradient,  and  if  mean  free  paths  are  larger  than  the  trans¬ 
lational  unit  cell,  the  Boltzmann  equation  for  a  regular  periodic  solid  requires  only 
Fermi  surface  data  and  state-to-state  transition  probabilities  for  dissipative  scattering 
mechanisms.  All  required  quantities  relevant  to  the  Fermi  surface  are  obtained 
here  from  first-principles  calculations,  as  are  transition  probabilities  for  scattering 
due  to  random  interpenetration  of  adjacent  Co  and  Cu  layers.  Bulk  scattering  is 
treated  much  less  specifically  by  using  empirical  values  of  the  mean  free  path  for 
residual  resistivity  in  pure  Co  and  Cu. 

In  multiple-scattering  theory,  as  used  here  in  the  LACO  (linearized  atomic-cell 
orbital)  method  [9],  energy  bands  e(k,s)  are  determined  by  the  secular  equation 
det(7  -  tg)  =  0.  Here  /(e)  is  a  site-diagonal  atomic-cell  scattering  matrix  and  g( k) 
is  an  energy-independent  matrix  of  structure  constants.  The  energy-band  structures 
of  magnetic  and  spacer  species  in  typical  gmr  materials  match  closely  at  the  Fermi 
energy  for  one  spin  direction,  while  there  is  substantial  mismatch  for  the  opposite 
spin  [21] .  The  /-matrices  of  the  two  species  must  have  a  similar  relationship,  since 
energy  bands  in  a  fixed  space  lattice  are  determined  entirely  by  the  /-matrices.  If 
atoms  are  interchanged  due  to  interpenetration  at  a  spacer  interface,  the  difference 
of  /-matrices  must  be  small  for  one  spin  direction  and  large  for  the  other,  implying 
strong  dependence  of  scattering  on  spin.  The  transition  amplitude  for  this  scattering 
mechanism  is  determined  in  the  weak-scattering  limit  by  the  impurity  scattering 
matrix  At,  the  difference  between  the  /-matrices  of  the  interchanged  species,  eval¬ 
uated  at  the  Fermi  energy. 
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Comparison  of  Co  and  Cu  densities  of  states  indicates  that  matrix  elements  At 
should  be  small  for  majority-spin  electrons  and  large  for  minority-spin  electrons, 
in  layered  CuCo  with  parallel  Co  spins.  This  expectation  is  confirmed  by  the  present 
computations.  Matrix  elements  At  are  of  intermediate  size  for  antiparallel  Co  spins. 
Spin  dependence  of  the  computed  transition  probabilities  implies  spin  dependence 
of  the  resistivity.  The  present  computations  indicate  that  for  parallel  alignment  of 
magnetization  in  the  Co  layers  the  minority-spin  resistivity  is  much  greater  than 
the  majority-spin  resistivity.  Since  the  resistivity  for  antiparallel  alignment  is  of 
intermediate  magnitude,  the  magnetoresistance  is  very  large. 

This  spin-dependent  scattering  mechanism  differs  from  the  well-known  Mott 
effect,  which  provides  an  explanation  of  the  increased  resistivity  of  transition  metals 
compared  with  noble  metals.  Because  transitions  into  final  d-states  reduce  the  re¬ 
laxation  time  of  a  wave  packet  at  the  Fermi  energy,  resistivity  due  to  any  dissipative 
scattering  mechanism  depends  on  spin  if  the  density  of  states  does  so.  When  the 
transition  probability  itself  depends  on  spin,  stronger  spin  dependence  is  possible 
than  that  due  to  the  Mott  effect  alone.  Scattering  due  to  the  atomic-cell  ^-matrices 
by  themselves  is  phase-coherent,  described  by  transmission  and  reflection  coefficients 
that  depend  on  spin  and  direction,  but  does  not  contribute  to  electrical  resistivity. 
In  contrast,  scattering  due  to  At  is  dissipative,  in  the  sense  that  each  Bloch  wave 
acquires  an  energy  width  or  inverse  lifetime,  characteristic  of  impurity  scattering 
by  a  displaced  atom  at  a  random  site. 

A  relaxation  time  r  is  defined  if  collisions  cause  a  perturbed  distribution  function 
to  relax  locally  in  its  parameter  space  and  exponentially  in  time  to  the  equilibrium 
distribution  f0 .  If  r  is  defined,  the  classical  steady-state  Boltzmann  equation  for  a 
homogeneous  material  with  no  thermal  gradient  implies  the  electrical  conductivity 
tensor 

(Tij  =  - e 2  J  rVi(dfo/mdVj)  d\.  ( 1 ) 

The  semiclassical  Boltzmann  equation  for  electrons  replaces /(x;  v)  by  the  Fermi- 
Dirac  distribution  function  f(k,b,s),  indexed  by  distinct  eigenstates,  specified  by 
k  in  the  reduced  Brillouin  zone,  by  a  band  index  b ,  and  by  a  spin  index  5.  Classical 
electron  velocity  is  replaced  by  wave-packet  group  velocity,  v  =  ( [/2h)Vke,  in 
atomic  units  if  the  energy  e  is  in  Rydberg  units.  In  the  reduced  Brillouin  zone  for 
a  translational  cell  of  volume  12,  the  number-of-states  element  is  [2]  (12/ 87t3)(  1  / 
2hv)dedS,  where  dS  is  an  element  of  the  constant-e  surface.  In  a  spin-polarized 
system,  the  factor  indicated  here  by  [2]  is  replaced  by  a  sum  over  the  spin  index. 
On  the  Fermi  surface,  Vj  =  vFlt ,  where  1  is  a  unit  normal  vector,  and  at  T  =  0  K, 
dfolhdkj=  -2 VFljd(e  -  eF). 

If  scattering-in  terms  in  the  Boltzmann  equation  are  neglected,  the  relaxation 
time  r  is  the  reciprocal  of  the  transition  probability  at  the  Fermi  surface, 

r-'(kWV5  2  dS'(hv'F)-'^  |(k',6';5|A/s|k,6,5)|2. 

07 r  ,,  J  n 


(2) 
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In  Eq.  (2)  the  final  density  of  states  is  proportional  to  the  reciprocal  Fermi  velocity. 
An  increased  final  density  of  states  for  either  spin  implies  a  relative  decrease  of  the 
relaxation  time.  This  is  the  Mott  effect,  giving  spin-dependent  conductivities  even 
for  spin-independent  transition  probabilities. 

The  implied  semiclassical  electric  current  density  is  jt  =  'Zi(TijEj ,  where  the  con¬ 
ductivity  tensor  is 


rVplilj  dS. 


(3) 


A  sum  over  distinct  energy  bands,  assumed  in  Eq.  (3),  is  not  indicated  explicitly. 
Equation  ( 3 )  implies  that  electrical  conductivity  can  depend  on  spin  only  through 
spin  dependence  of  the  Fermi  surface  or  of  the  mean  free  path  X(k,Z?,s)  =  tvf.  In 
general,  er  is  dominated  by  the  largest  values  of  X,  and  spin  dependence  of  X  implies 
magnetoresistance. 


Calculations 

The  LACO  system  of  programs  for  electronic  structure  calculations  has  recently 
been  extended  to  ferromagnetic  metals.  Preliminary  calculations  on  fee  Cu  and  on 
both  paramagnetic  and  ferromagnetic  fee  Co  provided  initial  radial  density  functions 
for  001 -tetragonal  (CuCoCu)(CuCoCu),  on  the  Cu  lattice,  for  which  self-consistent 
calculations  were  carried  to  convergence  for  both  parallel  f  f  and  antiparallel  \\ 
alignments  of  the  Co  spin  axes.  A  local  basis  designated  by  spd  { fg }  was  used.  The 
orbital  basis  sets  on  each  atom  included  ( spd )  orbital  basis  functions  at  each  elec¬ 
tronic  band  energy,  with  intermediate  boundary  sums  carried  through  (fg)  solid 
harmonics. 

The  matrix  At  for  the  interface  scattering  mechanism  considered  here  is  the 
difference  of  spin-dependent  self-consistent  atomic-cell  /-matrices  computed,  as 
described  recently  [22],  from  rectangular  matrices  characteristic  of  full-potential 
theory.  Matrix  elements  of  At  were  computed  using  Bloch  wave  eigenvectors  of  the 
secular  matrix.  Incoherent  scattering  by  each  displaced  atom  was  assumed.  Equa¬ 
tions  (2)  and  (3)  were  used  to  compute  the  state-dependent  relaxation  time  and 
spin-dependent  electrical  conductivity  tensor,  respectively.  Integrals  over  the  Fermi 
surface  required  in  these  expressions  were  carried  out  by  a  tetrahedral  interpolation 
scheme.  These  calculations  have  not  been  carried  to  convergence  with  respect  to 
the  number  of  points  in  the  Brillouin  zone,  but  test  calculations  indicate  that  the 
interpolation  method  used  is  relatively  stable,  and  that  the  preliminary  results  given 
here  are  not  subject  to  large  error.  A  subsequent  publication  will  give  details  of  the 
computations. 

Relaxation  times  due  to  the  interpenetration  effect  were  computed  in  the  form 
ct,  where  c  is  the  (unknown)  concentration  of  Co  atoms  exchanged  with  Cu  in 
adjacent  planes.  The  spin  dependence  of  ct  and  dependence  on  spin  polarization 
is  very  large,  characterized  in  atomic  units  by  the  orders  of  magnitude 

ttCO  ct( f )  ~  104  ct(\  )  ~  10 l; 

f|(tf)  cr(f)~102  cr(^)~102. 
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The  resulting  conductivity  tensor  is  strongly  spin-dependent.  Computed  in-plane 
conductivities,  again  multiplied  by  the  concentration  c,  are  (in  atomic  units) 

ff  (/)  ca( f  )  =  2.0378,  ca(\ )  =  0.0032,  ca{f)  =  2.0410, 

\\(a)  ca( f )  =  0.0392,  cc(\ )  =  0.0392,  ca(a)  =  0.0784. 

The  implied  magnetoresistance  ratio,  defined  here  by  [p(f)  -  p(a)]/ p(f)  = 
[o{f)  -  v(a)]/<T(a),  is  25.03  from  this  mechanism  alone.  Results  of  similar  large 
magnitude  have  recently  been  reported  from  layered  kkr/cpa  calculations  [20]. 

Recent  first-principles  calculations  by  Sticht  [23  ]  are  similar  to  the  present  work, 
but  do  not  evaluate  a  specific  scattering  mechanism  and  take  the  relaxation  time 
to  be  constant  and  independent  of  spin.  This  is  inconsistent  with  Eq.  (2)  here,  since 
spin-dependent  Fermi  surface  and  Fermi  velocity  imply  spin-dependent  r.  The 
present  results  include  this  Mott  effect,  together  with  the  direct  effect  of  spin- 
dependent  interface  scattering.  In  order  to  separate  these  two  aspects  of  the  physical 
mechanism,  the  calculations  were  repeated  with  the  state-dependent  transition 
probability  replaced  in  Eq.  (2)  by  its  average  over  initial  state  and  spin.  The  modified 
values  of  ca  computed  in  this  way  are 

ft(/)  c<r(f)  =  0.0419,  ca{\ )  =  0.0027,  ca(f)  =  0.0446, 

)  ca( f  )  =  0.0038,  ca{\  )  =  0.0038,  ca{a)  =  0.0076. 

The  implied  magnetoresistance  ratio  is  4.87.  Hence,  in  the  present  example,  the 
Mott  effect  by  itself  leads  to  significantly  smaller  gmr  than  does  the  combination 
of  Mott  effect  and  spin-dependent  interface  scattering. 

A  model  of  bulk  scattering  is  provided  by  using  spin  and  state  independent  values 
of  the  mean  free  path  X  in  Eq.  (3).  Empirical  low-temperature  values  for  typical 
experimental  materials  are  [24]  X(Cu)  =  200  A,  X(Co)  =  70  A.  1/A  for  each 
species  is  weighted  by  the  relative  number  of  atoms  in  the  unit  cell.  The  implied 
values  of  <r  for  pure  bulk  scattering  are 

ttCO  <Kt)  =  1-2288,  <r(|)  =  1.1610,  a(f )  =  2.3898, 

tl(a)  <Kt)  =  1-4350,  <t(\)  =  1.4350,  o  (a)  =  2.8700. 

The  implied  magnetoresistance  ratio  for  spin-independent  bulk  scattering  is  —0. 17, 
opposite  in  sign  to  the  observed  gmr.  This  result  indicates  that,  despite  the  spin 
dependence  of  the  Fermi  surface,  purely  spin-independent  scattering  cannot  account 
for  GMR. 

For  comparison  with  observed  gmr,  resistivities  due  to  bulk  scattering  and  to 
interface  scattering  must  be  combined.  In  the  weak-scattering  limit,  scattering  prob¬ 
abilities  due  to  bulk  and  interface  mechanisms  are  additive,  and  the  implied  spin- 
dependent  resistivities  must  be  added  separately  for  the  spin-up  and  spin-down 
electrons  in  the  two-fluid  model  postulated  here.  The  resulting  spin-indexed  current 
densities  are  independent,  and  are  of  greatly  different  magnitude  in  the  spin-aligned 
material.  Adding  the  spin-indexed  conductivities  gives  a  rational  formula  that  in¬ 
terpolates  A R/R  between  pure  bulk  and  interface  scattering  limits  as  a  function  of 
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c.  If  the  interpenetration  concentration  is  cm  0.10,  which  is  not  unreasonable  for 
atomic  species  closely  spaced  in  the  periodic  table,  and  other  quantities  are  taken 
from  the  data  given  above,  this  formula  implies  AR/R  ~  1.0,  comparable  to  an 
estimate  of  ~2.0  [24]  for  the  empirical  limit  in  Cu/Co  at  low  temperature,  with 
adjacent  magnetic  and  spacer  atomic  layers,  but  no  true  bulk  spacer  material. 

Conclusions 

The  present  first-principles  calculations  show  that  interface  scattering  due  to 
interpenetration  of  adjacent  magnetic  and  spacer  atomic  layers  can  produce  large 
magnetoresistance,  if  the  difference  between  atomic  scattering  ^-matrices  at  the 
Fermi  energy  depends  strongly  on  spin  orientation.  Hence  spin-dependent  energy 
band  structure  can  be  correlated  with  the  occurrence  of  gmr.  Comparing  published 
densities  of  states  [  25  ] ,  Co/Cu  bands  agree  for  majority-spin  electrons  and  disagree 
for  minority-spin  electrons  at  the  Fermi  energy,  while  the  opposite  is  true  for  Fe/ 
Cr  and  Fe/Rh.  The  present  study  predicts  that  electric  current  is  spontaneously 
polarized  in  spin-aligned  magnetized  gmr  materials.  Opposite  polarization  is  ex¬ 
pected  in  Co/Cu  and  Fe/Cr  (or  Fe/Rh),  respectively. 
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Abstract 

Some  reactive  scattering  codes  have  been  ported  on  different  innovative  computer  architectures  ranging 
from  massively  parallel  machines  to  clustered  workstations.  The  porting  has  required  a  drastic  restructuring 
of  the  codes  to  single  out  computationally  decoupled  cpu  intensive  subsections.  The  suitability  of  different 
theoretical  approaches  for  parallel  and  distributed  computing  restructuring  is  discussed  and  the  efficiency 
of  related  algorithms  evaluated.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

The  possibility  of  carrying  out  extended  reactive  scattering  calculations  strongly 
relies  on  the  availability  of  highly  performing  computers  and  on  the  ability  of 
constructing  computational  procedures  capable  of  taking  advantage  of  the  inno¬ 
vative  architectural  features  of  these  machines.  Due  to  the  intrinsic  limitations  of 
single  processor  architectures  in  reaching  high  computing  speed,  the  only  way  of 
performing  extended  reactive  scattering  calculations  using  commercially  available 
technology  is  to  split  computer  codes  into  weakly  coupled  sections  and  make  them 
run  concurrently  on  several  processors. 

A  way  of  designing  an  efficient  concurrent  architecture  is  to  connect  several 
processors  through  a  dedicated  network.  The  processors  execute  independently 
some  processes  and  receive  through  the  network  the  necessary  information  to  start, 
continue,  and  terminate  the  calculations.  Through  the  network  they  also  return 
the  results  and  eventually  receive  instructions  on  how  to  start  new  processes.  The 
design  of  the  dedicated  network  is  most  often  the  key  feature  of  a  parallel  machine. 
In  designing  the  network  of  a  parallel  computer  much  care  has  to  be  put  in  ensuring 
an  efficient  handling  of  communications  between  distant  (in  terms  of  network 
connections)  processors.  On  the  contrary,  the  role  played  by  individual  processors 
is  mainly  to  provide  (possibly  fast  and  cheap)  computing  power  and  some  I/O 
functionalities.  Therefore,  the  computing  nodes  are  quite  often  sufficiently  well- 
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performing  commercial  (or  custom)  processor  chips.  More  complex  (mainly  front- 
end)  operations  are  usually  handled  by  a  (sometimes  assembled  in  the  same  cabinet) 
workstation. 

An  alternative  practical  way  of  distributing  the  calculations  over  concurrent  pro¬ 
cessors  is  to  make  use  of  clustered  workstations.  In  this  case,  the  computing  element 
is  a  real  computer  that  can  provide  a  richer  working  environment.  In  this  case, 
however,  communications  are  dealt  by  either  a  local  or  a  geographical  network. 
Obviously,  such  a  network  is  less  efficient  than  that  of  parallel  computers  making 
it  vital  that  applications  are  partitioned  into  truly  loosely  coupled  processes. 

To  take  advantage  of  concurrent  processing,  one  needs  software  tools  providing 
easy  means  to  maximize  the  efficiency  of  the  different  processors  and  minimize  the 
data  traffic  on  the  network.  To  this  end  use  can  be  made  of  a  language  suitable  for 
the  exploitation  of  the  parallelism  of  the  problem.  The  language  to  use  may  be  a 
sequential  one  enriched  with  parallel  extensions  (e.g.,  HPF,  pvm,  etc.)  or  a  native 
parallel  language  (e.g.,  Occam,  etc.).  Use  can  also  be  made  of  semiautomatic 
parallelization  tools,  like  Mimdizer,  which  can  detect  parallel  features  of  the 
source  code. 

There  is,  however,  no  surrogate  for  an  efficient  design  of  the  application.  This 
can  only  result  from  a  combination  of  a  deep  understanding  of  the  physics  of  the 
investigated  problem,  a  detailed  knowledge  of  the  properties  of  the  adopted  algo¬ 
rithms  and  an  appropriate  exploitation  of  the  peculiar  architectural  features  of  the 
used  machine.  What  is  more,  there  is  not  a  unique  parallelization  golden  recipe 
even  for  established  applications.  The  efficiency  of  parallel  algorithms,  models  and 
architectures  may  drastically  vary  (even  for  the  same  application)  when  the  value 
of  the  parameters  changes.  For  these  reasons,  in  our  approach  to  concurrent  com¬ 
puting,  we  adapted  our  choices  both  to  the  nature  of  the  problem  and  to  the  char¬ 
acteristics  of  the  adopted  computational  procedure. 

Since  our  interests  focus  on  reactive  scattering  calculations,  we  have  first  restruc¬ 
tured  a  classical  trajectory  code  to  run  on  a  distributed  memory  mimd  parallel 
machine  of  the  hypercube  type.  The  parallel  implementation  of  the  atom-diatom 
classical  trajectory  code  on  an  ncUBE  2  machine  is  discussed  in  the  next  section. 
Due  to  the  difficulty  of  fitting  the  parallel  tasks  of  a  reduced  dimensionality  method 
into  the  limited  local  memory  of  the  nCUBE  available  to  us,  we  have  used  for  its 
implementation  on  a  multiprocessor  architecture  a  fewer  processor  machine  with 
larger  node  memory.  Details  of  this  parallel  implementation  are  given  in  the  third 
section.  Higher  memory  and  cpu  demands  of  alternative  or  more  rigorous  ap¬ 
proaches  to  chemical  reactivity  have  prompted  the  use  of  more  complex  compu¬ 
tational  nodes.  To  gain  experience  on  the  use  of  parallel  architectures  based  on 
more  complex  computing  elements,  we  chose  to  implement  the  reduced  dimen¬ 
sionality  code  on  clustered  workstations.  To  this  end,  we  made  also  use  of  some 
parallelization  software  tools.  A  description  of  these  efforts  is  given  in  the  fourth 
section. 


The  Trajectory  Code 

The  simplest  computational  approach  to  the  calculation  of  the  reactive  properties 
of  molecular  systems  are  those  inspired  by  classical  mechanics.  In  these  approaches 
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the  nuclei  of  the  atoms  are  considered  as  mass-points  moving  on  a  potential  energy 
surface  and  obeying  Newtonian  mechanics.  Obviously,  any  formulation  of  the  mo¬ 
tion  equations  equivalent  to  the  Newtonian  one  can  be  adopted  to  describe  the 
motion  of  the  reactive  system.  Our  trajectory  code,  originally  inspired  by  Ref.  [1], 
integrates  Hamilton’s  equations. 

Though  extensively  described  in  the  literature  [2] ,  the  main  features  of  the  quas- 
iclassical  approach  to  atom-diatom  reactivity  are  outlined  here  to  facilitate  the 
understanding  of  the  parallel  restructuring. 


The  Quasiclassical  Method 

For  an  atom-diatom  system  A  +  BC  at  a  given  value  of  the  translational  energy 
Elr,  the  detailed  quasiclassical  cross  section  from  the  reactant  vibrotational  v  j  to 
the  product  ( AB ,  BC  or  AC)  vibrotational  v'j'  state  is  defined  as 


\Etr) = k  f  r*...r 

Jo  Jo  Jo 


dikfvjMiu  fc,...,  fe;  Etr)  (l) 


In  Eq.  ( 1 )  K  is  a  normalization  constant  and  k  is  5  when  a  proper  choice  of 
the  initial  reference  frame  is  made.  In  this  case,  the  £  variables  are  defined  as  £i 
=  ( 1  -  cos  tf°)/2,  £2  =  *7°/(2t r),  £3  =  ( b/bmax )2,  £4  =  £°/t r,  £5  =  <f>°/( 2tt)  where 
the  angle  #  (the  superscript  “0”  means  initial  value)  gives  the  orientation  of  the 
diatom  BC  with  respect  to  the  atom-diatom  A ,  BC  vector,  the  phase  angle  77  de¬ 
termines  the  BC  oscillator  elongation,  b  and  (3  are  the  distance  of  the  relative  collision 
velocity  vector  from  the  BC  center  of  mass  and  the  angle  it  forms  with  the  plane 
of  the  triatom,  respectively,  and  0  is  the  angle  formed  by  the  rotational  angular 
momentum  of  BC  with  the  same  plane.  The  function  fVjtV'f(£  1,  £2,  £3,  £4,  £5;  Etr) 
is  a  Boolean  function.  Its  value  is  one  only  when  after  integrating  the  Hamilton 
equations  of  the  system  starting  from  the  given  set  of  initial  conditions  and  the 
vibrotational  vj  state,  the  products  can  be  assigned  to  the  final  v'j'  vibrotational 
quantum  state.  In  a  quasiclassical  approach  the  continuum  classical  state  v'J'c  is 
usually  discretized  by  assigning  it  to  the  closest  (discrete)  quantum  state  v'j'. 

In  general,  the  integral  of  Eq.  ( 1 )  is  evaluated  using  a  Monte  Carlo  method.  As 
a  result,  the  detailed  reactive  cross  section  can  be  written  as 


avj,v'j'  (E,r)  = 


Tb2  N 


N 


i=l 


(2) 


where  N  is  the  number  of  integrated  trajectories.  More  frequently,  however,  ex¬ 
perimental  quantities  are  less  detailed  than  the  state-to-state  cross  section.  As  an 
example,  in  many  gas  kinetics  studies,  what  is  actually  needed  is  the  rate  coefficient 
K,vf(  T)  defined  as 


KAT) 


l  8  ' 

,1/2  oo 

V 

(2 j  +  l)e-‘J,kT  r 

'  2=0 
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where  c,  is  the  energy  of  the  rotational  state  j ,  QR(  T)  is  the  BC  rotational  partition 
function,  k  is  the  Boltzmann’s  constant,  p  is  the  A ,  BC  reduced  mass  and  c yVj,v'(Etr ) 
is  the  degeneracy  averaged  detailed  reactive  cross  section  summed  over  product 
rotational  states.  In  the  case  of  rate  coefficient  calculations,  a  further  integration 
over  the  energy  is  needed  and,  altogether,  a  six-dimensional  integral  has  to  be 
evaluated.  However,  when  using  a  Monte  Carlo  approach,  the  same  algorithm 
described  for  the  calculation  of  the  cross  section  can  be  used  and,  as  a  consequence, 
the  same  parallelization  recipes  can  be  applied. 


The  Structure  of  the  Computer  Code 

As  sketched  in  the  scheme  of  Figure  1,  on  a  conventional  computer,  the  trajectory 
code  can  be  divided  into  three  sections. 

The  first  section  of  the  code  is  dedicated  to  input  the  data  and  calculate  the 
variables  of  common  use  throughout  the  code.  This  section  sets  the  different  options 
for  selecting  the  initial  conditions  (e.g.,  single  energy  or  thermal  distributions,  fixed 
or  random  orientation,  etc.) ,  the  type  of  the  potential  used,  the  mode  and  frequency 
of  printing  out  information  on  the  individual  trajectory  evolution  and  on  collective 
indicators.  This  section  does  also  work  out  the  mass  factors,  convergence,  and 
accuracy  check  variables,  debugging  flags  and  limits  of  the  integration  domain. 

The  second  section  iterates  over  the  number  of  trajectories  to  be  integrated  the 
necessary  operations.  The  first  of  these  operations  is  the  generation  of  a  string  of 
pseudo-random  numbers  and  its  conversion  into  validated  initial  values  of  the 
integration  variables.  This  has  to  be  strictly  sequential  to  be  deterministic  (i.e.,  to 
generate  the  same  sequence  of  events  when  changing  working  conditions  like  when 
distributing  the  trajectories  on  a  different  number  and  type  of  processors).  To  this 
end,  the  pseudo-random  sequence  is  generated  using  a  routine  that,  given  a  seed, 
generates  a  new  seed  and  a  random  number  in  the  interval  0- 1 .  The  second  operation 
of  this  section  consists  of  integrating  the  12  Hamilton  first-order  differential  equa¬ 
tions  using  a  numerical  integrator.  The  integration  is  carried  out  from  a  sufficiently 


I  SECTION 

Input  data 

Calculate  preliminary  quantities  of  common  use 

II  SECTION 

LOOP  on  trajectories 

Determine  initial  conditions 
Integrate  the  trajectory 
END  the  trajectory  loop 

III  SECTION 

Perform  the  final  statistical  treatment 
Printout  the  results 

Figure  1 .  Scheme  of  the  quasiclassical  trajectory  code. 
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large  atom-diatom  distance  to  a  point  where  one  of  the  three  internuclear  distances 
is  again  large  enough  to  consider  the  process  ended  (the  check  of  the  internuclear 
distances  is  performed  at  each  integration  step).  The  third  operation  consists  of 
working  out  the  product  characteristics  from  the  final  value  of  the  integrated  vari¬ 
ables  (i.e.,  vibrational,  rotational,  and  translational  energy  of  the  products  as  well 
as  their  vector  properties).  This  allows  an  update  of  the  statistical  indicators  related 
to  distributions  of  scalar  and  vector  properties  of  the  products. 

The  final  section  of  the  code  carries  out  a  statistical  analysis  of  the  final  outcome 
of  the  integrated  trajectories  to  print  out  cross  sections  and  product  distributions 
for  the  different  reaction  channels. 

The  Parallel  Restructuring  of  the  Trajectory  Program 

A  parallel  version  of  the  trajectory  code  was  implemented  on  the  nCUBE  2  com¬ 
puter  [  3  ] .  The  nCUBE  machine  is  a  multiprocessor  computer  made  of  2d  (d  is  the 
dimension  of  the  cube.  The  machine  used  for  trajectory  calculations  has  d  =  7 ) 
computing  elements  connected  on  a  hypercube  network.  On  this  network  each 
processor  is  assigned  a  d- bit  code  having  the  property  of  differing  by  one  bit  only 
for  adjacent  nodes.  Each  computing  element  (or  node)  is  provided  with  a  custom 
processor  having  a  peak  performance  of  2.5  Mflops  (double  precision)  and  a  node 
local  memory  of  4  Mbytes.  Only  neighboring  nodes  are  directly  connected.  All 
other  nodes  communicate  through  intermediate  nodes  (though  communication 
does  not  interfere  with  current  execution).  Routing  functions  allow  a  dynamical 
allocation  of  the  communication  paths.  Once  communication  paths  have  been  set, 
they  remain  active  till  an  “end-of-transmission”  packet  is  forwarded.  A  sun  work¬ 
station  acts  as  a  front-end  computer  and  provides  a  user-friendly  environment  for 
development  and  production. 

Message  receiving  and  sending  occurs  through  nread  and  n  write  functions  [4]. 
The  issuing  of  an  nread  instruction  blocks  the  execution  while  when  using  n  write 
the  current  process  continues  executing.  The  parallelism  can  be  implemented  using 

the  following  library  functions:  n _ frun,  n _ fork,  n _ sleep,  n _ npwait  and 

n _ wake.  The  n _ frun  function  activates  a  new  process  on  one  of  the  allocated 

nodes  of  the  hypercube.  Once  a  process  has  been  successfully  activated,  it  returns 
its  identification  number.  To  duplicate  the  calling  process  on  the  same  node  use 

has  to  be  made  of  the  n _ fork  function.  To  suspend  a  process  either  for  a  given 

interval  of  time  or  until  a  given  condition  applies  the  n _ sleep  and  n _ npwait 

instructions  have  to  be  issued.  The  resumption  of  a  process  is  obtained  by  means 

of  the  function  n _ wake.  To  handle  I/O,  functions  like  n _ fcreate,  n _ fopen, 

n _ f close,  n _ fread,  and  n _ fwrite  can  be  used  to  create,  open,  close,  read,  and 

write  a  file  in  a  parallel  fashion. 

A  simplistic  way  of  implementing  on  the  nCUBE  a  parallel  version  of  the  trajectory 
code  is  to  distribute  among  several  processors  the  section  of  the  program  devoted 
to  the  integration  of  the  motion  equations.  In  fact,  once  the  sequential  determination 
of  the  five  initial  conditions  is  performed  each  integration  of  the  motion  equations 
(i.e.,  each  trajectory  calculation)  is  a  fully  independent  process.  Therefore,  when 
adopting  a  farm  parallel  model  [  5  ] ,  the  farmer  can  distribute  the  integration  of 
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the  different  trajectories  to  the  several  worker  nodes  once  the  related  initial  conditions 
have  been  determined.  Unfortunately,  such  an  approach,  though  appropriate  for 
a  few  processor  machine,  fails  to  give  high  speedups  (the  speedup  SP  is  defined  as 
the  ratio  TS/TP  i.e.,  the  ratio  between  the  times  necessary  to  run  the  application 
on  a  single  node  and  that  on  NP  nodes)  when  a  large  number  N  of  trajectories  needs 
to  be  integrated  in  a  many  processor  machine. 

Both  Ts  and  7>  can  be  partitioned  into  a  time  needed  to  run  the  strictly  sequential 
part  (j1  and  5', respectively)  and  the  time  needed  to  run  the  part  that  can  be  executed 
in  parallel  ( p  and  p\  respectively).  In  the  limit  of  ^  =  s'  =  0  and  p  =  NPp',  the 
speedup  is  exactly  NP.  Deviations  from  this  limiting  value  are  due  to  overheads 
associated  with  the  introduction  of  parallelization  functions  as  well  as  to  waiting 
times  caused  by  communication  synchronization  or  by  the  non-negligibility  of  5. 
All  these  elements  become  important  when  the  number  of  trajectories  to  be  inte¬ 
grated  is  large  as  is  quite  often  the  case  of  reactions  relevant  to  practical  applications. 

For  this  reason,  we  performed  a  more  appropriate  restructuring  of  the  code  by 
minimizing  the  communications  and  placing  the  generation  and  validation  of  initial 
conditions  inside  the  worker  process.  Only  the  generation  of  the  first  seed  of  each 
trajectory  was  kept  at  the  master  level  to  ensure  determinism.  Once  the  seed  vector 
(of  dimension  N)  has  been  constructed,  it  is  distributed  to  the  worker  processors. 
From  this  vector,  following  a  message  of  the  master,  the  worker  takes  the  element 
corresponding  to  the  trajectory  to  be  run.  Starting  from  this  seed,  a  pseudo-random 
substring  and  related  initial  conditions  are  generated  and  validated  as  a  part  of  the 
in-node  work.  This  allows  a  dramatic  reduction  of  both  the  workload  of  the  master 
and  the  amount  of  transmitted  data. 

A  similar  modification  was  introduced  to  improve  the  return  of  the  results.  More 
in  detail,  communications  were  drastically  reduced  by  performing  an  in-node  update 
of  the  statistical  indexes.  Statistical  indexes  are  collected  by  the  farmer  at  the  end 
of  the  worker  activity  to  assemble  a  global  statistical  analysis. 

In  this  way,  it  has  been  possible  to  obtain  a  speedup  of  1 19  when  running  4096 
trajectories  on  128  nodes.  Presently,  the  parallelized  version  of  the  code  is  used  for 
production  runs  to  integrate  batches  of  100  000  trajectories  per  job  in  an  average 
time  of  7  hr.  This  has  allowed  a  highly  detailed  analysis  of  isotope  and  threshold 
properties  of  the  Li  +  FH  reaction  [6]. 


The  Reduced  Dimensionality  Quantum  Approach 

Accurate  treatments  of  atom-diatom  reactive  scattering  are  quantum-mechanical. 
Unfortunately,  up  to  date,  some  theoretical  and  computational  difficulties  prevent 
a  straightforward  extension  of  these  methods  to  the  calculation  of  cross  sections 
and  rate  constants  of  generic  atom-diatom  reactions.  Therefore,  the  most  popular 
approach  to  the  calculation  of  these  quantities  are  based  on  the  reduction  of  the 
dimensionality  of  the  problem  by  introducing  some  dynamical  constraints.  The 
reduced  dimensionality  method  we  considered  for  a  parallel  implementation  is  the 
Reactive  Infinite  Order  Sudden  (RIOS)  one. 
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The  Reactive  Infinite  Order  Sudden  Method 

A  detailed  description  of  the  Rios  method  is  given  in  the  literature  [  7  ] .  In  the 
present  article  we  discuss  only  those  details  that  help  the  understanding  of  the 
restructuring.  The  Rios  state  (u)  to  state  (v ')  integral  reactive  cross  section  avy(Etr) 
is  defined  as: 

VvAEtr)  =  £5  2  (2/  +  1 )  J‘  I  E,r)  \ 2  d  cos  0  (4) 

where  kl  =  2  fi(E  —  ev),  E  is  the  total  energy,  ev  is  the  energy  of  the  vibrational  state 
v,  l is  the  (decoupled )  orbital  quantum  number,  and  *SW,«'(©;  Etr )  is  the  detailed  fixed 
0S  matrix  element.  To  evaluate  rate  coefficients  from  Rios  cross  sections  one  can 
use  a  j  shifting  model  to  approximate  the  dependence  of  the  cross  section  from  the 
initial  rotational  quantum  number  j .  According  to  this  model 

Cvj,v'(Etr )  =  avy(Etr  —  evj  +  c„)  (5) 

where  evj  is  the  vibrotational  energy  of  the  v  j  state.  To  calculate  the  Etr) 

elements  the  following  set  of  coupled  differential  equations 

^-D  fl*;0)-O  (6) 


depending  on  whether  we  are  dealing  with  the  cartesian  or  the  polar  region.  In  fact, 
though  Eq.  (6)  is  formulated  in  mass  scaled  Jacobi  coordinates  R  and  r  (the  inte¬ 
gration  coordinate  in  this  region  is  R)  in  the  strong  interaction  region  the  related 
circular  coordinates  a  and  u  are  used  (in  this  case  the  reaction  coordinate  is  the 
arc  u  which  is  linked  to  r\  by  the  relationship  tj  =  1  +  u/ a).  Equations  (6)  were 
obtained  by  expanding  the  Rios  scattering  wavefunction  EIOS(i? ,  r\  0)  in  the  partial 
differential  equation 


\-Hl(LJLR  +  l*Lr-4.-5L\ 

2 n\RdR2  r  dr2  R2  r2 ) 


+  V(R,r;Q)-E  alos(R,  r;Q)  =  0  (7) 
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in  terms  of  the  eigenfunctions  ^(r;  R,  0)  of  the  equation  of  the  bound  coordi¬ 
nate  r 


h2  d 2 

+  F(r;  R,  0)  -  ev 

2  p  dr 


UnR,Q)  =  0 


(8) 


Equation  (7)  is  obtained  from  the  exact  (electronically  adiabatic)  nuclear  Schro- 
dinger  equation  by  applying  at  the  same  time  both  the  centrifugal  sudden  and  the 
energy  sudden  approximations.  In  Eq.  (7)  Ai  -  h  -2/(/  +  1 )  and  Bj  -  h  ~2j(j  +  1 ) 
are  the  coefficients  of  the  decoupled  orbital  and  rotational  terms  of  the  Hamiltonian 
with  /  and  j  being  the  related  quantum  numbers. 

To  integrate  Eqs.  (6)  for  each  value  of  0,  the  (R,  r)  plane  is  divided  into  two 
regions  by  a  straight  line  originating  at  R  =  r  =  0  and  following  the  ridge  separating 
the  entrance  from  the  exit  channel.  By  connecting  the  reactant  and  product  half 
planes  onto  a  single  one,  one  can  uniquely  match  entrance  and  exit  channel  so¬ 
lutions.  Then,  the  two  channels  are  segmented  into  several  small  sectors  and  for 
each  sector  the  eigenvalues  ev  and  eigenfunctions  fv(r;  R,  0)  are  calculated.  The 
asymptotic  solution  is  constructed  using  an  initial  value  propagation  method.  The 
method  propagates  sector  by  sector  the  R  matrix.  From  its  asymptotic  value,  the 
detailed  fixed  0  scattering  reactive  matrix  elements  jS„/,W'(0;  E)  are  derived  by  im¬ 
posing  the  appropriate  boundary  conditions. 


The  Structure  of  the  Computer  Code 

The  schematic  structure  of  the  Rios  code  is  illustrated  in  the  scheme  of  Figure 
2.  As  illustrated  in  the  figure,  the  program  consists  of  three  sections. 

In  the  first  section  input  data  are  read-in  and  variables  of  general  interest  are 
evaluated. 

Scattering  calculations  are  performed  in  the  second  section  II  in  which  a  DO  loop 
(the  outermost  loop)  runs  over  the  collision  angle  values  (as  will  be  discussed 
below,  this  DO  loop  was  at  first  not  activated  in  the  parallel  versions  of  the  code 
and  each  job  was  performing  single  angle  calculations).  In  the  first  subsection, 
during  the  first  run,  all  the  energy-independent  quantities  necessary  for  the  inte¬ 
gration  of  Eqs.  (6)  are  calculated  and  stored  on  disk  for  use  in  next  energy  calcu¬ 
lations  (even  from  different  runs).  The  three  calculation  blocks  of  this  subsection, 
devoted  to  take  cuts  of  the  potential  energy  channel,  calculate  related  vibrational 
eigenvalues  and  eigenfunctions  and,  finally,  assemble  overlap  and  coupling  matrices, 
are  to  be  executed  in  a  strict  sequential  order.  In  the  second  subsection,  after  reading 
the  information  from  disk,  the  integration  of  fixed  angle,  fixed  energy,  and  fixed  / 
values  scattering  equations  is  performed  by  propagating  the  ratio  between  the  func¬ 
tion  and  its  derivative  (R  matrix  method).  The  sector-by-sector  propagation  is  a 
strictly  recursive  process.  At  the  end  of  the  propagation,  the  S  matrix  is  derived 
from  the  R  matrix  and  stored  on  disk  for  further  use  by  programs  devoted  to  the 
calculation  of  differential  and  integral  cross  sections. 

The  third  section  is  devoted  to  final  calculations  and  results  printout. 
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SECTION  I 

Input  data 

Calculate  quantities  of  common  use 


SECTION  II 

LOOP  on  collision  angles 

subsection  a 


IF  first  run  THEN 

Search  the  turning  center 
LOOP  on  sectors 

Calculate  the  sector  potential  cut 
Store  the  sector  potential  cut 
END  the  sector  loop 
LOOP  on  sectors 

Calculate  sector  eigenvalues  and  eigenfunctions 
Store  sector  eigenvalues  and  eigenfunctions 
END  the  sector  loop 
LOOP  on  sectors 

Calculate  the  intersector  overlap  matrix 
Store  the  intersector  overlap  matrix 
Calculate  the  sector  coupling  matrix 
Store  the  sector  coupling  matrix 
END  the  sector  loop 
ENDIF 


subsection  b 


LOOP  on  energies 

Read  the  intersector  overlap  matrix 
Read  the  sector  coupling  matrix 

Embed  the  energy  dependence  into  the  coupling  matrix 
LOOP  on  l  quantum  number 

Integrate  fixed  angle,  fixed  l  scattering  equations 
Storage  of  detailed  S  matrix  elements  on  disk 
Check  for  convergency  on  l 
END  the  l  loop 

Calculate  the  fixed  angle  contribution  to  the  cross  section 
Printout  the  fixed  angle  contribution  to  the  cross  section 
END  the  energy  loop 
Integration  over  the  collision  angle 
END  the  angle  loop 


SECTION  III 

Perform  final  calculations 
Printout  the  reactive  cross  section 

Figure  2.  Scheme  of  the  Rios  code. 

The  Parallel  Restructuring  of  the  Rios  Code 

A  first  parallel  version  of  the  Rios  code  was  implemented  on  a  multiprocessor 
IBM  3090  [  8  ] .  As  already  mentioned,  to  this  end,  use  was  made  of  a  single  angle 
version  of  the  program  and  the  single  energy,  single  /  propagation  (the  more  cpu 
demanding  part  of  the  second  section)  was  distributed  among  concurrent  processors 
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for  parallel  execution.  The  choice  of  exploiting  the  parallelism  at  such  an  inner 
level  was  motivated  by  the  queueing  policy  usually  adopted  by  computer  centers 
for  time-sharing  computers.  The  need  for  handling  the  large  files  containing  fixed 
angle  quantities  necessary  to  carry  out  the  propagation  as  well  as  the  heavy  cpu 
time  request  make  the  RIOS  jobs  be  assigned  low  priorities  when  running  on  cen¬ 
tralized  computers.  Therefore,  by  keeping  memory  occupation  at  the  lowest  (single 
angle)  level  possible,  we  had  the  possibility  of  exploiting  the  parallelism  on  /.  Such 
a  choice  is  able  to  ensure  a  satisfactory  load  balancing  even  at  moderately  high 
energies. 

In  practice,  the  restructuring  of  the  Fortran  code  consisted  in  incorporating  into 
a  single  routine  the  whole  body  of  the  /  do  loop  (the  propagation)  with  the  exception 
of  the  convergency  check.  Such  a  restructuring  led  to  fairly  high  speedups  [  8  ] . 

A  limitation  of  the  implementation  of  the  code  on  a  shared  memory  machine 
was  that,  due  to  the  little  number  of  processors  available,  the  batch  of  (single  angle) 
energies  dealt  by  individual  jobs  had  to  be  kept  small  to  the  end  of  keeping  turn¬ 
around  times  in  the  limit  of  a  few  hours. 

A  parallel  distributed  machine  that,  while  offering  a  sufficient  node  computing 
speed  and  memory  size,  gave  us  the  possibility  of  distributing  the  calculation  over 
a  number  of  processors  larger  than  that  of  shared  memory  architectures  was  the 
Meiko  Computing  Surface  [9],  This  machine  is  made  of  a  set  of  modules  each 
containing  several  boards.  Each  board  is  based  on  transputers  used  as  processors 
dedicated  to  communication  and  scalar  calculations.  Vector  boards  integrate  the 
transputers  with  Intel  i860  chips.  Specialized  boards  for  parallel  I/O,  display  ele¬ 
ments,  etc.  can  also  be  added.  The  Computing  Surface  has  an  adjustable  node 
interconnection  topology.  An  additional  feature  of  this  machine  is  the  System  Su¬ 
pervisor  bus.  This  provides  a  separate  global  way  of  communication  allowing  a 
direct  control  over  individual  processing  elements.  The  Computing  Surface  we  used 
was  operated  by  a  sun  Sparcstation  using  CStools  [10]  as  a  parallelization  tool. 
CStools  (Communication  Sequential  Tools)  supports  the  programming  of  multi¬ 
processor  applications  by  structuring  the  parallel  application  as  a  set  of  sequential 
programs  exchanging  data  and  synchronizing  the  operations  by  means  of  message 
passing  routines.  To  this  end,  CStools  provides  the  user  with  the  following  types  of 
functionalities:  (a)  communication  services  (CSN)  which  take  care  of  interprocess 
communications  (retransmissions,  multiplexing,  buffering,  etc.)  transparently  with 
respect  to  the  actual  physical  connections  of  the  system  which  is  seen  as  fully  con¬ 
nected  and  homogeneous;  (b)  system  services  (rte)  which  provide  standard  local 
and  remote  operating  system  support  to  the  nodes  and  processes;  (c)  configuration 
tools  which  make  it  easier  to  describe  the  layout  of  the  parallel  application  and 
how  the  application  will  be  mapped  on  the  hardware;  (d)  runtime  development 
tools  which  assist  the  user  in  the  implementation  of  parallel  applications. 

The  machine  used  for  implementing  the  parallel  version  of  RIOS  was  made  of 
INTEL  i860  nodes  having  a  peak  speed  of  80  Mflops  and  a  8  Mbyte  memory.  On 
this  machine,  we  have  been  able  to  run  RIOS  using  400  sectors  and  a  vibrational 
basis  set  of  dimension  15  (a  dimension  more  than  sufficient  in  the  investigated 
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energy  range).  Following  a  processor  farm  model,  the  main  process  was  copied 
into  a  computing  element  (node  zero)  acting  as  a  farmer  while  the  propagation 
section  was  copied  inside  all  remaining  computing  elements  acting  as  worker  nodes. 
After  feeding  at  the  beginning  all  the  worker  processors  with  one  /  value  of  the  first 
energy,  the  farmer  waits  for  the  first  result  to  return.  As  soon  as  a  worker  sends 
back  the  result  of  its  calculation,  the  farmer  checks  for  convergency  of  the  cross 
section  with  /.  In  case  of  convergency,  it  stops  the  workers  and  starts  a  new  energy 
(if  there  is  any  left).  If  convergency  has  not  yet  been  reached,  the  farmer  supplies 
the  worker  with  a  new  /  value  to  be  processed.  Speedups  measured  using  the  parallel 
version  of  the  code  amount  to  1.86,  3.52,  6.02,  and  12.5  when  using  2,  4,  7,  and 
16  worker  processors.  Such  a  high  performance  allowed  us  to  carry  out  extensive 
calculations  of  the  RIOS  cross  sections  of  the  Li  +  FH  reaction  using  a  fine  energy 
grid  (0.001  eV)  to  compare  with  time-dependent  treatments  [11]. 

Towards  Three-Dimensional  Approaches 

Presently,  we  are  working  at  building  parallel  computational  procedures  able 
to  cope  with  the  problem  of  determining  three-dimensional  ( 3D)  quantum  cross 
sections  for  atom-diatom  reactions.  To  this  end,  we  are  proceeding  along  two 
different  lines.  The  main  line  is  to  extend  to  higher  dimensionality  the  approach 
followed  here,  i.e.,  single  out  a  reaction  coordinate  on  which  carry  out  an  initial 
value  integration  of  the  coupled  differential  equations  obtained  after  averaging 
over  the  remaining  coordinates.  Details  of  this  type  of  parallel  approaches  are 
given  below.  The  second  line  consists  of  exploiting  radically  different  parallel 
approaches. 

As  far  as  the  exploitation  of  potential  parallelism  of  alternative  approaches  is 
concerned,  in  collaboration  with  G.  G.  Balint-Kurti  two  of  us  compared  the 
performance  of  parallel  implementations  of  time-independent  and  time-depen¬ 
dent  reduced  dimensionality  numerical  procedures  [11].  Presently,  reduced  di¬ 
mensionality  time-independent  approaches,  such  as  that  on  which  the  Rios  code 
is  based,  seem  to  be  better  suited  for  parallel  implementation  than  time-depen- 
dent  ones.  To  find  an  alternative  theoretical  approach  more  naturally  leading 
to  strongly  decoupled  computational  procedures,  we  are  following  the  guidelines 
given  by  Fox  and  collaborators  in  Ref.  [  1 2  ]  for  solving  wave-function  problems. 
To  this  end,  we  have  reformulated  the  Hamiltonian  in  a  way  that  confines  the 
problem  into  a  finite  space  and  provides  a  more  natural  scheme  for  using  regular 
grids  '[13]. 

Progress  along  the  main  stream  of  our  research  lines  seems  to  be  more  re¬ 
warding  in  the  short  term.  To  this  end,  we  have  started  to  design  the  restructuring 
of  a  three-dimensional  quantum  reactive  scattering  program  based  on  the  Adi- 
abatically  adjusting  Principal  axes  of  inertia  Hyperspherical  (aph)  coordinates. 
This  approach  [14]  takes  as  a  reaction  coordinate  the  hyperradius  p  defined  as 
p2  =  R2  +  r2.  Remaining  aph  coordinates  are  the  hyperangles  6  and  X  defined 
as 
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[(R2  -  r2)2  +  4 R2  •  r 2  cos2  Q] 1/2 
2  R  •  r  sin  0 


2R  •  r  cos  0 


sin(2x)  _  r2^2  _|_  4^2  .  y.2  cos2  0j  1/2 


cos(2x) 


i?2  -  r 


[(i?2  -  r2)2  +  4 R2-r2  cos2  0] 1/2 


In  the  aph  approach  the  scattering  matrix  elements  are  obtained  by  integrating, 
for  all  values  of  the  total  angular  momentum  /leading  to  non-negligible  reactivity, 
the  following  set  of  coupled  differential  equations  in  p. 

f-2  +  (/>)  -  %  2  | *ftj5jw->*fr(p)  ( io) 

op  n  J  fi  t,A, 

Equations  ( 10)  are  the  3D  hyperspherical  analogous  of  the  Rios  Eqs.  (6).  In  them 
Hi  =  T„  +  Tr+  Tc+  V„+  V(p,  6,  X)  (11) 


ft2  /  4  a  .  a  1  a2 
2^p2  \sin  2@  aa sin  aa  sin2  a  ax : 

=  a(p,  e)Jl  +  5(p,  a)7j  +  c(P,  e)Jl 

_  ih  cos  6  _d 
c  pp2  sin  26  y  dx 


with  A(p,  6 ),  £(p,  0),  and  C(p,  0)  being  defined  as 


(16) 

(17) 

(18) 


The  integration  of  Eqs.  ( 10)  is  carried  out  from  near  zero  to  a  large  value  of  the 
hyperradius.  At  large  p  values,  the  equations  are  usually  reformulated  in  Jacobi 
coordinates  (at  large  distance,  Jacobi  coordinates  separate  the  different  reaction 
channels).  As  for  the  RIOS  code,  the  integration  along  the  reaction  coordinate  is 
performed  by  segmenting  the  p  interval  into  several  small  sectors.  What  makes  the 
aph  approach  really  differ  from  the  RIOS  one  is  the  fact  that,  within  each  sector  i, 
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the  fixed  total  angular  momentum  /and  related  projection  M global  wave  function 

gtJMpn  -g  expancje<i  as 

=  2  P“5/V/r(p)*/«e,  x;  p,)D{PM(a,  ft.7)  ( 19) 

tA 

where  DJam(<x,  ft  7)  is  the  normalized  Wigner  function  in  the  three  Euler  angles, 
1//  is  a  function  of  the  reaction  coordinate  p  while  $  is  a  function  of  the  two  hyper¬ 
angles  chosen  to  satisfy  at  a  fixed  value  of  the  hyperradius,  p, ,  the  two-dimensional 
differential  equation 

[Th  +  Gh2J(J+  1)+  VPi  +  (C~G)h2  A2 

+  V(6,  X;  p,)  +  «;£(p, )]*#(«,  X;  p,)  =  0  (20) 

with  (7  =  (^  +  2?)/2.  The  higher  dimensionality  of  Eq.  (20)  with  respect  to  Eq. 
(8),  makes  the  memory  demand  of  the  3D  method  approach  much  larger  than 
that  of  RIOS.  As  a  result,  to  go  beyond  the  memory  limits  of  the  previously  used 
machines,  we  decided  to  distribute  the  aph  3D  calculations  on  a  cluster  of  work¬ 
stations. 

To  gain  experience  on  how  to  distribute  quantum  reactive  scattering  numerical 
procedures  on  clustered  workstations,  we  started  by  implementing  the  RIOS  program 
on  this  type  of  architectures.  In  fact,  the  structure  of  the  Rios  computational  pro¬ 
cedure  is  similar  to  that  of  aph  when  an  analogy  between  the  DO  loop  over  the 
collision  angle  of  the  former  and  the  DO  loop  over  the  total  angular  momentum  of 
the  latter  is  established. 

Software  Tools  for  Parallel  Distributed  Computing 

To  pursue  our  goals,  we  used  two  different  parallelization  software  tools  aimed 
at  making  easier  the  restructuring  and  the  implementation  of  computer  codes  on 
distributed  systems.  Some  of  these  tools  are  of  public  domain  and,  therefore,  they 
can  be  easily  obtained  in  the  proper  version  for  a  variety  of  machines.  Other  tools 
are  commercial  products  and  therefore  need  to  be  bought.  The  tools  we  used  are: 
PVM  [15]  (Parallel  Virtual  Machine)  version  2.4.1  and  Network  Linda  2.5  [16]. 

pvm  is  a  product  developed  jointly  at  the  Oak  Ridge  National  Laboratory  and 
at  the  Emory  University.  It  is  available  as  a  source  via  an  anonymous  ftp.  pvm  is 
based  upon  a  message  passing  philosophy  and  is  made  of  a  library  of  primitives. 
To  run  pvm  a  daemon  must  be  made  active  on  all  the  clustered  machines.  The 
daemon  is  the  execution  time  support  of  pvm  .  The  primitives  are  user  interfaces 
that  need  to  be  inserted  into  the  Fortran  code  to  make  explicit  the  parallelism 
according  to  the  message  passing  model.  A  pvm  parallel  application  is  based  on 
executable  versions  of  cooperating  programs  (components)  loaded  on  one  or  more 
machines  considered  as  parts  of  a  large  virtual  machine.  The  master  daemon  (the 
first  started  daemon)  generates  local  daemons  on  the  machines  of  the  cluster  as 
specified  by  the  configuration  file.  Local  daemons  take  care  of  creating  and  managing 
pvm  processes  running  on  the  same  machine  as  well  as  of  communicating  with  the 
other  daemons. 
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With  the  pvm  release  we  used  (more  recently  pvm  has  been  modified  to  cope 
with  the  problem)  the  application  has  to  be  written  in  a  way  that  takes  care  of 
possible  faults.  That  pvm  version,  in  fact,  while  providing  functions  for  detecting 
malfunctioning  of  the  individual  components  of  the  virtual  machine  is  unable  to 
restart  on  a  different  computer  of  the  cluster  an  abnormally  ended  process  on  a 
different  computer  of  the  cluster.  As  far  as  the  communication  between  the  different 
processes  is  concerned,  PVM  adopts  the  one-to-one  (point-to-point)  and  the  one- 
to-many  (broadcast)  communication  procedures  without  limitations  for  the  size 
and  the  number  of  exchanged  messages.  When  sending  a  message,  the  control  is 
returned  to  the  calling  process  as  soon  as  the  buffer  can  memorize  the  next  message 
regardless  of  the  state  of  the  receiver.  When  receiving  a  message,  if  the  message  is 
nonblocking,  the  control  immediately  returns  to  the  calling  process  with  specifi¬ 
cations  about  whether  or  not  the  message  has  been  received.  On  the  contrary,  for 
blocking  messages,  the  control  is  returned  to  the  calling  process  only  after  completion 
of  the  process. 

Linda  has  been  originally  designed  at  the  Yale  University  and  is  presently  com¬ 
mercialized  by  a  private  company  [17].  Contrary  to  pvm,  Linda  has  a  shared  mem¬ 
ory  approach  to  the  parallelism  (even  when  dealing  with  distributed  memory  ar¬ 
chitectures  such  as  clustered  workstations).  Linda  is  based  on  four  simple  operations 
which  need  to  be  inserted  in  the  high  level  language  source  code  to  create  and 
execute  parallel  processes.  Linda  acts  through  a  common  space  (mple  space)  con¬ 
sisting  of  a  series  of  objects  (mples)  which  are  handled  by  a  pattern  recognition 
procedure  and  accessed  through  a  template.  A  template  and  a  mple  match  only 
when  having:  (a)  the  same  number  of  fields  (value  and  formal  fields);  (b)  the  same 
type  of  homologous  formal  fields;  (c)  the  same  content  for  the  homologous  value 
fields.  A  mple  may  be  passive  (data)  or  active  (process).  When  a  process  comes  to 
an  end,  it  becomes  a  passive  mple  containing  in  one  field  the  results  generated  by 
the  process.  There  are  two  ways  of  generating  mples:  one  in  which  the  mple  passed 
as  a  parameter  is  evaluated  inside  the  calling  process  and  added  to  the  mple  space, 
the  other  in  which  the  evaluation  is  carried  out  by  a  new  ad  hoc  generated  process 
while  the  calling  process  goes  on  executing  without  waiting  for  the  evaluation  of 
the  mple.  There  are  also  two  operations  to  read-and-eliminate  mples  (each  with  its 
nonblocking  variant).  One  of  these  verifies  the  correspondence  between  the  template 
and  the  mple.  If  the  mple  satisfies  such  a  correspondence  it  is  eliminated  ( a  random 
one  if  there  are  several).  The  other  read-and-eliminate  mple  verifies  the  corre¬ 
spondence  without  elimination.  The  process  is  stopped  if  no  mple  satisfies  the 
correspondence. 

The  Code  Restructuring  for  Distributed  Computing 

Critical  features  of  loosely  coupled  architectures  (such  as  clustered  workstations) 
are  the  high  latency  and  the  small  bandwidth  of  the  communication  network.  For 
this  reason,  only  computational  tasks  needing  little  (or  no)  communication  can  be 
efficiently  distributed  on  a  cluster  of  workstations.  Therefore,  whenever  possible, 
several  computational  tasks  should  be  collected  together  in  the  same  distributed 
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process  and  related  communications  grouped  together.  On  the  contrary,  a  great 
advantage  of  using  distributed  architectures  is,  as  already  mentioned,  the  possibility 
of  having  shorter  turnaround  times  than  on  centralized  time  sharing  supercomputers 
thanks  to  the  power  of  the  processor  and  to  the  more  friendly-to-use  environment. 
For  the  RIOS  code  this  has  led  to  the  choice  of  including  the  outermost  DO  loop  in 
single  job  calculations. 

A  first  restructuring  was  inspired  to  the  distribution  scheme  adopted  on  shared 
memory  machines,  i.e.,  by  adopting  a  distribution  at  an  inner  level.  In  this  case, 
however,  since  memory  occupation  is  not  a  great  problem,  the  distribution  was 
moved  from  the  /  to  the  energy  loop.  This  means  that  one  or  more  single  energy 
entire  DO  loop  over  /  is  assigned  for  execution  to  each  individual  worker .  Obviously, 
such  a  strategy  is  successful  only  when  the  number  of  energies  to  be  calculated  is 
large  enough  (with  respect  to  the  number  of  used  processors)  to  allow  a  true  dy¬ 
namical  assignment  of  the  work  by  the  farmer  and  that  enough  /  values  are  needed 
at  each  energy  before  reaching  convergence.  Only  in  this  way  can,  in  fact,  a  good 
load  balance  be  achieved.  Such  a  situation  was  rather  infrequent  in  the  past.  In 
fact,  as  already  mentioned,  in  calculations  on  centralized  supercomputers  we  could 
deal,  in  general,  only  with  a  limited  number  of  energies  per  job.  The  enhanced 
processor  speed  and  the  possibility  of  distributing  the  calculations  among  different 
machines  without  being  delayed  by  a  penalizing  queue  policy,  have  made  it  feasible 
to  extend  the  calculations  to  many  more  energies  than  before  while  leading  to  a 
lower  turnaround  time.  This  has  made  it  possible  to  investigate  in  detail  the  energy 
dependence  of  several  reactions  of  interest  for  practical  applications. 

When  implementing  the  pvm  version  of  the  code  based  on  this  restructuring 
strategy,  we  kept  the  same  processor  farm  organization  adopted  for  the  Meiko 
Computing  Surface.  Accordingly,  the  RIOS  code  was  divided  into  a  farmer  program 
MSIOS  running  on  the  master  machine  and  a  worker  program  (SLIOS)  running  on 
slave  machines.  The  farmer  program  takes  care  of  reading  input  data  and  distributing 
the  work  among  the  workers  which  carry  out  one  or  more  cycles  of  the  energy  do 
loop.  Using  pvm,  data  to  be  distributed  to  the  worker  processes  can  be  properly 
packed  and  handled  as  a  single  message.  In  PVM  language  this  means  that,  after 
the  processes  have  been  enrolled  as  pvm  farmer  or  worker  processes,  an  fput  in¬ 
struction  is  issued  by  the  MSIOS  pvm  master.  This  converts  the  data  to  be  sent  to 
the  worker  SLIOS  pvm  process  into  the  proper  xdr  transfer  format.  Then  MSIOS 
executes  also  the  fsend instruction  to  perform  the  transfer  to  SLIOS.  On  the  worker 
side  the  corresponding  operations  are  frcv  to  collect  the  data  sent  by  the  farmer 
process  and  fget  to  back-convert  them  from  the  xdr  format.  Similarly,  when  closing 
the  DO  loop  after  reaching  convergence  on  /,  the  worker  transfers  the  results  to  the 
farmer  process  using  an  fput  and  an  fsend  instruction  which,  respectively,  convert 
and  return  the  results.  These  operations  are  matched  by  an  frcv  and  an  fget  instruc¬ 
tion  of  the  farmer  to  back-transform  and  collect  the  results.  Since  a  sending 
does  not  inhibit  the  continuation  of  the  process  execution,  when  worker  processes 
deal  with  more  than  one  energy,  single  energy  results  can  be  returned  while  next 
energy  is  being  processed.  When  the  calculations  concerning  the  batch  of  the  assigned 
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energies  is  completed,  the  pvm  processes  enrolled  by  farmer  and  worker  processors 
are  released  by  executing  a  j leave  instruction. 

The  same  processor  farm  structure  was  retained  also  when  using  Linda.  The 
main  difference  between  pvm  and  Linda  is  that  the  set  of  instructions  used  for  data 
transfer  in  Linda  are  presented  as  shared  memory  operations.  As  typical  of  shared 
memory  approaches,  the  code  was  organized  as  a  main  (msiosl)  calling,  among 
others,  the  subroutine  SLIOSL  embodying  the  process  to  be  distributed.  In  analogy 
with  the  pvm  version,  SLIOSL  contains  DO  cycles  of  the  iteration  over  energy.  The 
main  program  MSIOSL  reads-in  the  data  from  the  input  file  and  activates  the  primitive 
worker  processes  through  eval  Then  the  execution  of  SLIOS  and  related  data  are 
dealt  as  mples  of  a  common  memory  space.  The  content  of  the  mple  space  is  shared 
between  the  farmer  and  worker  processes.  Communications  are  implemented  by 
creating,  consuming,  and  reading  mples.  More  in  detail,  the  farmer  writes  mples 
containing  data  for  the  calculations  in  the  related  space.  These  are  read-in  without 
being  consumed  by  the  worker  processes.  The  mples  related  to  fixed  energy  cal¬ 
culations  completed  by  the  workers  are  consumed  by  th z  farmer  at  the  returning 
of  the  results.  When  a  worker  reads  a  mple  containing  the  information  about  the 
calculations  to  be  carried  out,  it  consumes  that  mple.  If  more  calculations  need  to 
be  performed  it  then  creates  a  new  mple  containing  updated  information.  When  a 
worker  finds  that  there  is  no  more  work  to  be  carried  out  (i.e.,  when  all  energies 
are  completed  for  all  angles)  it  does  not  create  a  new  mple.  As  a  result,  the  master 
finds  that  the  work  to  be  done  has  been  completed  and  writes  a  mple  containing 
the  message  “ended-job”  for  all  the  workers.  The  flexibility  of  both  hardware  and 
software  working  environments  offered  us  the  possibility  of  experimenting  different 
parallel  implementations  of  the  code.  Though  the  work  is  still  in  a  preliminary 
stage,  speedups  of  3.6  and  4.7  were  reached  when  using  pvm  and  Linda,  respectively. 


Conclusions 

In  an  attempt  to  produce  fast  and  accurate  means  of  calculating  reactive  properties 
of  simple  molecular  systems,  we  ended  up  by  facing  a  basic  (and,  under  some 
aspects,  also  old)  question:  how  does  the  evolution  of  computer  technology  impact 
the  way  chemists  design  new  theoretical  approaches  and  build  related  computational 
procedures?  Under  this  respect,  in  fact,  the  present  evolution  of  the  computer  tech¬ 
nology  is  a  real  challenge  for  theoretical  chemists.  In  about  ten  years  we  have  seen 
the  birth  and  the  death  of  single  highly  powered  processor  supercomputers  while 
concurrent  processor  architectures  have  grown  so  fast  to  make  available  peak  speeds 
of  giga  and  teraflops.  Using  resources  available  at  our  institutions  we  followed  an 
itinerary  that  provides  a  tentative  partial  answer  to  the  mentioned  question  by 
considering  as  a  case  study  theoretical  approaches  to  reactive  scattering. 

We  started  by  implementing  the  quasiclassical  procedure  for  calculating  reactive 
cross  sections  of  atom-diatom  systems  since  this  is  the  most  parallel-oriented  com¬ 
putational  approach  to  reactive  scattering.  For  this  approach  the  use  of  a  small 
memory  parallel  multicomputer  (an  ncUBE  2)  was  found  to  be  appropriate  since 
the  application  was  cpu-intensive.  It  was  not  difficult,  however,  to  realize  that  even 
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“embarrassingly  parallel”  problems  may  lead  to  poor  speedups.  We  learned  that 
to  make  full  profit  from  a  highly  parallel  machine  the  distribution  of  the  work 
among  concurrent  nodes  has  to  be  carefully  designed  to  avoid  conflicts  and  bottle¬ 
necks  associated  both  with  the  sequential  parts  of  the  code  and  the  way  of  distributing 
the  workload  between  the  farmer  and  the  workers. 

Then  we  started  the  implementation  of  the  more  accurate  quantum  codes.  This 
prompted  us  another  question  on  how  far  the  technology  may  go  without  facing 
the  problem  of  confronting  itself  with  user  needs.  In  fact,  while  working  on  finding 
alternatives  to  present  theoretical  approaches  (which  were  developed  using  single 
processor  technology,  i.e.,  sequential  architectures),  to  carry  out  present  and  near- 
future  dynamical  investigations  we  had  to  implement  parallel  versions  of  some  of 
the  existing  quantum  numerical  procedures.  In  general,  quantum  techniques  are 
based  on  numerical  approaches  making  use  of  large  matrices  whose  operations  can 
be  parallelized  at  the  price  of  a  significant  loss  of  efficiency.  To  minimize  the  loss 
of  efficiency,  the  node  memory  has  to  be  large  enough  to  accommodate  all  the 
matrices  needed  by  the  parallelized  section  of  the  code  while  the  considered  section 
has  to  be  self-containing  to  the  extent  of  requiring  little  data  transfer.  For  this 
reason,  we  moved  to  a  computer  architecture  having  a  more  modular  (less  inte¬ 
grated)  structure  but  larger  node  memories  (after  all,  other  conceptually  appealing 
features  of  innovative  architectures  had  to  be  dropped  in  the  past  when  moving 
from  prototypes  to  commercial  machines). 

By  switching  to  parallel  machines  with  bigger  memories,  we  have  been  able  to 
efficiently  run  parallel  versions  of  the  RIOS  program  without  sacrificing  the  size  of 
the  expansion  basis  set  and  the  number  of  sectors  to  be  used  for  the  propagation. 
This  was  still  insufficient  for  dealing  with  the  accurate  3D  quantum  approach  which 
needs  much  larger  memories.  To  this  end  we  have  considered  the  use  of  a  cluster 
of  workstations.  For  this  large  (virtual)  machine,  memory  is  less  a  problem  since 
each  computing  element  is  a  complete  computer.  This  makes  it  possible  to  handle 
the  very  large  matrices  usually  associated  with  3D  quantum  methods.  Obviously, 
other  problems  arise  when  using  clustered  workstations  (faults  handling,  software 
and  hardware  alignment,  the  slowness  of  the  network,  etc.).  This  makes  clustered 
workstations  an  unsatisfactory  solution  for  handling  reactive  scattering  calculations. 
Fortunately,  commercially  available  machines  are  evolving  towards  massively  par¬ 
allel  architectures  having  local  memories  of  several  hundreds  or  thousands  mega¬ 
bytes.  Then,  the  problem  becomes  the  choice  of  the  right  mix  of  memory  size, 
network  bandwidth,  and  cpu  speed  which  makes  a  parallel  architecture  suitable  for 
a  particular  reactive  scattering  calculation. 
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Abstract 

A  coupled  channel  theory  of  resonances  has  been  formulated  within  the  propagator  approach  of  many- 
body  theory  and  applied  to  the  ls3s2  resonance  of  ^-helium  scattering.  This  system  has  previously  been 
studied  both  experimentally  and  theoretically.  Our  results  for  the  width  of  the  resonance  agree  well  with 
these  earlier  findings.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

The  interest  in  propagator  approaches  to  the  calculation  of  atomic  and  molecular 
properties  has  increased  in  recent  years  due  to  highly  sophisticated  approaches  for 
the  computation  of  the  fundamental  operators  of  the  theory.  Among  them,  the 
construction  of  the  self-energy  operator  through  fourth  order  of  many-body  per¬ 
turbation  theory  [1]  and  of  superoperators  partially  through  fourth  order  [2,3], 
and  the  numerical  implementation  of  self-energy  operators  complete  in  third  order 
[  4  ]  deserve  particular  mention.  Another  strong  motivation  for  renewed  interest  in 
propagator  approaches  may  be  attributed  to  the  ease  using  the  propagator  concept 
of  developing  physical  intuition  when  the  theory  is  represented  in  terms  of  Feynman 
diagrams.  While  most  existing  propagator  calculations  evaluate  bound-state  prop¬ 
erties  of  atomic  systems,  the  application  of  the  Green  function  concept  to  scattering 
problems — after  isolated  early  work  [5,6]  and  noteworthy  progress  in  electron- 
molecule  scattering  [7,8] — is  relatively  unexplored.  The  present  work  is  the  exten¬ 
sion  of  a  theoretical  approach  formulated  in  an  earlier  publication  [9]  and  con¬ 
tains  the  first  numerical  application  of  the  coupled  propagator  method  to  Feshbach 
resonances. 

Although  this  study  focusses  on  width  calculations  for  electron  scattering  reso¬ 
nances  when  more  than  one  decay  channel  is  open,  the  approach  evaluates  also 
resonance  energies  as  the  discussion  of  our  results  in  the  fourth  section  will  show. 
In  the  framework  of  Green  function  theory,  the  presence  of  several  decay  channels 
requires  the  introduction  of  nondiagonal  self-energy  operators  and  has  to  our 
knowledge  not  been  addressed  before  in  a  numerical  study. 

Until  recently  the  major  method  of  solving  for  poles  of  the  resolvent  operator 
(e  -  H)~ 1  which  correspond  to  resonances  was  the  Feshbach  method  [10,11]. 
Following  the  work  of  Balslev,  Aguilar,  and  Combes  [12,13],  however,  there  has 
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been  considerable  activity  in  using  the  alternative  method  of  dilatation  transfor¬ 
mations  [14-24].  At  the  same  time  it  has  been  shown  [25,26]  that  Siegert’s  reso¬ 
nance  theory  [27]  can  be  set  up  directly  as  a  well-behaved,  complex  matrix  eigen¬ 
value  problem.  In  the  framework  of  both  the  dilatation  transformation  approach 
and  the  direct  Siegert  method,  the  problem  of  solving  for  the  complex  energy  can 
be  reduced  to  a  problem  of  diagonalizing  matrices  the  structure  of  which  is  similar 
to  those  encountered  in  configuration  interaction  solutions  for  normal  bound-state 
eigenvalues.  The  complex  nature  of  the  approach,  however,  requires  computation¬ 
ally  more  demanding  variational  criteria  which  have  been  addressed  in  a  recent 
paper [28]. 


Review  of  the  Siegert  Method 

The  central  equation  of  the  propagator  approach  with  or  without  resonance 
boundary  conditions  is  Dyson’s  nonrelativistic  pseudo-eigenvalue  equation 

l$n(e)-£(e)]$n(r,e)  =  0  (1) 

with  Layzer’s  operator  [29] 

X(e)  =  -£V2+  F+2(*)  (2) 

which  contains  an  energy  independent  one-electron  part  V  including  the  Hartree- 
Fock  contribution  (for  Feshbach  resonances  preferably  in  a  VN~X  representation) 
and  an  integral  operator 

S(e)$(r,  e)  =  J  d3r'  2(r,  r';  e)$(r',  e)  (3) 

which  describes  the  effects  of  the  many-electron  background  in  principle  accurately. 
The  eigenvalues  <£„(«  =  £„),  i.e.,  taken  at  values  of  the  energy  input  parameter  e 
which  coincide  with  one  of  the  calculated  eigenvalues,  correspond  to  energy  dif¬ 
ferences  +  { S0(N)  -  &„(N±  1 ) }  and  may  be  identified  with  electron  affinities  and 
ionization  potentials,  respectively,  if  real  valued,  and  with  the  energies  of  electron 
scattering  and  Auger  resonances,  respectively,  if  complex  valued.  In  simple  cases 
with  not  more  than  one  decay  channel  open,  the  imaginary  parts  of  the  energy  are 
related  to  resonance  widths  T„  by  the  usual  relation 

3/n  &n  =  —  \  zT„  ( 4 ) 

This  equivalence  holds  also  above  the  first  excitation  threshold  of  the  remaining 
atom:  The  ls3s2  state  of  the  negative  helium  ion,  as  an  example,  can  decay  into 
continua  associated  with  the  Is2  and  the  ls2s  and  ls2p  states  of  the  helium  atom. 
Since  the  excited  states  of  the  target  occur  either  as  spin  singlets  or  triplets,  there 
are  five  open  continua  to  be  considered.  In  such  cases,  the  imaginary  part  of  the 
energy  is  then  related  to  the  total  width  of  the  compound  state. 

In  the  single-channel  direct  Siegert  approach  the  complex  eigenmatrix  (<£  -  £) 
to  be  diagonalized  is  set  up  in  a  basis  of  self-consistently  obtained  orbitals  in  the 
VN~l  representation  plus  one  Siegert  orbital  of  the  form 
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k(r)  =  /(r) r-1exp [/(/(>  +  ix/)]7/w(S2)  X  spin  function  (5) 

with  <  0.  Her e/(r)  is  a  cutoff  function,  e.g.,  [1  -  exp(-/3r)]/+I ,  required  for 
potentials  which  are  singular  at  the  origin.  During  the  iterative  calculation  the 
complex  exponent  K  of  the  Siegert  orbital  has  to  be  determined  such  that  = 
e  =  \K2  for  the  resonance  to  be  evaluated.  This  is  the  self-consistency  condition. 

The  form  of  the  matrix  to  be  diagonalized  in  the  Siegert  method  with  just  one 
channel  open  is  given  by 

(k\A(e)\k)  •••  <fc|JB(OI0> 

(a\S(c)\k)  <a|JB(OI0> 

with 

JB(e)  =  iK2  -  £(e) 


(6) 

(7) 


and 

(k\A(e)\k)  =  ~\{k\{d[\nf{r)\ldr}2\k)  -  (k\  Vf(c)\k >  .  (8) 

The  latter  matrix  element  derives  originally  also  from  the  operator  $(c)  given  in 
Eq.  ( 7 )  but  has  all  potentially  divergent  contributions  cancelled  out  from  the  start. 
Here  Vf  stands  for  the  finite  range  parts  of  all  the  potentials.  All  divergent  terms  in 
the  matrix  solution  of  Dyson’s  equation  cancel  exactly  once  self-consistency  has 
been  achieved.  The  explicit  cancellation  above  does  neither  affect  the  final  solution 
nor  the  iterative  process  to  obtain  the  final  solution. 


The  Case  of  Two  Open  Channels 

The  formalism  described  so  far  yields  the  complex  energy  of  a  two-particle-one- 
hole  ( 2p  1  h )  state  in  the  case  of  a  Feshbach  resonance  and  a  2hlp  state  in  the  case 
of  an  Auger  type  resonance.  In  both  cases,  it  has  been  assumed  that  the  metastable 
state  undergoes  autoionization  to  the  ground  state  as  the  only  possible  decay  mode 
and  that  radiative  decay  can  be  ignored.  Above  the  first  excitation  threshold  of  the 
target  system,  however,  there  are  other  decay  channels  available  and  we  face  the 
complexity  of  a  multichannel  problem.  As  we  focus  here  primarily  on  width  cal¬ 
culations,  we  use  an  approximate  version  of  the  self-energy  operator  introduced  by 
other  groups  [30],  which  is  of  sufficiently  simple  structure  to  present  the  method 
without  unwieldy  formalism.  To  compare  our  calculated  widths  to  previous  ex¬ 
perimental  and  theoretical  results,  we  apply  the  method  to  the  helium  atom,  although 
larger  atoms  are  feasible  and  even  better  candidates  for  applications  of  many-body 
theory. 

For  illustration  purposes  we  restrict  our  presentation  here  to  a  fictitious  case  of 
two  interacting  decay  channels.  This  is  adequate  for  the  present  example  if  we 
disregard  temporarily  the  existence  of  the  ls2p  excited  target  state  as  well  as  triplet 
configurations  of  the  ls2s  and  ls2p  of  the  target.  In  reality,  the  ls3s2  state  of  helium 
has  five  distinct  decay  channels  which  are  all  coupled  together.  For  the  excited 
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states  ls2s  and  ls2p  of  the  target,  we  have  included  both  singlet  and  triplet  sym¬ 
metries.  This  has  been  done,  however,  in  an  energetically  degenerate  approximation, 
so  that  the  problem  can  be  set  up  as  one  of  two  interacting  s-wave  channels  and 
one  p-wave  channel.  The  calculated  widths  are  little  affected  by  small  variances  in 
the  energy. 

With  only  two  open  decay  channels  (distinguished  by  superscripts  0  and  1 )  the 
coupled  equations  which  replace  Eq.  ( 1 )  take  on  the  following  form: 

(<?o(e)  -  MV2  +  F]}$o (r,  e)  -  J  2[0'01 (r,  r';c)$0(r’,e)  dh' 

=  Js  V,'Hr,r';e)$l(r',e)d}r/  (9) 

and 


{ <?o(e)  -  [-4  V2  +  V+  X] }  $,(r,  £)  -  J  2[l’  ‘V,  r';e)$dr',  <0  d3r’ 

=  J  ^''°\r,r'-c)i0(r',e)d3r'  (10) 

In  the  second  equation  we  have  added  and  subtracted  the  first  excitation  energy 
X  of  the  target  realizing  that  <£0(e)  =  \k\  +  X.  The  potential  V ,  which  is  part  of 
both  equations,  contains  the  Hartree-Fock  potential  corresponding  to  the  ground 
state  of  the  target .  While  this  is  appropriate  for  the  first  equation,  it  is  not  so  for 
the  second.  The  first  two  terms  of  2J[1, 1] — explicitly  given  in  Eq.  (12) — contain 
the  necessary  modification  for  an  excited  reference  state.  The  remaining  terms  of 
2[I’ 1]  describe  correlation  and  polarization  of  the  compound  system  in  the  inter¬ 
mediate  state.  In  the  following  it  is  assumed  that  the  excited  configuration  is  specified 
by  a  hole  v  in  the  valence  shell  with  the  corresponding  electron  promoted  to  the 
orbital  t. 

If  the  functions  <£  are  represented  as  a  linear  superposition  of  basis  orbitals,  the 
problem  can  be  solved  as  a  complex  matrix  eigenvalue  problem.  This  is  the  approach 
taken  here  in  solving  the  coupled  propagator  equations.  In  particular,  the  matrix 
elements  of  the  self-energy  operators  2 [<r’  a']  ( here  with  cr,  a  either  0  or  1 )  are  given 
below.  We  use  the  abbreviated  form  (  ah  |  |  cd)  for  the  antisymmetrized  two-electron 
matrix  elements  of  the  Coulomb  interaction. 


stnni  1  ^  < <*P\  \hh')(hh’\  Wp)  [  1  ^  <«P|  \PP')(PP'\  l«V>  (U) 


C  "f"  Cy  £p  £p'  Spp ' 


2[i.,ij  =  A[i]>+  j  <>gl  I <*’P>  Am  !  1  ^  (“/’l  \hh')(hh'\  | a'p) 

h+v,p±l  ch~  ep  2  p+tjljlt+v  (£  —  X)  +  ep  —  £/,  —  £),' 

+ 1  2  <a?|  \pp')(pp'\  Wt) 

2  P'p'PrJiPv  (e  —  X)  +  £i  ~  £p~  £p'  ~  Spp> 


(12) 
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and 


yto.  n_  y  h  oi  _  1  y  <> I  IP/*')<>'I  Wv) 

aot>  a'a  2.  (r  -  \  )  +  r  -  r  -  f  >  ° 

z  '  et  ep  €p 


P  P 


PP‘ 


+  2<«tM«7>)A|ii-  (13) 

p±t  eP  £v 


In  these  equations  the  operator  A [I]  expresses  the  difference  between  the  self-energy 
operator  for  the  excited  target  state  and  that  for  the  ground  state  as  a  reference 
configuration.  It  is  defined  as 

+/  occ 

A[1)  -  2  </|  |/>  -  2  </|  |/>  =  (t I  U)  -  <»|  |o>  (14) 

with  nonvanishing  matrix  elements  A  .  The  first  of  these  sums  runs  over  all  orbitals 
occupied  in  the  hf  ground  state  minus  the  vth  one  plus  the  initially  unoccupied 
orbital  t.  The  term  SPP'  indicates  a  shift  of  the  energy  denominators  corresponding 
to  repeated  interactions  of  the  two  excited  (active)  electrons.  In  the  VN~{  repre¬ 
sentation  the  particle-hole  ladders  have  been  included  in  zeroth  order,  and  we  are 
left  with  the  summation  of  just  the  particle-particle  ladders 

Spp'=i(pp'  |  | pp').  (15) 


The  factor  \  has  been  introduced  to  simulate  the  effect  of  nondiagonal  ladder  terms 
according  to  a  procedure  suggested  by  Born  and  Ohrn  [30].  The  exact  inclusion 
of  nondiagonal  ladder  sums  in  a  multichannel  approach  requires  substantially  more 
computational  effort.  It  turns  out  that  the  ladder  correction  affects  the  energy  sub¬ 
stantially  only  when  the  active  electrons  are  both  in  the  same  n  /-orbital. 


Calculation  and  Results 

The  first  step  in  our  calculation  is  a  Hartree-Fock-Roothaan  calculation  with 
respect  to  the  ground  state  of  the  helium  atom.  We  have  in  the  past  studied  various 
forms  of  analytic  basis  sets  which  in  the  present  problem  have  both  to  represent 
bound  electron  states  and  to  approximate  the  continuum  orbitals  within  a  reasonably 
extended  region  about  the  origin.  The  Slater  type  basis  used  in  the  present  study  is 
the  one  used  by  Doll  and  Reinhardt  [  3 1  ]  in  their  landmark  study  of  Green  function 
theory  applied  to  atoms.  This  basis  has  been  augmented  by  two  s  and  p  orbitals 
each  to  better  represent  the  outer  region. 

The  prominent  role  of  the  continuum  orbitals  in  resonance  calculations  warrants 
the  computation  of  initially  unoccupied  orbitals  in  the  VN~{  representation.  While 
in  general  we  have  to  distinguish  between  different  such  choices,  for  two  electron 
systems  the  only  possibility  corresponds  to  the  removal  of  one  Is  electron.  The 
present  approach  assumes  a  single  hole  in  the  valence  shell  also  for  heavier  atoms 
or  ions.  The  corresponding  Fock  operator  is  then  given  by 

S?  =  -  ±V2  -  Z/r+2Sv  -  (16) 

with  the  familiar  definition  of  the  direct  and  exchange  operators  d  and  3i. 
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The  effect  of  this  particular  VN~{  representation  is  to  calculate  the  virtual  electron 
orbitals  in  the  field  of  only  N  -  1  electrons  as  opposed  to  the  rather  unphysical 
mean  field  provided  by  the  N  electrons  of  Hartree-Fock  theory.  Although  the  re¬ 
sulting  orbital  energies  e„i  in  zeroth  order  are  not  expected  to  be  really  close  to 
experimental  values,  the  corresponding  orbitals  are  at  least  bound  and  require 
therefore  less  correction  from  higher-order  terms  of  many-body  perturbation  theory. 
With  a  basis  of  seven  s-,  five  p-,  and  two  d-orbitals,  we  obtain  three  bound  orbitals 
of  s-symmetry  and  one  each  of  p-  and  d-symmetry. 

With  this  real  basis  we  routinely  evaluate  an  approximate  real  value  of  the  res¬ 
onance  energy  by  evaluating  the  diagrams  given  in  Figure  1 .  The  analytic  contin¬ 
uation  to  the  complex  energy  plane  is  then  being  performed  by  adding  two  complex¬ 
valued  Siegert  orbitals  to  the  real  basis  of  Slater  type  orbitals.  As  in  our  previous 
work,  we  choose  the  form  of  the  complex  orbitals  as 

$Kj(r)  =  [1  -  exp ( — ft r ) ] /+ 1  exp ( iKj  -  iwl/2)  (17) 

where  the  subscripts  j  =  0,  1  now  distinguish  between  the  two  interacting  s-continua 
associated  with  the  decays  Is [( 3s)2  !S]  ls[(lsAT0s)  !S]  and  1  s [( 3s)2  lS]  -> 
ls[(2sA^is)1S],  respectively.  The  decay  ls[(3s)2  *S]  -►  ls[(2p£p)  lS]  contributes 
to  the  width  of  the  resonance  as  well.  In  fact,  it  is  by  far  the  biggest  contribution 
as  will  be  discussed  below. 

The  first  factor  on  the  right-hand  side  of  Eq.  ( 17 )  is  a  cutoff  function  to  regularize 
the  Siegert  orbital  at  the  origin.  Its  form  is  arbitrary  except  that  experience  has 
shown  that  the  chosen  form  is  simple  enough  to  allow  for  analytic  evaluation  of 
integrals  containing  the  Siegert  orbital  and  that  its  use  leads  to  very  reasonable 
results.  A  complete  basis  set  for  the  expansion  of  the  interior  region  would  not  be 
affected  by  the  choice  of  the  cutoff  factor.  In  connection  with  a  less  than  complete 
basis  set,  however,  one  expects  some  dependence  of  the  results  on  the  particular 
choice.  We  have  studied  this  dependence  by  varying  the  exponent  ft  and  found  that 
there  is  always  a  reasonably  wide  range  for  (3  within  which  the  calculated  width 
remains  almost  constant.  This  interval  ranges  from  /3  =  10  to  200.  The  results  for 
the  real  part  of  the  energy  are  even  less  affected  by  the  cutoff  function. 

For  the  transition  ls[(3s)2  *S]  -►  ls[(2pAp)  *S]  a  similar  stability  with  respect 
to  the  cut-off  exponent  has  been  found.  The  cutoff  procedure  seems  to  be  an  un¬ 
critical  one. 

The  structure  of  the  matrix  operator  to  be  diagonalized  when  two  interacting 
continua  are  present  is  given  as 

/£0(«)  -  [-5V2  4-  V]  —  St0’0]  -S[0’1] 

\  €0(e)  -  [~$V2  +  V- X,]-S£I>I] 

Here  each  entry  indicates  a  matrix  of  the  type  given  in  Eq.  (6). 

The  final  result  for  the  total  width  of  the  ls(  3s)2  resonance  has  been  determined 
to  be  T  =  25.33  meV.  This  compares  reasonably  well  to  the  experimental  values 
of  T  =  36  meV  of  Brunt  et  al.  [32]  and  T  =  32.5  meV  of  Andrick  [33],  as  well  as 
to  Nesbet’s  early  calculation  [34]  of  T  =  25.2  meV.  We  have  to  remember  that  the 
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Figure  1 .  The  diagrams  included  in  the  present  work  and  corresponding  to  terms  in  Eqs. 
( 1 1 )-( 13).  Diagrams  (a)  and  (b)  correspond  to  the  direct  and  exchange  contributions  of 
the  elastic  channel  in  VN~X  representation.  In  the  case  that  the  system  is  initially  in  the 
excited  state  the  sums  over  the  intermediate  states  are  restricted  as  explicitly  given  in  the 
formulae  of  Eqs.  ( 12)  and  ( 13).  The  boxes  symbolize  the  contribution  from  the  diagonal 
ladder  sum  [Eq.  (15)]  and  the  double-stroke  fermion  lines  indicate  renormalized  hole 
energies:  In  VN~l  representation  the  infinite  sums  of  particle-hole  interactions  are  taken 
into  account  in  zeroth  order  which  decouples  the  hole  lines  from  the  rest  of  the  diagrams. 
In  order  to  improve  the  energy  denominators,  it  is  justified  to  replace  the  Hartree-Fock 
orbital  energy  by  a  higher  order  approximation,  e.g.,  the  hole  energy  of  the  present  Green 
function  calculation.  Because  in  the  present  work  the  energy  is  of  small  concern,  we  do 
not  emphasize  this  particular  point  and  use  Hartree-Fock  values  for  all  occupied  orbital 
energies.  Diagram  (c)  is  the  direct  diagram  corresponding  to  an  inelastic  scattering  event. 
The  corresponding  exchange  diagram  has  the  same  topology  with  the  upper  labels  p  and 
p'  interchanged.  The  terms  ( e )  and  (g)  have  to  be  included  to  obtain  consistency  through 
second  order.  Diagrams  (d)-(h)  affect  the  change  in  reference  state  which  is  roughly 
equivalent  to  the  familiar  ASCF  method.  The  corresponding  potential  is  symbolized  by 
AI1]  and  is  defined  by  Eq.  ( 14). 


basis  employed  in  this  study  is  minimal.  But  given  the  small  size  of  the  basis  set 
and  the  approximations,  the  results  of  this  study  are  encouraging  because  the  nu¬ 
merical  results  exhibit  a  high  degree  of  computational  stability.  In  particular,  the 
long-standing  problem  of  finding  zero  eigenvalues  of  the  Dyson  equation  close  to 


110  ZHAN,  ZHANG,  AND  WINKLER 

the  zeroes  of  the  denominators  of  the  self-energy  operator  seems  to  have  disappeared 
since  we  use  the  Simplex  algorithm  AMOEBA  [  35  ]  instead  of  the  more  conventional 
Newton-Raphson  approach. 

Our  result  for  the  total  width  consists  of  the  combined  contributions  of  the  decays 
leading  to  the  interacting  s-continua  ris+2s  =  7.13  meV  and  with  the  contribution 
from  the  p-channel  included  rls+2s+2p  =  25.33  meV.  It  is  interesting  to  turn  off 
the  channel  interaction  between  the  s-channels  in  Eqs.  (9)  and  (10).  The  partial 
widths  of  the  individual  contributions  then  are  =  2.8  meV  for  the  transition  to 
ls[(  ls&os)  lS]  and  T2s  =  1.5  meV  for  the  transition  to  ls[(2s/cjs)  JS].  The  sum  of 
these  contributions  is  noticeably  different  from  the  value  for  T is+2s  given  above. 
This  result  demonstrates  the  increased  accuracy  of  the  solution  functions  $Kq  and 
$Kl  in  the  coupled  approach  as  opposed  to  the  solutions  in  the  uncoupled  case. 

The  calculated  real  part  of  the  energy  is  22.53  eV  which  compares  reasonably 
well  to  the  experimental  result  [32]  of  (22.45  ±  0.02)  eV. 

Discussion 

The  aim  of  the  direct  Siegert  method  is  to  calculate  a  decaying  state  as  if  it  were 
stationary.  In  order  to  achieve  this  goal,  an  analytic  continuation  onto  the  complex 
energy  plane  has  to  be  performed.  This  has  been  done  here  by  augmenting  a  localized 
basis  set  of  exponentially  decreasing  functions  with  functions  that  satisfy  the 
asymptotic  resonance  boundary  condition  [27] .  As  this  boundary  condition  depends 
on  the  complex  eigenenergy  of  the  decaying  state,  an  iterative  computation  is  re¬ 
quired.  This  iterative  procedure  is  an  integral  part  of  the  Green  function  method 
even  in  calculations  of  real-valued  ionization  potentials.  The  main  additional  com¬ 
plication  in  resonance  calculations  is  the  need  to  search  the  complex  plane  for  poles 
of  the  propagator.  In  practice,  this  is  being  done  by  varying  the  complex  wave 
number  K  of  the  Siegert  orbital  until  self-consistency  has  been  achieved. 
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Note  Added  in  Proof 

After  submission  of  the  manuscript  we  extended  the  present  calculation  to  fully 
include  the  coupling  of  intermediate  states  in  which  two  p-electrons  are  coupled 
to  zero  orbital  angular  momentum.  This  extension  yields  an  improved  width  of 
T  =  33.01  meV,  which  is  in  almost  perfect  agreement  with  the  experimental  data 
referred  to  in  the  manuscript. 
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Abstract 

Optical  nonlinearities  affecting  absorption  spectra,  above  threshold  dissociation,  bond  softening,  and 
vibrational  trapping  are  among  the  most  important  processes  that  have  so  far  been  examined  when 
describing  the  dynamics  of  multiphoton  excitation  and  fragmentation  of  molecular  systems  subject  to 
intense  laser  fields.  The  relative  merits  of  time-dependent  and  independent  approaches  are  overviewed 
and  discussed  within  the  framework  of  Hj  photodissociation  for  which  recent  experimental  data  are 
available.  Isotopes  Hj  —  behave  in  a  markedly  different  manner  with  respect  to  the  radiative  coupling. 
Some  of  these  differences  are  exploited  for  the  detailed  understanding  of  the  underlying  mechanism  of 
the  bond  softening  process.  It  is  shown  that  this  is  not  merely  a  single  photon  potential  barrier  lowering 
mechanism  as  has  previously  been  invoked,  but  a  competition  between  this  mechanism  and  another 
five  photon  exchange  mechanism,  namely  the  simultaneous  absorption  of  three  photons  followed  by  the 
subsequent  emission  of  two  photons.  As  for  the  vibrational  trapping  process,  the  relative  sensitivity  of 
the  isotopes  to  the  laser  characteristics  is  such  that,  by  adequately  adjusting  the  wavelength  and  the 
intensity,  one  can  suppress  the  fragmentation  of  one  of  the  isotopes  by  locally  confining  it,  whereas  the 
other  dissociates  very  fast.  This  is  used  as  a  possible  scheme  for  isotope  separation  using  intense  lasers. 
©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

The  technological  possibility  to  reach  large  radiation  field  strengths  and  recent 
advances  in  high-intensity  experiments  and  above-threshold  ionization  of  atoms 
[  1  ]  have  oriented  renewed  interest  in  the  strong  laser  induced  multiphoton  disso¬ 
ciation  dynamics  (mpd)  of  small  polyatomics  [2,3]-  Optical  nonlinearities  and 
important  distorsions  in  absorption  lineshapes  have  experimentally  been  observed, 
for  field  intensities  exceeding  10 10  W/cm2  [2].  At  higher  intensities  even  more 
intriguing  is  the  rise  of  equally  spaced  multiple  peaks  in  the  fragments  kinetic- 
energy  distribution  spectra,  corresponding  to  the  absorption  of  several  photons  [4] . 
Situations  where  more  than  one  photon  are  simultaneously  absorbed,  although  the 
energy  carried  by  one  of  them  may  be  enough  for  the  fragmentation  to  occur,  are 
depicted  and  termed  above-threshold  dissociation  (atd)  [3,5].  Following  the  ex¬ 
citation  process,  subsequent  stimulated  absorption  and  emission  mechanisms  may 
take  place  during  the  fragmentation  itself,  leading  to  product  distributions  carrying 
information  on  the  cumulative  history  of  the  overall  dynamics.  Photofragments 
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with  low  kinetic  energy  may  be  very  abundant  in  strong  fields  due  to  the  breaking 
of  their  velocity  by  a  kind  of  bremsstrahlung  effect  resulting  from  a  photon  emission 
[  3  ] .  The  interplay  between  the  multiphoton  excitation  step  and  radiative  processes 
taking  place  in  mpd  reveals  two  notable  intense-field  phenomena:  (i)  the  chemical 
bond  can  be  “softened”  (bond  softening  bs)  by  the  laser-induced  lowering  of  some 
potential  energy  barriers  leading  to  efficient  photodissociation  of  low-lying  vibra¬ 
tional  levels  [4,6];  (ii)  in  contrast  to  the  low-lying  states  which  become  less  and 
less  stable,  for  some  specific  frequencies  and  intensities  the  lifetimes  of  high-lying 
resonances  may  increase  and  consequently  the  molecular  bond  may  be  harder  to 
break.  The  laser  is  confining  the  molecule  in  a  finite  region  of  the  phase  space 
inducing  thus  a  stabilization  and  a  population  trapping  which  have  been  termed 
vibrational  trapping  vt  [7,8]  or  bond  hardening  [9b]. 

Molecular  hydrogen  ion  represents  the  model  system  for  these  studies  mainly 
because  of  its  simple  and  known  structure,  involving  within  the  Bom-Oppenheimer 
approximation,  two  electronic  states  well  separated  from  the  higher  excited  electronic 
manifold.  The  ground  and  dissociative  states  which  describe  the  molecular  disso¬ 
ciation  process: 

Hi  ( 1  sag,  v,J)  +  nhco-+  Hj  (2 pau)  H+  +  H  ( Is) 

correspond  to  ( Ua^)  and  (2pau)  respectively  [10]. 

Isotopes  (Hj ,  Dj )  show  quite  different  behaviors  with  respect  to  strong  radiative 
couplings,  leading  to  barrier  lowerings  or  trappings  which  monitor  the  aforemen¬ 
tioned  multiphoton  bond  softening  or  hardening  mechanisms.  They  have  experi¬ 
mentally  been  addressed  by  Yang  et  al.  [  1 1  ]  who  measured  different  atd  branching 
ratios  for  Hj  and  Df  over  a  wide  range  of  laser  parameters  (8  10nto4  1013W/ 
cm2  for  532  and  527  nm  wavelengths  and  50  ps  to  10  ns  pulses).  Their  results 
showing,  in  particular,  a  higher  two-versus  one-photon  dissociation  ratio  for  the 
heavier  isotope,  are  in  qualitative  agreement  with  the  bs  model. 

The  present  article  is  devoted  to  a  detailed  understanding  of  the  bond  softening 
and  vibrational  trapping  mechanisms  taking  place  during  the  dissociation  of  Hj 
and  its  isotope  Dj  when  subjected  to  intense  continuous-wave  cw  lasers  ( 10 8- 10 14 
W/cm2).  More  precisely,  concerning  bs  studied  at  the  resonant  three-photon  ab¬ 
sorption  at  329.7  nm  wavelength,  it  is  shown  how  the  different  behaviors  of  the 
isotopes  may  enlighten  the  discussion  concerning  the  competition  between  two 
possible  fragmentation  pathways,  i.e.,  a  single  photon  tunneling  process  through  a 
potential  barrier  lowering  mechanism,  or  a  five-photon  process  resulting  from  an 
excitation  by  three-photon  absorption  followed  by  two  subsequent  photon  emissions 
[  12-14] .  As  for  the  vt  mechanism,  an  application  aiming  to  isotope  separation  in 
the  Hj  -  Dj  system  is  proposed.  This  is  actually  based  on  laser-induced  quasi¬ 
bound  states  that  can  suppress  the  photodissociation  of  one  of  the  isotopes  whereas 
the  other  has  still  a  large  dissociation  rate.  Such  states,  the  existence  of  which  has 
previously  been  evidenced  [15],  can  be  reached  by  appropriately  adjusting  laser 
parameters  (wavelength  within  the  range  95  to  120  nm  corresponding  to  a  leading 
single-photon  absorption  and  intensities  of  the  order  of  10 13  W/cm2).  The  next 
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section  gives  a  general  overview  of  the  theoretical  models  commonly  used  in  this 
context.  BS  and  vt  processes  with  their  underlying  mechanisms  referring  to  isotope 
effects  are  discussed  in  the  third  and  fourth  sections,  respectively. 

Theoretical  Models 

A  considerable  amount  of  theoretical  effort  has  recently  been  devoted  to  Hj 
dissociation  in  strong  laser  fields,  which  can  conceptually  be  viewed  within  two 
different  frames.  The  first  is  a  time-dependent  approach  [  16].  It  offers  a  thorough 
physical  picture  of  a  time-resolved  mpd  basically  driven  by  an  intense  electromag¬ 
netic  field  which,  practically,  can  only  be  achieved  using  short  laser  pulses.  One 
has  to  calculate  the  time  evolution  of  a  wave  packet,  under  the  action  of  a  time- 
dependent  Hamiltonian  H(t ).  The  Schrodinger  equation  governing  this  evolution 
is: 


ih 


dt  [iu(R,t)_ 


=  H(t) 


'tg(R,ty 

.MR,t). 


(1) 


where  R  designates  the  dissociative  nuclear  coordinate.  and  tpu  are  the  wave- 
packet  components  on  the  ground  g  and  excited  u  states  which  are  radiatively 
coupled.  The  Hamiltonian,  in  the  Born-Oppenheimer  approximation,  is  written 
as: 


H(t)  = 


-n(R)G(t) 


~1/2i&+ K{R) 


(2) 


and  VU(R)  are  the  corresponding  potential  energies  and  g(R)  the  electronic 
transition  dipole  moment.  In  the  absence  of  any  field,  the  ion  is  supposed  to  be  at 
the  v  =  0  vibrational  level  of  its  ground  state  g.  The  electromagnetic  field  amplitude 
&  (t)  is  given  as  a  product  of  a  time-dependent  shape  function  e(t)  by  a  real  cosine 
form,  with  peak  frequency  w: 

<£(0  =  e(t)  cos  a )t  (3) 

A  variety  of  techniques  have  been  used  to  solve  [Eq.  ( 1 )]  among  which  wave- 
packet  propagations  by  repeated  application  of  short  time  propagators  in  the  split 
operator  form  [16,17];  wave  operator  theory  within  Bloch  formalism  [6,13]  or 
adiabatic  time  evolution  [18].  The  sensitivity  of  the  results  with  respect  to  the  pulse 
shape  has  extensively  been  studied  [13,18]  in  particular  for  an  accurate  determi¬ 
nation  of  the  total  number  n  of  photons  taking  part  in  an  atd  or  BS  mechanism 
ending  up  with  a  net  absorption  of  p  photons  [  1 3  ] .  An  additional  advantage  of  the 
finite  duration  of  the  radiative  interaction  is  that  difficulties  related  with  the  use  of 
the  length  or  velocity  gauges  [  3  ]  can  simply  be  relaxed.  We  have  however  to  point 
out  that  the  possible  presence  of  long-lived  resonances  inducing  the  local  trapping 
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of  the  wave  packet  (as  in  vt  for  instance)  or  merely  its  spreading,  render  time- 
dependent  calculations  unefficient  and  prohibitively  long. 

A  Floquet  expansion  of  the  molecule-plus-field  Hamiltonian,  valid  for  pulses 
long  enough  to  lead  to  near  periodic  laser  fields,  results  into  a  time-independent 
approach.  The  main  advantage  of  this  approach  is  the  interpretative  tool  it  provides, 
in  terms  of  the  stationary  field-dressed  adiabatic  molecular  states  which  have  so  far 
been  the  basis  for  the  understanding  of  the  underlying  molecular  bond  models 
[  3,9  ] .  We  have  to  emphasize  that  the  Floquet  analysis  we  are  referring  to,  becomes 
exact,  only  within  the  assumption  of  a  cw  laser,  for  which: 

&(t)  =  eQ  cos  wt  (4) 

The  Hamiltonian  [Eq.  (2)]  being  time-periodic,  Floquet’s  theory  states  that  the 
solution  for  a  given  energy  E  can  be  expanded  as: 

=  e~iEtlh  2  eim<at<j>e,m{R)  (5) 


with  £  ~  g  or  u.  <£’s  are  the  unknown  nuclear  functions.  By  substitution  of  [Eq. 
(5)]  into  [Eq.  (1)],  one  gets: 


2*' 


\W2+  VAR)  +  mhw-E 


-  n(R)e0[elwt  +  e~m](j)e'±e,m(R) 


0  (6) 


After  multiplication  by  e~inu>\  followed  by  a  time  integration  over  a  finite  interval 
T,  and  making  use  of  the  identity: 


T 


gimwt 


dt 


(7) 


a  system  of  time-independent  close  coupled  equations  results: 

1  d 2 
2dR 


>2  +  Ve(R)  +  nhoo  -  E 


=  -  iL(R)c0[<f>c±e,n-\(K)  +  <f>e'*t,n+i(R)]  (8) 


The  network  of  diabatic  potentials  of  the  dressed  molecule  is  indicated  in  Figure 
1,  where  each  electron-field  channel  is  labeled  by  a  pair  of  indices  (g  or  u  for  the 
electronic  part  together  with  the  specification  of  the  photon  number  n).  It  is  precisely 
this  curve-crossing  model  or  its  adiabatic  representation,  where  the  interchannel 
coupling  is  diagonalized  (see  Fig.  1 ),  which  supports  the  understanding  of  potential 
barrier  lowerings  or  adiabatic  channel  closings,  at  the  origin  of  the  BS  and  vt 
mechanisms. 
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Figure  1.  Potential  energy  curves  of  HJ  dressed  by  a  cw-laser  of  329.7  nm  wavelength 
and  of  intensity  1.4  X  1013  W/cm2.  The  solid  lines  correspond  to  the  diabatic  channels. 
The  dotted  lines  represent  the  adiabatic  curves  resulting  from  the  diagonalization  of  the 

radiative  interaction. 


For  moderate  field  intensities  and  in  cases  where  radiative  couplings  are  localized 
at  curve  crossing  regions  a  semiclassical  approximation*  may  be  assumed  as  a 
simple  interpretative  tool  for  the  relative  probabilities  of  photon  absorption  or 
emission  processes.  More  precisely  the  Landau-Zener  probability  W  fjz  for  a  tran¬ 
sition  between  diabatic  channels  i  and  j  is  given  by  [  19]: 

Wj]z  =  1  -  exp(-x^z)  (9) 


with  the  Landau-Zener  parameter  [7a]: 


2V2 

L7  IJ 

U  ~  hv  |  AF y  | 


=  1.662  10"11 


Ml/2fi2I 

AEl/2\AFij\ 


(10) 


where  Vjj  and  v  are  the  coupling  (radiative  in  the  present  case)  and  the  classical 
velocity  at  the  crossing  point  Rc .  M  is  the  reduced  mass  in  a.m.u.,  (jl  in  a.u.,  I  the 
field  intensity  in  Wf cm2,  A E  the  scattering  energy  in  cm-1  measured  from  the 
energy  at  the  crossing  point  position,  and  A F,y  the  difference  of  slopes  in  cm-1  /A 
of  the  diabatic  potentials  at  their  crossing.  This  formula  shows  in  particular  that 
the  diabatic  transition  probability  from  g  to  u  at  their  crossing  position  (see  Fig.  1 


*  Throughout  this  article  “quantal”  or  “semiclassical”  apply  only  to  the  molecule,  the  field  being 
always  introduced  classically. 
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at  Rc  ~  3.5  a.u.)  is  increasing  with  the  laser  intensity.  In  other  words,  the  adiabatic 
path  (leading  to  a  photon  emission  when  going  from  channel  \2pau,  n  -  2)  to 
channel  |  ls(rg,  n—  1 ))  is  favored  for  higher  intensities  or,  at  a  given  intensity,  for 
the  heaviest  isotope. 

The  semiclassical  theory  has  also  been  invoked  for  supporting  the  interpretation 
of  the  vt  mechanism  [20]  by  the  presence  of  a  field-induced  upper  closed  adiabatic 
potential  (as  in  Fig.  2)  where  the  wave  packet  can  be  trapped  [8].  The  existence 
of  quasi-bond  levels  (or  infinitely  long-lived  resonances)  in  curve  crossing  situations 
has  first  been  pointed  out  by  Bandrauk  and  Child  in  semiclassical  electronic  pre¬ 
dissociation  models,  where,  accidentally,  a  bound  level  supported  by  the  attractive 
upper  adiabatic  potential  coincides  with  a  vibrational  level  supported  by  the  closed 
diabatic  potential  [15].  When  the  curve  crossing  problem  results  from  radiatively 
coupled  field-dressed  molecular  states,  coincidences  may  be  obtained  at  will  by  just 
properly  adjusting  the  laser  wavelength  (i.e.,  the  relative  energy  positioning  of  the 
potentials)  and  intensity  (i.e.,  the  strength  of  the  coupling).  More  precisely,  a  res¬ 
onance  of  zero  width  results  from  the  semiclassical  evaluation  of  the  poles  of  the 
scattering  matrix  when  a  diabatic  level  of  turning  points  a-  and  b+  is  close  to  an 
adiabatic  level  of  turning  points  a+  and  b+  (all  turning  points  being  associated  with 
adiabatic  potentials  V-  and  V+)  [  15,21  ] .  The  destructive  interference  which  results 
can  be  interpreted  [22]  using  Child’s  diagrammatic  method  [23]  given  as  an  inset 
of  Figure  2.  The  lines  with  the  arrows  indicate  the  adiabatic  channel  JWKB  wave 
functions  and  the  rectangular  boxes  designate  the  splitting  and  mixing  operations 


Figure  2.  Same  as  for  Figure  1  for  a  cw  laser  of  120  nm  wavelength  and  of  intensity 
1.25  X  10 13  W/cm2.  The  corresponding  Child’s  diagram  is  shown  as  an  inset. 
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undergone  by  the  wavelets  at  the  crossing  position  Rc.  The  Gamow-Siegert  reso¬ 
nance  criterion  imposes  a  zero  incoming  amplitude  in  the  open  channel  F_,  i.e. 
VL  —  0.  The  corresponding  outgoing  amplitude  VL  results  from  an  infinite  sum¬ 
mation  of  elementary  steps  each  representing  reflections  of  the  wavelets  at  the  left 
( a+  and  a-)  and  right  ( b+ )  turning  points  and  their  mixing  around  Rc.  For  such 
an  elementary  step,  an  amplitude  V'+  deposited  in  channel  V+  at  Rc ,  undergoes  a 
unique  reflection  at  b+,  followed  by  reflections  at  a±  after  a  channel  mixing  at  Rc, 
and  results  into  an  outgoing  amplitude  VL  [  19b, 21  ]: 

VL  =  -X(l  -  x2)i/2e~ix[e2i^++x)  -  e2i(a-+M]V'+  (11) 

where  X  and  the  phase  correction  factor  X  are  given  in  terms  of  a  real  valued 
parameter  v  [  1 9b,2 1  ] : 


X  =  exp(  —  tv) 


(12) 


and 


X  =  arg  T(iv)  —  v  In  v  +  7t/4 


(13) 


For  weak  field  situations  or  constant  (i.e.,  ^-independent)  radiative  couplings,  v  is 
merely  the  Landau-Zener  parameter  given  by  [Eq.  ( 10)].  It  is  also  to  be  noticed 
that  X  varies  from  —  7r/4  in  the  weak  field  limit,  to  0  in  the  strong  field  limit  [22]. 
Other  energy-dependent  phase  factors  building  up  [Eq.  (11)]  are: 


a±(E) 


Ja- u 


k±(R)dR ;  P+(E)  =  k+(R)  dR 


J"b+ 
Rc 


(14) 


where 


k±(R) 


h 


[E~V±(R)] 


1/2 


Actually,  the  width  of  the  resonance  being  given  by  [22,24]: 


r  = 


Is4-""-1’ 


(15) 


(16) 


a  quasi-bound  state  of  infinite  lifetime  corresponds  to  a  vanishing  outgoing  am¬ 
plitude  V  L  in  the  open  channel.  This  can  be  obtained  if  all  elementary  steps  con¬ 
tribute  a  null  amplitude  VL,  by  cancellation  of  the  sum  of  exponentials  in  [Eq. 
(11)]  or  by  fulfilling  the  following  conditions: 

a+(E)  +  0+(E)  +  X  =  (14  +  1/2)t r  (17a) 

a^(E)  +  (3+(E)  =  (v2  +  1/2)7 r  (17b) 

with  integer  values  for  v+  and  v2 .  These  conditions  are  close  to  state  that  the  energy 
E  is  that  of  a  vibrational  bound  state  of  the  adiabatic  potential  F+(i?)  [Eq.  17(a)] 
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or  of  the  ground  diabatic  potential  [Eq.  17(b)].  The  position  Er  and  the  width  of 
the  resonance  as  resulting  from  the  semiclassical  analysis  are  [  19b,21  ]: 

Er  =  (EV2  +  uxEv+)/(  1  +  ux)  (18) 

r  =  TTUX(  \  +  ux2)(EV2-  Ev+)2/[hu2(  1  +  UX)3]  (19) 

where 

u=\'2~  1  (20) 

and 

x  =  0)2/^+  (21 ) 

o>2  and  a>+  being  the  local  energy  spacings.  Equations  (18)  and  (19)  give  the  lowest 
order  formula  in  terms  of  the  energy  differences  involved,  indicating  that  a  coin¬ 
cidence  between  the  v+  adiabatic  level  ( of  energy  Ev+ )  and  the  v2  diabatic  level  ( of 
energy  EVl )  leads  to  a  laser-induced  quasi-bound  state  monitoring  the  suppression 
of  dissociation  by  a  vt  mechanism  in  the  E+(i?)  closed  diabatic  channel. 

The  time-independent  quantum  version  of  the  theory  presents,  however,  some 
difficulties  in  relation  with  the  so-called  persistent  effects  arising  in  the  case  of  a 
homonuclear  ion  characterized  by  a  spatially  diverging  dipole  moment.  The  diabatic 
channels  remain  coupled  at  infinite  R,  in  the  electric-field  gauge  jiG  producing  thus 
a  nonstandard  scattering  situation:  observables  related  to  asymptotically  well- 
behaved  noninteracting  free  fragments  cannot  be  defined.  The  adiabatic  represen¬ 
tation  transfers  the  persistent  effects  into  nonadiabatic  couplings  which  remain 
local,  allowing  thus  a  proper  description  of  the  free  fragments,  but  introduces  ad¬ 
ditional  kinetic  terms  in  Eq.  (8)  [25].  It  has  very  recently  been  shown  how  the 
complex  rotation  of  the  coordinate  R  circumvent  these  difficulties  leading  to  an 
accurate  treatment  of  the  close-coupled  equations  [Eq.  (8)]  even  with  diverging 
interchannel  couplings  [26].  Another  limitation  is  the  proper  description  of  the 
preparation  step.  In  very  strong  field  situations,  leading  to  resonance  overlappings, 
the  correspondence  between  the  field-free  initial  molecular  state  and  laser-induced 
resonances  is  hazardous.  An  elegant  technique  is  the  so-called  artificial  channel 
[12].  The  treatment  is  a  generalization  of  Shapiro’s  work  [27]  and  refers  to  two 
artificial  channels:  the  first  open  artificial  channel  aiming  to  transform  the  otherwise 
half-collision  situation  into  a  full  collision  and  the  second  closed  artificial  channel 
being  the  true  initial  unperturbed  (field-free)  molecular  state.  A  summation  over 
field  induced  resonances  (including  cases  of  overlapping)  is  carried  out  in  an  indirect 
way,  from  the  scattering  amplitude  between  the  artificial  entrance  channel  and  the 
final  physical  continuum. 

The  outcome  of  these  calculations  are  the  total  and  partial  dissociation  rates  or 
reaction  probabilities  (i.e.,  branching  ratios)  to  reach  fragmentation  channels  de¬ 
scribing  the  net  absorption  of  a  given  number  of  photons.  They  are  obtained  either 
by  an  asymptotic  flow  analysis  of  the  wave-packet  components  on  the  different 
electronic  states  in  the  time-dependent  version,  or  by  solving  the  close-coupled 
system  of  the  time-independent  version.  In  the  stationary  approach  the  total  rate 
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results  from  the  quantized  complex  energy  eigenvalue  of  the  resonance  as  its  imag¬ 
inary  part  [9a]  and  the  obtention  of  the  partial  widths  is  based  on  a  complex-energy 
probability-flux  analysis  [26]  or  on  the  evaluation  of  appropriate  matrix  elements 
of  the  scattering  operator  of  the  artificial  full  collisional  model  [12]. 

Although  rotational  effects  have  already  been  discussed  by  McCann  and  Bandrauk 
[28  ] ,  in  calculations  to  be  presented  hereafter  we  are  using  a  J-conserving  approx¬ 
imation,  as  has  often  been  done  in  previous  works  [8].  Such  an  approximation 
may  be  relaxed  by  including  an  expansion  in  a  reasonably  complete  set  of  rotational 
states  in  the  time-independent  version  or  by  an  appropriate  angle-dependent  prop¬ 
agator  in  the  time-dependent  approach  [29]. 

Isotope  Effects  in  Bond  Softening 

Calculations  are  done  for  the  Hj  -  Dj  system,  for  several  field  values  covering 
the  range  from  weak  (/==  108  Wf  cm2)  up  to  very  strong  (I  =  10 14  W/  cm2)  intensities 
for  the  laser  wavelength  of  A  =  329.7  nm  which  corresponds  to  an  energetically 
favorable  vertical  three-photon  transition  from  the  bound  ( 1  S(Tgy  v  =  0  ,j-  1 )  to 
the  continuum  (2 pau)  state.  Time-independent  models  referring  either  to  complex 
quantization  of  the  complex  scaled  Hamiltonian  [26]  or  artificial  channel  technique 
[12]  give  results  which  are  in  close  agreement,  when  convergence  is  reached  with 
respect  to  the  number  of  Floquet  blocks.  In  addition,  time-dependent  wave-packet 
calculations  with  a  pulsed  laser,  also  lead  to  qualitatively  comparable  informations 
[6].  The  results  obtained  for  a  cw  laser  acting  on  Hf  are  displayed  in  Figure  3. 
Four  regimes  are  depicted  with  respect  to  the  intensity: 

( i )  For  low  intensities  ( I  <  1 0 10  W/  cm2 )  a  single  photon  absorption  mechanism 
prevails,  as  is  expected  from  Fermi  golden  rule.  Although  the  total  dissociation  rate 
T  remains  very  low  ( <  1 0  "9  cm-1 )  due  to  very  weak  radiative  couplings  or  inefficient 
tunneling  between  |  lsag9  n  +  1 >  and  1 2ptru,  n)  a  local  intensity  power  law  (T  oc 
/"*),  as  displayed  in  the  inset  of  Figure  3,  gives,  in  conformity  with  a  linear  regime, 
a  plateau  value  at  n*  -  1. 

(ii)  For  intermediate  intensities  (7  =  1010  B7/ cm2- 10 12  W/cm2)  the  dissociation 
basically  proceeds  through  the  absorption  of  three  photons,  i.e.,  through  channel 
\2p<ru,  n-  2)  with  a  dissociation  rate  T  reaching  ca.  10"5  cm-1 .  The  three-photon 
partial  width  is  dominating  over  the  single-photon  width,  with  only  a  moderate 
contribution  from  the  two-photon  one.  This  corresponds  to  an  atd  regime  with 
the  appearence  of  another  plateau  value  for  the  power  law  at  n*  =  3. 

(iii)  For  high  intensities  (7  =  10 12  W/cm2-10 13  W/cm2)  there  is  a  competition 
between  the  three-  and  two-photon  partial  widths.  This  can  be  envisioned  in  terms 
of  partial  fluxes  along  the  adiabatic  path  of  Figure  1.  Following  the  simultaneous 
absorption  of  three  photons  (|  1  sag,  n  +  1)  ->  \2pau,  n  —  2))  one  photon  is 
subsequently  emitted  during  the  dissociation  process  itself  and  the  system  ends  up 
in  channel  |  \sag,  n  -  1 )  with  a  net  amount  of  two  absorbed  photons.  The  power 
law  remains  with  a  plateau  at  n*  =  3  and  the  total  rate  approaches  T  sa  10"2  cm"1 . 

(iv)  For  very  high  intensities  (7  >  10 13  W/cm2)  surprisingly  the  one-photon 
partial  width  is  increasing  very  fast  with  rather  high  total  dissociation  rates  ( T  of 


logl0I(W/cm 2) 

Figure  3.  Branching  ratios  (%)  accounting  for  the  net  absorption  process  of  one  (solid 
line),  two  (dashed  line),  or  three  (dotted-dashed  line)  photons  as  a  function  of  intensity. 
The  corresponding  power  law  n*  as  a  function  of  intensity  is  given  as  an  inset. 


the  order  of  several  cnT1).  The  laser-induced  deformation  of  the  molecular  force 
field  is  so  large  that  the  system  fragments  efficiently  through  the  BS  mechanism. 
There  is  no  integer  n*  which  satisfies  the  power  law.  This  means  that,  if  the  power 
law  is  to  be  related  to  the  Born  expansion  of  the  transition  operator:  T  =  V  + 
VG0V  +  VG0VG0V  +  •  •  •  •  (G0  being  the  resolvent)  in  terms  of  powers  of  the 
radiative  coupling  V,  there  is  no  leading  term  ensuring  the  convergence  of  the  series. 
The  understanding  of  the  underlying  mechanism  is  still  an  open  question. 

An  early  interpretation  emphasizes  the  net  single-photon  mechanism  by  very 
important  lowering  and  flattening  of  the  adiabatic  potential  barrier  at  the  avoided 
curve  crossing  occuring  between  1 1  s<rgi  n  +  1)  and  |2 pau>  n)  channels,  leading 
to  fragmentation  by  tunneling  (cf.  Fig.  1 )  [5,6,12] .  More  recently,  it  has  also  been 
noted  that  another  five-photon  mechanism,  namely  the  initial  absorption  of  three 
photons  followed  by  the  emission  of  two  photons,  may  compete  with  the  previous 
mechanism.  Although,  due  to  this  possible  competition,  there  is  no  plateau  at 
n*  =  1  or  n*  =  5  for  the  total  dissociation  rate,  the  interpretation  of  spectral  widths 
obtained  from  time-dependent  calculations  support  the  five-photon  mechanism 
[  1 3  ] .  It  is  worthwhile  to  point  out  that  a  closer  inspection  of  the  way  in  which  the 
isotope  substitution  affects  the  dissociation  branching  ratios  may  give  additional 
information  concerning  the  inherent  mechanisms  [  14] . 
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A  measure  of  the  efficiency  of  the  dissociation  via  three-photon  versus  two-photon 
channels  (or  two-photon  versus  one-photon)  is  the  ratio  R32  =  P3/P2  (or  R2\  = 
P2fP\)  of  the  partial  widths  corresponding  to  dissociation  with  a  net  absorption  of 
one  ( Pi ) ,  two  ( P2 )  or  three  photons  (P3).  Figure  4  displays  their  values  as  a  function 
of  the  field  intensity  and  for  the  two  isotopes.  As  has  previously  been  stated,  it  is 
clear  from  [Eq.  (9)]  that  the  heavier  isotope  D2  with  less  velocity  behaves  more 
adiabatically,  i.e.,  realizes  in  a  more  efficient  way  the  diabatic  jump,  than  the  lighter 
H2 .  For  intermediate  and  high  intensity  regimes,  where  the  fragmentation  pro¬ 
cess  is  monitored  by  the  three-  and  two-photon  crossing  channels,  the  \2p<yu, 
n  -  2) \lsag,n-\)  diabatic  transition  is  favored  for  the  heavier  isotope.  The 
results  are  in  conformity  with  the  lz  model,  i.e.:  R32  (H2 )  >  R32  (Dj)-  For  very 
high  intensities,  the  branching  ratio  P3  toward  the  three-photon  dissociation  channel 
remains  negligible  and  it  is  the  ratio  jR2i  of  the  two-  to  the  one-photon  channel 
which  is  significant  for  the  discussion.  The  results  displayed  in  Figure  4,  namely 
*2i(H2+)>i?2i(Dj),  cannot  be  interpreted  in  terms  of  a  net  single  photon  absorp¬ 
tion  process  by  a  tunneling  through  an  adiabatic  potential  barrier.  This  mechanism 
would  favor  the  light  H2 ,  which  in  addition  experiences  a  lower  barrier  (smaller 
mass,  higher  vibrational  frequency),  leading  to  i?2i(H2 )  <  i?2I(D2 ). 

Conversely,  a  coherent  five-photon  interpretation  within  the  LZ  model  can  be 
retained,  referring  to  three  steps:  (i)  an  initial  three-photon  absorption  between 
indirectly  coupled  diabatic  channels  |  lso^,  n  +  1 )  and  1 2pou,  n  —  2)  at  the  avoided 
crossing  position  R  ^  2  a.u.  (cf.  Fig.  1 );  (ii)  a  first  photon  emission  at  R  ~  3.5  a.u. 
at  the  position  of  the  second  curve  crossing  between  radiatively  coupled  channels 


Log  [I  (W1 cm2)] 

Figure  4.  Ratio  of  the  partial  widths  (Ry  =  PJPj)  as  a  function  of  the  field  intensity. 
Small  dark  circles  for  R32(HJ);  large  dark  circles  for  R32(DJ),  small  light  circles  for 
R2i(F12  ) ,  and  large  light  circles  for  R2i(D2  ). 
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1 2 p<ju,  n  -  2)  and  |  \S(rg,  n  —  1);  (iii)  a  subsequent  second  photon  emission 
leading  to  a  transition  from  the  diabatic  channel  1 1  sag,  n  -  1 )  to  \2p<ju,  «)  (but 
without  curve  crossing).  For  the  first  two  steps  the  transition  probability  is  larger 
for  the  heavier  isotope  Dj .  Although  the  LZ  formula  cannot  directly  be  transposed 
for  the  last  step,  one  can  presumably  argue  that  the  heavier  isotope  is  again  favored 
for  this  transition,  namely  because  of  its  lower  velocity,  such  that  actually  Dj  is 
exposed  to  the  radiative  field  for  a  longer  time  than  Hf ,  resulting,  finally,  into 
^2i(Hj )  >  i?2i(E>2 ).  We  have  thus  given  an  additional  proof  to  the  fact  that  the 
BS  mechanism  cannot  merely  be  interpreted  as  a  field-dressed  barrier  lowering 
described  by  a  single  photon  exchange.  One  has  rather  to  consider  a  competition 
between  this  single  and  the  aforementioned  five-photon  mechanisms. 

Isotope  Separation  Using  Vibrational  Trapping 

An  important  and  interesting  feature  concerning  the  diabatic-adiabatic  level 
coincidences,  which  are  at  the  origin  of  the  vibrational  trapping  mechanism,  as 
depicted  by  [Eq.  ( 19)],  is  their  great  sensitivity  with  respect  to  the  laser  and  mo¬ 
lecular  characteristics.  In  particular,  a  coincidence  obtained  for  a  given  molecule- 
plus-field  system  does  not  hold  when  one  proceeds  to  an  isotopic  substitution.  A 
very  efficient  D2  versus  Hj  isotope  separation  technique  results,  using  an  intense 
laser  field  (  /  ~  10 13  W/cm2)  to  stabilize  Dj  while  dissociating  Hj . 

The  semiclassical  theory  [Eqs.  ( 1 1  )-(21 )],  apart  from  providing  an  interpretative 
approach  for  the  VT  mechanism,  can  also  be  invoked  to  adjust,  at  least  approxi- 
matively,  the  laser  parameters  (wavelength  and  intensity )  so  as  to  guide  exact  quan¬ 
tum  calculations.  These  are  done  by  solving  the  close-coupled  equations  of  the 
time-independent  Floquet  Hamiltonian  [Eq.  (8)]  and  by  imposing  the  Gamow- 
Siegert  end-point  conditions  within  the  frame  of  the  complex  rotation  of  the  co¬ 
ordinate  [30].  The  two  laser  wavelengths  we  are  considering,  namely  X  =  120  nm 
and  X  =  95  nm,  illustrate  two  distinct  curve-crossing  situations:  c+  (i.e.,  crossing 
at  the  right  of  the  equilibrium  position)  and  c~  (i.e.,  crossing  at  the  left  of  the 
equilibrium  position)  type  of  dissociation,  respectively.  In  addition,  both  wave¬ 
lengths,  being  close  to  the  maximum  of  the  absorption  lineshape  of  Hj  (X  ~  115 
nm) ,  even  for  the  relatively  strong  fields  which  are  used,  validate  the  rotating  wave 
approximation  (rwa)  in  its  time-independent  quantal  and  semiclassical  versions. 
The  results  are  displayed  in  Figure  5. 

Using  a  semiclassical  approach  the  first  occurence,  for  Dj ,  of  a  diabatic-adiabatic 
level  coincidence  is  v2  =  2,  v+  =  0  for  X  =  120  nm,  at  7  =  1.25  10 13  W/cm2,  or 
v2  =  1, 17+  =  0  for  X  =  95  nm  but  with  a  stronger  field  of  /  ca.  2.81  10 13  W/cm2. 
The  semiclassically  estimated  critical  field  intensities  leading  to  coincidences  do 
not  differ  by  more  than  1 5%  when  calculated  by  using  either  X  =  0  or  X  given  by 
the  LZ  approximation  [Eq.  (10)  implemented  in  Eq.  (13)].  The  upper  pannels 
[(a)  and  (c)]  of  Figure  5,  display,  on  an  enlarged  scale,  the  variations  of  the  resonance 
width  in  the  vicinity  of  the  critical  intensities  for  Dj  as  resulting  from  semiclassical 
[Eq.  (19)],  quantal  within  rwa  (one  Floquet  block)  and  multi-Floquet  blocks 
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Figure  5 .  The  resonance  width  ( in  cm- 1 )  as  a  function  of  the  laser  intensity  ( in  W  /  cm2 ) . 
Panels  (a)  and  (b)  correspond  to  v2  =  2  and  X  =  120  nm.  D2  dissociation  in  the  vicinity 
of  the  critical  intensity  is  depicted  in  panel  (a)  where  dots  are  for  the  semiclassical  ap¬ 
proximation;  solid  line  for  the  single  Floquet  block  approximation  and  the  dashed  line 
for  converged  quantal  calculations.  In  panel  (b)  the  solid  line  is  for  the  Dj  dissociation 
rate;  the  dashed  line  for  the  Hj  dissociation  rate.  Panels  (c)  and  (d)  are  the  same  as  (a) 
and  (b),  respectively,  but  referred  to  the  v2  =  1  resonance  for  X  =  95  nm. 


converged  quantal  [Eq.  (8)]  calculations  [20] .  The  semiclassical  approach  system¬ 
atically  leads  to  vanishing  widths  for  larger  intensities  as  compared  with  the  quantal 
approaches,  the  differences  remain,  however,  reasonably  small.  RWA  turns  out  to 
be  valid,  when  compared  with  the  exact  converged  quantal  calculation,  for  both 
wavelengths,  clearly  supporting  that  for  these  wavelengths  (close  to  the  top  of  the 
one-photon  absorption  lineshape),  the  reference  Floquet  block  is  the  one  responsible 
for  the  suppression  of  the  dissociation.  The  lower  pannels  [(b)  and  (d)]  of  Figure 
5  compare  the  intensity  dependence  of  the  widths  of  Dj  and  Hj  on  a  wider  scale. 
The  following  points  can  be  raised: 

(i)  A  linear  regime  for  the  behavior  of  the  widths  with  respect  to  the  intensity 
is  observed  in  the  weak  field  regime  (/  <  10 12  W/cm2).  For  X  =  120  nm,  the 
dissociation  rate  of  Hj  dominates  over  that  of  Dj ,  while  a  reverse  situation  is 
depicted  for  X  =  95  nm. 
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(ii)  Optical  nonlinearities  appear  for  stronger  fields  as  has  been  discussed  pre¬ 
viously  [9a].  The  dissociation  rate  reaches  a  maximum  value  at  /  ^  4  X  1012W/ 
cm2  for  T>i  and  I  ~  10 13  W/cm2  for  Hj ,  at  X  =  120  nm.  The  corresponding  values 
at  X  =  95  nm  are  slightly  higher,  i.e.,  I  a  6  X  10 12  W/cm2  for  Dj  and  I  ca  1.1  X 
1013  W/cm2  for  Hj. 

(iii)  An  overall  decrease  towards  zero  for  both  Dj  and  Hj  and  for  both  wave¬ 
lengths  dominates  the  strong  field  regime.  The  most  important  information  is  that 
the  zero  minima  of  Dj  correspond,  for  both  wavelengths,  to  a  rather  large  width 
of  Hj,  namely  T  ~  600  cm-1  (for  X  =  120  nm)  and  T  ~  100  cm-1  (for  X  =  95 
nm) .  This  clearly  shows  that  one  can  proceed  to  a  specific  choice  for  a  laser  wave¬ 
length  and  intensity  such  that  Dj  dissociation  may  completely  be  suppressed, 
whereas  for  the  same  field  Hj  is  still  dissociating  fast.  We  have  thus  evidenced  the 
possibility  of  an  isotope  separation  based  on  a  vt  mechanism.  Great  efficiency  is 
expected  for  properly  prepared  initial  vibrational  states  of  a  rotation-free  isotope 
interacting  with  a  cw  laser. 

Recent  calculations  [31,32]  show,  however,  that  multiple  rotational  excitation 
dominates  at  three-photon  avoided  crossings  and  even  partly  distroys  the  stabili¬ 
zation  [33].  Isotope  effects  are  thus  basically  sensitive  for  single  photon  vt  mech¬ 
anism  (i.e.,  the  frame  of  the  present  work).  For  the  wavelengths  under  consideration 
and  for  moderately  strong  fields,  the  relative  importance  of  rotational  excitation 
seems  to  contribute  to  the  yield  but  not  to  the  efficiency  of  the  isotope  separation 
scheme  [20]. 
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Abstract 

Molecular  dynamics  simulations  are  used  to  study  hypervelocity  impacts  of  an  ultrathin  flyer  plate 
with  a  semi-infinite  two-dimensional  model  diatomic  molecular  solid.  We  show  that  these  hypervelocity 
impacts  can  produce  a  dissociative  phase  transition  from  a  molecular  to  a  close-packed  solid  in  the  target 
material.  We  also  show  that  hypervelocity  impacts  of  ultrathin  plates  can  produce  extensive  chemical 
reactions  leading  to  a  detonation  accompanied  by  a  phase  transition  in  an  energetic  version  of  the  model. 
©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Advances  in  technology  have  led  to  the  development  of  a  variety  of  methods  to 
launch  flyer  plates,  microprojectiles,  and  pellets  at  velocities  in  excess  of  several 
kilometers  per  second  [1].  The  shock  wave  that  results  when  these  hypervelocity 
projectiles  strike  stationary  targets  typically  separates  the  unperturbed  material  by 
only  a  few  lattice  spacings  from  the  shocked  solid  which  experiences  pressures  of 
several  hundred  kilobars,  temperatures  of  several  thousand  degrees  kelvin,  while 
flowing  at  velocities  of  several  kilometers  per  second  [2].  These  extreme  but 
ephemeral  conditions  have  been  used  to  uncover  new  high  pressure  phases  of  solids 
[3]  as  well  as  explore  synthetic  routes  to  novel  materials  such  as  “hexagonal”  dia¬ 
mond  [4]  (lonsdaleite)  originally  discovered  in  granular  form  in  meteorites.  Indeed, 
as  early  as  1956  Bancroft,  Peterson,  and  Minshall  reported  data  from  shock  loading 
experiments  that  proved  the  existence  of  a  previously  undetected  high  pressure 
phase  of  iron  [  5  ] .  The  signature  of  this  transition  was  the  presence  of  a  split  shock 
wave.  Since  Bancroft,  Peterson,  and  Minshall’s  initial  observations,  shock  wave 
splitting  caused  by  polymorphic  phase  transitions  has  been  observed  in  many  other 
systems  [  3  ] . 

A  better  atomic-scale  understanding  of  the  physical  and  chemical  events  that 
occur  during  shock-wave  loading  should  not  only  aid  in  the  design  of  more  resilient 
materials  but  also  in  the  production  of  safer  and  more  reliable  explosives.  However, 
the  shock  processes  induced  by  hypervelocity  impacts  or  sustained  by  detonations 
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occur  at  such  short  time  (subpicosecond)  and  length  scales  (subnanometer  scales) 
that  they  are  difficult  to  probe  experimentally.  The  short  time  and  length  scale  of 
these  processes,  however,  make  them  ideal  for  classical  molecular  dynamics  sim¬ 
ulations  [6].  In  addition,  although  starting  from  an  atomic-scale  description,  mo¬ 
lecular  dynamics  simulations  have  proven  able  to  treat  sufficiently  many  atoms  for 
long  enough  times  to  reproduce  the  continuum  behavior  of  planar  shock  waves 
described  by  hydrodynamic  theory  [7,8].  Hence,  molecular  dynamics  simulations 
are  ideal  for  modeling  atomic-scale  processes  associated  with  shock  waves  and  for 
relating  this  atomic-scale  behavior  to  the  continuum  properties  of  condensed  phase 
shock  waves. 

Shock  loading  resulting  from  hypervelocity  impacts  can  induce  chemical  as  well 
as  physical  changes  in  materials.  An  extreme  example  is  that  of  a  solid  explosive 
which  when  struck  can  undergo  catastrophic  exothermic  chemical  reactions.  A  less 
spectacular  but  no  less  important  example  is  the  short-time  corrosive  chemistry 
that  can  occur  when  nanometer  scale  clusters  strike  surfaces  at  hypervelocities. 

We  have  developed  a  series  of  related  many-body  reactive  potentials  that  allow 
processes  such  as  adhesion  and  friction  [9,10],  and  shock-induced  phase  changes 
and  chemistry  [8,1 1]  to  be  studied  at  atomic-scale  resolution.  Herein,  we  summarize 
and  relate  some  of  the  results  we  have  obtained  in  using  a  subset  of  these  potentials 
to  simulate  the  processes  that  can  occur  during  the  hypervelocity  impact  of  an 
ultra-thin  flyer  plate  with  a  model  diatomic  molecular  solid.  Specifically,  we  discuss 
results  of  a  series  of  molecular  dynamics  simulations  of  the  hypervelocity  impact 
of  a  nanometer  thick  flyer  plate  with  several  model  2-D  diatomic  molecular  solids. 
We  find  that  even  this  ultrathin  flyer  plate — if  accelerated  to  hypervelocities — can 
have  sufficient  momentum  to  induce  a  polymorphic  (dissociative)  transition  from 
a  molecular  to  close-packed  solid  in  the  target  material.  In  addition,  we  show  how 
such  impacts  can  lead  to  extensive  chemical  reactions — even  to  the  point  of  pro¬ 
ducing  a  detonation  in  an  energetic  version  of  the  model.  In  the  next  section  we 
briefly  review  the  model  used  in  these  simulations.  Then  a  series  of  our  results  are 
discussed.  The  final  section  provides  a  summary. 

Model 

The  model  potentials  used  in  our  studies  are  based  on  empirical  bond  order 
potentials  such  as  those  used  by  TersofF  to  describe  silicon  [12]  but  tailored  by  us 
to  treat  reactive  diatomic  molecular  solids  [  8  ] .  Within  this  approach  the  total  po¬ 
tential  energy,  EB,  of  a  collection  of  N  atoms  is  represented  by, 

N  N 

Eb=X'Z  {fMVdru)  -  BijVAnj )]  +  vvMnj)j ,  (l) 

i  j>i 

where  the  molecular  bonding  portion  of  the  potential  consists  of  a  repulsive  term, 
VR(r),  and  an  attractive  term,  VA(r),  both  modeled  by  exponentials.  A  Lennard- 
Jones  potential,  Vvdw ,  is  used  to  describe  the  weak  long-ranged  van  der  Waals 
interaction.  This  Lennard-Jones  potential  is  truncated  not  only  at  large  distances 
but  also  at  close-range  bonding  separations  to  allow  for  covalent  bonding.  The  bond 
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order  function,  Bu  =  (By  +  Bji)/ 2,  introduces  many-body  effects  into  the  potential 
by  modifying  VA(r )  according  to  the  local  bonding  environment.  These  many-body 
effects  arise  from  the  hidden  electronic  degrees  of  freedom  that  are  not  treated 
explicitly  in  the  model.  For  an  isolated  diatomic  molecule  B0  is  unity  and  the 
potential  reduces  to  a  generalized  Morse  function  familiar  in  the  description  of 
diatomic  bonding  [13].  For  more  highly  coordinated  structures  By  is  no  longer 
unity,  but  rather  decreases  with  the  increasing  number  and  strength  of  competing 
bonds  the  atoms  i  and  j  form.  This  decrease  in  Bu  reflects  the  finite  number  of 
valence  electrons  these  atoms  have  available  for  bonding.  Hence  Bv  represents  an 
effective  valence  which  can  be  tailored  through  its  dependence  on  environment  to 
describe  either  highly  coordinated  metals  [14],  tetrahedrally  bonded  semiconduc¬ 
tors  [15]  and  insulators  or  low  coordination  molecular  solids  with  a  few  strong 
bonds [ 8 ] . 

In  what  follows  the  functions  and  associated  parameters  entering  Eq.  ( 1 )  have 
been  chosen  to  describe  a  diatomic  molecular  solid  at  ambient  conditions  as  de¬ 
scribed  in  detail  elsewhere  [  8  ] .  The  diatomic  molecules  that  make  up  this  model 
solid  can  be  composed  of  either  like  (A2  or  B2)  or  unlike  (AB)  molecules.  These 
diatomic  molecules  all  have  reasonable  physical  and  chemical  characteristics  in¬ 
cluding  binding  energies,  bond  lengths,  vibrational  frequencies,  and  barriers  to 
chemical  reaction  [8].  In  addition,  the  diatomic  molecular  solids  formed  from 
these  diatomic  molecules  have  reasonable  physical  characteristics  including  densities, 
solid-state  sound  speeds,  and  high  pressure  phase  transitions  [  8  ] . 

The  A 2  and  B2  molecules  entering  this  model  are  taken  to  have  identical  binding 
energies  of  5  eY  while  the  binding  energy  of  each  AB  molecule  is  set  to  2  eV.  Hence 
simulations  starting  from  an  AB  molecular  solid  allow  for  the  possibility  of  exo¬ 
thermic  chemical  reactions  from  the  higher  energy  AB  molecules  to  the  more  stable 
A2  and  B2  molecular  products.  In  contrast,  simulations  starting  from  either  A2  or 
B2  molecular  solids  do  not  have  the  possibility  of  any  net  release  of  energy.  This 
model  then  allows  simulations  of  hypervelocity  impacts  with  target  materials  com¬ 
posed  of  either  energetic  or  nonenergetic  diatomic  molecules. 

Results  and  Discussion 

Hypervelocity  impact  simulations  are  initiated  by  slamming  an  ultrathin  thin 
flyer  plate  composed  of  20  layers  or  less  of  the  target  material  into  the  edge  of  the 
semi-infinite  model  diatomic  molecular  crystal  initially  at  rest.  Both  the  flyer  plate 
and  the  molecular  crystal  are  initially  taken  to  have  near-zero  temperature  and 
pressure.  The  dynamics  of  the  system  resulting  from  this  hypervelocity  impact  is 
then  studied  by  integrating  Hamiltonian’s  equations  of  motion  using  a  Nordsieck 
predictor-corrector  method  [15].  During  these  simulations  periodic  boundary  con¬ 
ditions  are  imposed  perpendicular  to  the  direction  of  shock  propagation. 

A  series  of  studies  were  first  carried  out  assuming  an  all  A2  or  equivalently  an 
all  B2  diatomic  molecular  solid.  A  sequence  of  snapshots  for  a  simulation  beginning 
with  the  15  km/sec  velocity  plate  is  shown  in  Figure  1.  At  0.8  ps  [Fig.  1  (a)]  only 
a  single  compressional  shock  wave  is  visible.  Across  this  shock  front  the  diatomic 


Figure  I.  Snapshots  of  the  simulation  begun  with  the  15  km/ sec  flyer  plate  at:  (a)  0.8; 
(b)  1.6;  (c)  2.4;  (d)  3.2;  and,  (e)  4.0  ps  after  the  impact  with  the  all  A2  diatomic  molecular 
solid.  The  shock  waves  are  propagating  from  left  to  right. 


molecular  solid  (dms)  transforms  directly  to  a  close-packed  solid  (CPS).  Although 
the  density  of  this  CPS,  pc ps>  is  approximately  2.4  times  larger  than  the  density  of 
the  DMS,  pdms,  the  average  nearest-neighbor  interatomic  separation  in  this  phase, 
dm,  is  actually  20%  larger  due  to  an  increase  in  the  interatomic  coordination. 
These  properties  of  the  CPS  make  this  DMS  to  CPS  transition  similar  to  the  dissociative 
transition  reported  in  diatomic  molecular  solids  of  iodine  [16]  and  bromine  [17]. 
This  transition  occurs  at  2 1  GPa  in  I2  and  8 1  GPa  in  Br2  leading  to  an  increase  in 
the  overall  relative  density,  pcps/pdms,  of  1.7  for  I2  and  2.3  for  Br2,  and  an  increase 
in  dNN  of  around  1 5%  for  both  materials. 

At  1.6  ps  [Fig.  1(b)]  a  second  leading  compressional  shock  wave  has  appeared 
in  the  15  km/sec  simulation.  Across  this  new  front  the  diatomic  molecules  are 
compressed  and  rotated  from  their  initial  positions  in  the  undisturbed  crystal  but 
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retain  their  molecular  identity.  Once  the  second  shock  front  is  clearly  visible  it 
rapidly  separates  from  the  dissociative  front,  as  can  be  seen  by  comparing  Figure 

1  (b)  to  Figure  1(c).  Although  both  the  compressional  and  dissociative  shock  fronts 
propagate  at  velocities  in  excess  of  the  speed  of  sound  in  the  undisturbed  crystal, 
the  velocity  of  the  dissociative  front  rapidly  decreases  from  more  than  12  km/ sec 
at  0.5  ps  to  about  3  km/ sec  at  2.4  ps.  In  contrast,  the  velocity  of  the  leading  com¬ 
pressional  shock  remains  pinned  around  12  km/ sec  while  the  dissociative  front 
exists,  only  gradually  beginning  to  slow  after  the  CPS  disappears.  As  the  dissociative 
front  slows  the  accompanying  dissociative  zone  also  narrows  as  this  region  is  con¬ 
sumed  from  behind  by  the  rarefaction  wave  [Fig.  1(a) -Fig.  1(d)].  Figures  1  and 

2  show  that  (Fig.  2)  this  front  also  begins  to  smear  with  the  material  behind  entering 
a  mixed  phase  region  by  2.4  ps  [Fig.  1(c)]  as  the  velocity  of  the  dissociative  front 
approaches  3  km/ sec.  This  mixed  phase  is  clearly  present  at  3.2  ps  [Fig.  1  (d)]  just 
prior  to  the  complete  disappearance  of  the  dissociative  zone. 

A  dissociative  phase  together  with  an  accompanying  split  shock  wave  such  as 
shown  in  Figure  1(b)  is  also  present  in  all  the  other  simulations  begun  with  flyer 
plate  velocities  between  10  to  20  km/ sec.  Although  the  appearance  of  the  leading 
compressional  wave  is  delayed  for  higher  impact  plate  velocities,  once  formed  this 
front  always  exhibits  a  velocity  near  but  slightly  less  than  12  km/  sec,  provided  the 
dissociative  region  remains  well-defined.  In  all  simulations  exhibiting  a  split  shock 
wave,  the  dissociative  zone  begins  to  lose  its  identity  only  after  the  particle  flow 
velocity  behind  the  dissociative  front  slows  to  near  3  km/ sec. 
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Figure  2.  Scatter  plot  of  upcps  3.0  A  behind  the  dissociative  front  for  the  hypervelocity 
impact  simulation  depicted  in  Figure  1 . 
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The  general  properties  of  these  hypervelocity  impact  simulations  as  well  as  the 
explicit  behavior  shown  in  Figure  1  are  all  consistent  with  continuum  predictions 
of  a  shock  induced  phase  transition.  An  important  property  of  these  phase  transitions 
is  the  compressional  shock  wave  splitting  that  might  occur.  If  split  compressional 
shock  waves  are  present,  then  continuum  theory  predicts  [18]  that  the  leading 
shock  front  starts  material  flow  bringing  it  to  the  point  of  transition  while  the 
transition  occurs  across  the  second  compressional  shock  front.  A  split  shock  wave 
with  the  transition  occurring  across  the  second  front  is  just  what  is  observed  in 
Figure  1  (b)  and  1(c). 

The  continuum  theory  of  planar  shock  waves  also  predicts  [18]  that  a  shock 
wave  propagates  into  a  medium  (assumed  at  rest  and  characterized  by  a  pressure 
P0  and  a  specific  volume  V0)  with  a  velocity  D  given  by: 


£>  =  (1  -  V/V0)-'u„,  (2) 

where  the  particle  flow  velocity  behind  this  shock  front,  up,  is  given  by: 

up  =  V(K0-  K)(£-P0)  ,  (3) 

with  V  the  specific  volume  and  P  the  pressure  of  the  shocked  material.  Therefore, 
because  the  leading  shock  front  carries  the  pressure,  PT,  and  specific  volume,  VT, 
of  the  phase  transition,  its  velocity  from  Eqs.  (2)  and  (3)  is  given  by  Dx  = 
V( PT  -  Pq)/{Vq  —  VT),  and  hence  is  predicted  as  pinned  at  a  constant  value  de¬ 
termined  by  the  properties  of  the  phase  transition  and  the  undisturbed  molecular 
crystal.  In  addition,  the  second  front  can  only  exist  as  a  shock  front  so  long  as  the 
particle  flow  velocity  behind  this  front  exceeds  the  particle  flow  velocity  in  the 
leading  compressional  zone,  uPl  =  V(F0  —  VT)(PT  —  PQ) .  Hence  uPl  is  also  a  con¬ 
stant  determined  by  the  properties  of  the  phase  transition  and  the  initial  state. 
Continuum  theory  therefore  explains  why  when  two  shock  waves  are  present  in 
the  system — regardless  of  the  impact  plate  velocity — the  first  shock  propagates  at 
a  near  constant  velocity  Dx,  while  the  second  always  fragments  as  the  particle 
velocity  behind  this  front,  uPcps,  approaches  a  fixed  value,  uPr  Elsewhere  we  have 
shown  [8]  that  for  the  present  model:  VT  »  4.5  A2/atom;  PT  «  0.813  eV/A2; 
Vq  6.14  A2/atom;  and  P0  0.00  eV/A2;  results  which  imply  that  Dx  «  1 1.5 
km/ sec  and  uPl  «  3.0  km/ sec.  These  predictions  differ  no  more  than  5%  from  the 
corresponding  near-constant  values  of  Dx  and  uPl  observed  in  all  of  these  flyer  plate 
simulations  confirming  the  consistency  of  the  continuum  interpretation. 

The  snapshots  shown  in  Figure  1  also  suggest  that  hypervelocity  impacts  sufficient 
to  induce  a  local  dissociative  transition  should  also  be  capable  of  causing  substantial 
chemical  rearrangement  of  the  atoms  in  the  dissociative  zone.  To  explore  the  pos¬ 
sibility  of  chemistry  induced  by  hypervelocity  impacts  we  have  used  the  energetic 
AB  version  of  our  model  diatomic  molecular  solid.  Hence,  rather  than  starting 
with  an  all  A2  molecular  solid  we  now  start  with  an  all  AB  molecular  solid  which 
when  shocked  might  undergo  exothermic  chemical  reactions  to  yield  the  more 
stable  A2  and  B2  molecules.  As  noted  in  the  “Model”  section  the  exothermic  gas- 
phase  half  reactions,  AB  +  B  -►  B2  +  A  and  BA  +  A  -►  A2  +  B — taken  as  identical 
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in  the  model — are  each  assumed  to  liberate  3.0  eV.  This  3.0  eV  is  similar  to  the 
3.3  eV  of  energy  liberated  in  the  exothermic  reaction  N  +  NO  -►  N2  +  O  thought 
important  in  the  detonation  of  NO.  Also,  the  transition  states  for  these  two  identical 
reactions  occur  in  a  collinear  geometry  with  an  early  barrier  to  reaction  which  is 
typical  of  atom-diatom  exothermic  reactions.  In  addition,  the  barrier  for  these  two 
model  reactions  is  0. 1 1  eV.  This  barrier  is  greater  than  but  similar  to  the  barrier 
reported  for  the  exothermic  reaction  F  +  H2  -►  FH  +  F  [19]. 

Figure  3  depicts  a  snapshot  of  a  simulation  begun  by  the  impact  of  a  four-layer, 
6.0  km/ sec  flyer  plate  with  the  AB  model  molecular  solid.  As  in  Figure  1(b),  four 
distinct  regions  are  present:  the  unreacted  molecular  solid,  a  leading  compressional 
region,  a  close  packed  solid  (CPS)  phase,  and  a  product  region.  Hence,  the  AB 
molecular  solid  also  exhibits  a  split  shock  wave  structure  and  associated  polymorphic 
phase  transition  similar  to  that  found  in  the  A2  simulations.  However,  in  contrast 
to  the  all-A2  model,  the  CPS  phase  does  not  shrink  but  rather  grows  at  a  constant 
rate  as  the  simulation  progresses.  The  work  necessary  to  maintain  and  increase  this 
CSP  is  supplied  by  the  exothermic  chemical  reactions  induced  by  the  shock  wave. 
These  exothermic  reactions  almost  completely  convert  the  AB  molecular  solid  to 
the  lower  energy  A2  and  B2  molecular  products  clearly  visible  to  the  far  left  of 
Figure  3.  In  this  way,  the  simulation  depicted  in  Figure  3  has  become  a  chemically 
sustained  shock  wave  with  the  velocities  of  the  three  interfaces  separating  the  four 
distinct  regions  soon  reaching  constant  but  different  values  as  shown  in  Figure  4. 
As  we  will  discuss  in  detail  elsewhere,  many  properties  of  this  chemically  sustained 
shock  wave  such  as  peak  pressure,  particle  flow  velocity,  and  power  generation  are 
all  consistent  with  a  detonation.  Hence,  Figure  3  demonstrates  that  hypervelocity 
impacts  are  not  only  capable  of  causing  chemical  reactions  but  also  catastrophic 
chemical  reactions  leading  to  a  detonation. 

Whether  or  not  a  chemically  sustained  shock  wave  occurs  depends  on  the  mass 
and  impact  velocity  of  the  flyer  plate.  Specifically,  as  the  number  of  layers  in  the 
flyer  plate  is  increased  the  minimum  impact  velocity  for  initiation  decreases  ap¬ 
proaching  an  asymptotic  value  of  about  4.5  km/ sec.  Thus,  even  macroscopically 
thick  flyer  plates  must  impact  at  hypervelocities  to  cause  a  chemically  sustained 
shock  wave  in  the  model  material.  This  result  agrees  with  the  observed  insensitivity 
of  crystalline  explosives  to  initiation. 
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Figure  3.  Snapshot  at  5.1  ps  of  a  simulation  begun  with  the  impact  of  a  four-layer,  6.0 
km/ sec  flyer  plate  with  the  AB  model  energetic  diatomic  molecular  solid.  The  two  types 
of  atoms  are  depicted  as  light  and  dark.  The  shock  front  is  propagating  from  left  to  right. 
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Time  (ps) 

Figure  4.  The  positions  of  the  three  interfaces  between  the  four  distinct  regions  as  a 
function  of  time  for  a  simulation  begun  with  an  impact  velocity  of  6  km/ sec. 


Summary 

In  summary,  we  have  reported  the  results  of  a  series  of  molecular  dynamics 
simulations  of  hypervelocity  impacts  of  an  ultra-thin  plate  with  a  model  diatomic 
molecular  solid.  At  these  high  impact  velocities  even  this  ultra-thin  plate  is  found 
able  to  induce  a  dissociative  phase  transition  to  a  close-packed  solid  with  accom¬ 
panying  complex  behavior  such  as  shock  wave  splitting.  Although  this  close-packed 
phase  persists  for  less  than  10  ps  and  is  confined  to  a  domain  less  than  10  nm  wide 
it  still  behaves  in  a  manner  consistent  with  continuum  theory.  Additional  simu¬ 
lations  of  hypervelocity  impacts  of  an  ultrathin  flyer  plate  with  a  model  diatomic 
energetic  material  not  only  produce  a  dissociative  phase  transition  but  also  extensive 
chemical  reactions  leading  to  a  chemically  sustained  shock  wave. 
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Abstract 

Relying  on  theoretical  developments  exploiting  quasispin  and  the  pseudo-orthogonal  group  in  the 
Hubbard  model  of  cyclic  polyenes,  the  general  expressions  for  generating  polynomials,  providing  the 
dimensional  information  for  relevant  irreducible  representations,  were  derived  (M.  D.  Gould,  J.  Paldus, 
and  J.  Cizek,  Int.  J.  Quantum  Chem.,  in  press).  These  generating  polynomials  result  from  ^-dimensional 
formulas  through  rather  tedious  algebraic  manipulations  involving  ratios  of  polynomials  with  fractional 
powers.  It  is  shown  that  these  expressions  may  be  efficiently  handled  using  the  symbolic  manipulation 
language  MAPLE  and  the  dimensional  information  for  an  arbitrary  spin,  isospin,  and  quasimomentum 
obtained.  Exploitation  of  symbolic  computation  for  other  group  theoretical  problems  that  are  relevant 
in  quantum  chemical  calculations  and  their  relationship  with  Gaussian  polynomial  based  combinatorial 
approaches  is  also  briefly  addressed  and  various  possible  applications  outlined.  ©  1994  John  Wiley  & 
Sons,  Inc. 


Introduction 


In  order  to  arrive  at  various  attributes  of  symmetry  groups  or  their  chains  that 
are  needed  for  their  exploitation  in  quantum  mechanics,  particularly  in  deriving 
or  calculating  the  characters  of  their  irreducible  representation  (irreps),  one  often 
relies  on  suitable  generating  polynomials.  Perhaps  the  best  known  example  of 
such  a  procedure  is  represented  by  Frobenius’  theorem  of  the  symmetric  group 
representation  theory  (see,  e.g.,  Eq.  (7-24)  of  [1])  yielding  directly  all  the  simple 
characters  xj#  of  the  irrep  (X)  =  (Xb  X2, . . . ,  \N)  of  A^as  the  coefficients  of  the  Schur 
polynomial  Sfa£x)  in  the  product  s{l)(x)  -  Ylr  sr(x)ar  of  power  sum  or  Newton 
polynomials  sr, 

Sr(x)='Zxri,  (1) 
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that  are  associated  with  the  partition  (/)  =  (i«i2a23a3  •  •  • )  labelling  the  SN  classes, 
namely 


sdx)  =  2  X$Sw(x) .  (2) 

W 

Since  any  finite  group  G  is  isomorphic  with  a  suitable  subgroup  of  the  symmetric 
or  permutation  group  *SVof  an  appropriate  order  (Cayley’s  theorem),  the  usefulness 
of  the  representation  theory  of  SN  in  the  general  case  is  self-evident.  Moreover, 
there  exists  an  intimate  relationship  between  the  representation  theory  of  SN  and 
that  of  compact  Lie  groups  thanks  to  Weyl’s  global  approach  that  relies  on  the 
tensor  powers  of  the  fundamental  representation,  which  is  provided  by  the  group 
itself  or,  rather,  by  its  Lie  algebra.  Recall  also  that  unitary  groups  V(n)  play  the 
same  role  for  the  compact  Lie  groups  as  do  permutation  groups  for  finite  groups. 
This  intimate  relationship  is  also  reflected  in  various  group  theoretical  approaches 
to  the  ^-electron  correlation  problem  employing  ^-orbital  models,  where  the  in¬ 
terplay  between  the  SN,  U(«)  and  SU(2)  based  approaches  is  well  known  [2-5]. 

As  a  special  case  of  the  general  character  theory,  one  often  requires  only  the 
characters  of  the  identity,  providing  the  appropriate  dimensions.  Let  us  recall  here, 
for  example,  the  special  case  of  Weyl’s  dimension  formula  (often  referred  to  now¬ 
adays  as  Weyl-Paldus  dimension  formula  [3,6,7])  giving  the  number  of  spin-adapted 
configurations,  characterized  by  the  total  spin  quantum  number  S ,  for  the  ^-orbital 
model  of  an  JV-electron  system  that  is  described  by  a  spin-independent  Hamiltonian 
[8,9], 


D„(a,  b,  c) 


b  +  1  (n  +  l\/«  +  1\ 

«  +  1  \  a  A  c  /’ 


(3) 


where  (a,  b ,  c)  designates  Paldus  label  of  the  two-column  U(w)  irrep  (2°1^0C)  given 
by  a  =  -  S,  b  =  2S  and  c  =  n~  \N  -  S  and  (™)  =  n\/{n  -  m)\m\  is  the  usual 

binomial  coefficient.  An  equivalent  form  of  this  and  related  dimension  formulas 
may  be  often  derived  using  simple  combinatorial  means,  independently  of  group 
theory  [10]. 

Nonetheless,  even  when  relying  on  a  combinatorial  approach,  it  is  often  useful 
to  employ  the  technique  of  generating  polynomials.  In  this  way,  for  example,  Pauncz 
[11]  derived  generating  polynomials  yielding  a  number  of  configurations  associated 
with  a  given  level  in  the  harmonic  level  excitation  diagram  (hled)  [12,13].  This 
result  was  later  generalized  by  Katriel  et  al.  [14]  for  both  fermions  and  bosons, 
giving  the  number  of  configurations  involving  any  number  of  particles  with  an 
arbitrary  elementary  spin,  as  well  as  for  the  number  of  configurations  specified  by 
the  total  orbital  weight  [13]  and  by  the  degree  of  excitation. 

More  recently,  these  results  were  further  generalized  for  atomic  and  nuclear  sys¬ 
tems  involving  both  fermionic  and  bosonic  levels  with  a  fixed  number  of  particles 
per  level,  giving  the  number  of  states  with  conserved  total  angular  momentum  and 
spin  [15-17].  These  results  are  useful  for  the  calculation  of  nuclear  level  densities 
[18]  as  well  as  for  the  spectroscopy  of  partly-filled  shell  ions  in  crystalline  environ- 
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ments  [19].  For  this  latter  purpose,  a  new  generating  polynomial  was  introduced 
by  Kibler  and  Katriel  [20].  Another  application  exploiting  generalization  of  the 
dimension  formula  (3)  for  the  determination  of  spectroscopic  terms  was  recently 
put  forward  by  Torre  et  al.  [21]. 

Another  extensive  exploitation  of  the  dimension  formulas,  such  as  Eq.  (3),  giving 
the  number  of  symmetry  adapted  configurations,  is  found  in  the  many-electron 
theory  based  on  spin-adapted  reduced  Hamiltonians  (SRH)  of  Valdemoro  [22]  and 
in  the  so-called  spectral  distribution  method  (SDM)  of  nuclear  physics  [23-25].  In 
both  cases  one  evaluates  fixed-symmetry  traces  over  finite  A-body  spaces,  although 
various  distinct  formalisms  may  be  employed.  The  relationship  between  both  ap¬ 
proaches  [25,26]  was  pointed  out  by  Nomura  [27],  which  in  turn  inspired  Karwowski 
and  Valdemoro  [28]  to  derive  a  set  of  recurrent  relations  expressing  traces  of  products 
of  the  occupation  number  operators,  which  are  essential  for  the  construction  of  the 
effective  two-body  SRH,  solely  in  terms  of  the  dimensions  given  by  Eq.  (3). 

The  direct  exploitation  of  generating  polynomials,  such  as  given  by  Eq.  (2),  may 
prove  to  be  rather  demanding.  We  quote  in  this  regard  Hamermesh  [1,  p.  197]  who 
states  that  Frobenius’  formula  (2)  “looks  (and  is!)  formidable,  and  we  shall  therefore 
deal  with  it  to  devise  methods  which  are  tractable.”  Such  developments  are  invariably 
based  on  the  theory  of  polynomial  rings  in  an  arbitrary  number  of  variables  and 
in  particular  on  the  theory  of  symmetric  functions  [29].  This  type  of  approach  was, 
for  example,  exploited  by  Sunko  and  Svrtan  [15,16].  However,  in  view  of  recent 
developments  in  symbolic  computation,  it  is  of  interest  to  explore  the  possibilities 
offered  by  these  powerful  tools  in  handling  directly  the  generating  polynomials  and 
extracting  the  desired  information. 

In  this  article  we  shall  thus  employ  MAPLE  [30]  in  order  to  extract  the  dimen¬ 
sional  information  from  recently  obtained  generating  polynomials  [31]  for  the 
number  of  symmetry-adapted  configurations  of  the  Hubbard  cyclic  polyene  model 
with  conserved  spin,  quasispin,  and  quasimomentum.  These  generating  polynomials 
were  obtained  [31]  with  the  help  of  U(«)  ^-dimension  formula  [32]  that  originates 
from  the  recently  developed  theory  of  quantum  groups  [33,34].  We  briefly  overview 
the  resulting  expressions  in  the  next  section  and  subsequently  explore  their  com¬ 
putational  implementation  using  MAPLE  [30].  We  also  point  out  the  close  rela¬ 
tionship  between  the  generating  polynomials  originating  from  the  ^-character  or 
^-dimension  formalism  [31,32]  and  those  resulting  from  combinatorial  consider¬ 
ations  that  are  usually  formulated  in  terms  of  Gaussian  polynomials. 

^-Dimension  Formulas  for  the  Cyclic  Polyene  Hubbard  Model 

The  Hubbard  Hamiltonian  for  the  linear  chain  with  n  equidistant  and  equivalent 
sites,  satisfying  the  Born-von  Karman  cyclic  boundary  conditions,  has  the  form 

Hm  =  <3  2  2  (4^+u  +  XliM  +  U  2  ,  (4) 

1=1  <J=±\  *=1 

where  designates  the  resonance  (hopping)  integral,  U  the  one-center  on-site  Cou¬ 
lomb  integral,  x\a{Xi,a)  the  creation  (annihilation)  operator  associated  with  the  rth 
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site  spin  orbital  with  azimuthal  spin  <r,  a  =  ±1,  and  nha  the  corresponding  spin 
orbital  occupation  number  operator 

«/,„  =  .  (5) 

All  the  indices  are  taken  modulo  n,  n  +  1  =  1  (mod  n\  reflecting  the  cyclic  boundary 
conditions  mentioned  above.  Expressing  U  in  the  units  of  (3,  we  can  set  f3  =  —l, 
and  characterize  the  Hamiltonian  (4)  by  a  single  parameter  representing  the  coupling 
constant.  Moreover,  choosing  a  convenient  energy  zero  by  shifting  the  number 
operators  by  their  mean  values,  we  obtain  the  following  often  used  form  of  the 
Hubbard  Hamiltonian  [31,35,36] 

n  n 

H=UZ  («,, !  -  -  i)  -  2  (Eu+ 1  +  El+U)  ,  (6) 

1=1  1=1 

where  we  employed  the  U(«)  generators 

Eq  =  Eu  =  2  xlxM  (7) 

ff=±l 

of  the  unitary  group  approach  [6-9].  Clearly, 

H  =  H(h)+  iU(ln-N),  (8) 

where 

n 

N  =  2  ni9  Hi  =  nu  1  +  rii -I  (9) 

/=1 

is  the  total  electron  number  operator,  so  that  for  a  given  system  with  a  fixed  number 
of  sites  n ,  occupied  by  a  fixed  number  of  electrons  N ,  the  last  term  in  Eq.  (8) 
represents  a  constant  shift.  We  shall  only  consider  lattices  with  an  even  number  of 
sites  and  designate 

n  —  2m  .  (10) 

In  view  of  the  apparent  spin  independence  of  H,  we  can  express  it  entirely  in 
terms  of  U (n)  generators  (7)  as  follows 

£  =  i  1/  2  (Eu)2  -  2  (£/,/+ 1  +  Ei+iJ)  +  \U(\n-  2 N) ,  (6') 

1=1  1=1 

and  we  can  restrict  ourselves  to  a  fixed  irrep  of  U(«)  with  Paldus  labels  ( a ,  b ,  c)  of 
dimension  given  by  Eq.  (3),  since  H  commutes  with  the  SU(2)  operators 

Sz  —  1,1  —  S+  ^  £  1-1,  *S'_  =  <£_ij,  (11) 

where 

=  2  (»,  T  =  ±1) 

1=1 


(12) 
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designate  the  SU(2)  generators.  The  electron  number  operator  (9)  represents  clearly 
the  first  order  invariant  of  both  U(«)  and  U(2), 

N=  2  (13) 

1=1  ff=±l 

In  addition  to  this  obvious  SU(2)  spin  invariance  of  the  Hamiltonian  H,  Eqs. 
(6)  and  (6'),  it  also  possesses  a  quasi-spin  SU(2)  invariance  [37,38],  with  the  quasi¬ 
spin  operator  Q  defined  by  its  components 

Qz  =  *(iV  -  n),  <2+  =  2  (-lyxl-iXlu  Q-  =  Qi  ■  (14) 

J=l 

To  account  for  this  quasi-spin  invariance,  it  is  useful  to  introduce  the  pseudo- 
orthogonal  group  O (m,  m)  with  infinitesimal  generators  [31] 

«<,  =  (-1  )%j  ~  (~1)%  =  -«jt  >  (15) 

satisfying  the  commutation  relations 

[ay,  aid]  =  gjk.au  +  guajk  -  giuciji  ~  gjiaik  , 

with  the  orthogonal  group  metric 

*(,  =  <-l)V  (16> 

The  Hermiticity  condition  now  reads 

4  =  (- 1  )i+jaji  =  (— IV ,+j+lau  .  (17) 

It  can  then  be  shown  [31]  that  the  square  of  the  quasi-spin  Q2  =  Q  •  Q,  with  eigen¬ 
values  Q(Q  +  1)  where  Q  designates  the  quasi-spin  quantum  number,  is  determined 
by  the  second  order  U(«)  and  0(m,  m)  Casimir  invariants  (summation  convention 
over  repeated  indices  is  implied  here) 

h  =  EyEji  (18) 

and 

/HH/V,,  09) 

respectively,  namely 

Q2  =  J(/2  -  I°2)  +  i (n  -  N)(n  -  N  +  2)  -  .  (20) 

Similarly  as  in  the  U(«)  case,  one  labels  the  weights  by  the  eigenvalues  of  the  Cartan 
subalgebra  commuting  self-adjoint  operators 

hj  =  a2j,2j-u  ( 1  <  j  <  m) ,  (21) 

and  the  irreps  of  0(m,  m)  by  the  highest  weights  (under  the  lexicographic  labeling). 
Again,  at  most  two  column  irreps  of  0(m,  m)  can  occur  [31],  so  that  we  employ 
Paldus-type  labels  (a0,  bo,  c0),  a0  +  b0  +  c0  =  m,  to  designate  the  relevant  0(m,  m) 
irrep,  whose  dimension  is  now 
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Dyyhci 0,  bo,  Co) 


(bo  +  \)(n  -  lap  -  bo  +  1)  in  +  2\ /  n+  1  \ 

(«  +  1)(«  +  2)  \  0O  /Uo  +  ^o+1/ 


(22) 


Considering  the  U(«)  |  0(ra,  m)  branching  rules,  one  then  finds  [3 1  ]  that  the  U(/?) 
irrep  Y(a,  b,  c)  decomposes  as  follows 


aAc 

T(a,  b,c)=  ®  r°(0O,  bp,  c0) ,  (23) 

Oo=0 

with 

b0  =  b  A  (n  -  2a0  -  b) ,  (24) 

where  T°(0O,  bo,  c0)  designate  the  O (m,  m)  irrep  with  the  highest  weight  (0O,  bp,  c0) 
and 


x  A  y  :=  min{A:,  y }  .  (25) 

All  the  states  in  T(a,  b,  c)  have  the  azimuthal  quasi-spin  quantum  number  Qz  = 
\(N  —  ri),  N  =  2a  +  b  and  the  O (m,  m)  irrep  r°(0O,  bo,  Co)  is  characterized  by  the 
quasi-spin 

Q  =  j\n  -  N\  +  c  A  a  —  ao  =  \(n  -  b)  -  a0  .  (26) 

Thus,  the  allowed  values  of  quasi-spin  are  [3 1  ] 

\\n-N\  <,Q<\{n-b).  (27) 

To  account,  finally,  for  the  spatial  symmetry  invariance  characterized  by  the 
cyclic  group  Cn,  one  has  to  consider  the  O (m,  m)  \  Cn  branching  rules.  Labelling 
the  C„  irreps  by  the  quasi-momentum  quantum  number  k  and  exploiting  the  q- 
character  formalism  [32],  we  find  [31]  generating  polynomials  yielding  the  desired 
dimensional  information  or  multiplicities  of  the  states  characterized  by  quantum 
numbers  N,  S,  Q  and  k .  Thus,  for  a  given  n  =  2m  and  N,  the  total  spin  quantum 
number  S  determines  the  relevant  U(«)  irrep  (a,  b,  c),  a  =  —  S,  b  —  2S,  c  - 

n  —  a  —  b;  the  quasi-spin  quantum  number  Q  specifies  the  O (m,  m)  irrep  (ao, 

bo,  Co), 

a0  =  \  \n  -  N\  +  c  A  a  -  Q  , 

b0  =  b  A  [n  -  2(c  A  a)  -  b] , 

c0  =  m-  0O  -  b0,  (28) 

and,  finally,  the  quasi-momentum  k  labels  the  relevant  C„  irrep,  0  <  k  <  n.  The 
desired  generating  polynomial  F{ao^Co)(q)  is  given  by  [31] 

F^cM  =  QpFPq[a0,  to] ,  (29) 

where 


p  =  \(n+  1  )N  —  (m  +  1  )(a  —  ao) 


(30) 
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and  D0Q[ao ,  b0 ]  is  the  0(ra,  m)  ^-dimension  given  by 


D°Jao,  b0]  = 


[b0  + 1]  r»  +  2T  «  +  2 


[n  +  1]  L  ]|tfo  + 

/[/7  +  2  —  cio][ft  +  1  —  Q-0  ~  bo]  ~  q  m  l[ao  +  +  l][oo] 

X  [n  +  2]2 


The  square  brackets  on  the  right-hand  side  of  Eq.  (31)  designate  the  polynomials 
in  q  defined  as  follows 

lr]  =  4l'x>-f<imr,  (32) 

and  the  “binomial  coefficients”  are  defined  in  analogy  to  their  standard  meaning 
by 


s\  [r  -  s]\[s]\ 


0  <  s  <  r 


where  now 


M-[r-  1] . [1],  r>  1 


r  =  0  . 


In  the  limit  q-*-  1  the  ^-dimension  formula  (3 1)  reduces  to  the  0(m,  m)  dimension 
formula  (22). 

Once  the  generating  polynomial  (29)  is  obtained,  it  is  sufficient  to  transform  it 
to  the  standard  form 

o-l 

Finjbo.cAq)  =  2  rrfHao,  b0)qk  ,  (35) 


by  reducing  its  exponents  modulo  n  to  the  numbers  in  the  set  {0,  1 , ,n  —  1}. 
The  coefficients  m°k(ao,bo)  then  represent  the  desired  dimensions  or  (^-multiplicities. 


MAPLE  Implementation  and  Results 

In  order  to  get  the  idea  of  algebraic  manipulations  that  are  involved  in  the  con¬ 
struction  of  generating  polynomials  (35),  yielding  the  desired  (^-multiplicities,  we 
briefly  consider  the  simplest  cyclic  polyene  with  the  nondegenerate  ground  state, 
representing  the  x-electron  model  of  benzene.  In  this  case  n  =  N  =  6,  m  =  3  so 
that  all  states  have  a  vanishing  azimuthal  quasi-spin  Qz  -  0.  The  possible  spin 
quantum  numbers  are  clearly  S  =  0,  1,2,  and  3,  so  that  the  following  U(6)  irreps 
are  involved: 

(a,  b,  c)  =  (3,  0,  3),  (2,  2,  2),  (1,  4,  1)  and  (0,  6,  0) .  (36) 

Considering  only  the  most  important  case  of  singlet  and  triplet  configurations  (b  =  0 
and  2,  respectively),  we  easily  find  the  following  decompositions  using  Eq.  (23), 

r(3,  0,  3)  =  r§(3,  0,  0)  ©  r?(2,  0,  l)  ©  r§(l,  0,  2)  ©  r§(0,  0,  3) ,  (37a) 
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and 


r(2, 2, 2)  =  rg(2,  o,  1)  ©  r?d,  2,  o)  ©  r°2(o ,  2,  i) ,  (37b) 

with  the  quasispin  Q ,  Eq.  (26),  indicated  by  the  subscript  on  the  0(3,  3)  irrep 
symbol  Tq. 

The  required  generating  polynomial  for,  e.g.,  the  irrep  T?(2,  0,  1)  is  thus  [see  Eq. 
(29)] 


Fn,Uq)  =  QnDn«[2,  0] , 


(38) 


where 


D%2,  0]  =  jjj 


oo 

OO 

[2j 

l3j 

[6][5]  -  q~\ 3][2] 


[8]2 

[7][6]([6][5]  -  g~4[3][2]) 
[3][2]2[1] 

with  [r]  defined  by  Eq.  (32).  Employing,  next,  the  decompositions 

[r]  =  [2p  +  1]  =  t[i2  —  q~'» 

=  [!](«'  +  Q"*'  +•••+«+  1  +«■'+•  • 

for  f>  =  1,2,  and  3  and  using  the  fact  that 

(92  +  1  +  <f2)(<l 2  +  q  +  1  +  q~~'  +  q~2) 

=  (q  +  \+  q~')(q3  +  q  +  1  +  q~'  +  q~3) , 


(39) 


+  <TP)  (40) 


(41) 


we  find  that 

Ifg[2,  0]  =  (q3  +  q2  +  q  +  1  +  q~'  +  q~2  +  q-3)(q2  +  1  +  q~2) 

X  (q3  +  q  +  1  +  q~l  +  q~3  -  q~ 4) ,  (42) 

which,  after  the  reduction  mod  6  gives 

D% 2,  0]  -  16  +  12 q  +  \6q2  +  12<?3  +  16#4  +  \2q5 , 

yielding  the  required  dimensions  mk( 2,  0),  k  =  0,  1,  . . . ,  5.  Note  that  in  view  of 
the  additional  symmetry  (Dn/l)  of  the  model,  as  well  as  the  particle-hole  symmetry 
for  the  half-filled  shell  case,  we  have  that  mk(a0,  b0)  =  m°n-k(ao,  bo). 

The  key  step  in  this  algorithm  is  clearly  the  factorization  of  the  ^-dimension  (31) 
and  the  expansion  and  normalization  of  the  resulting  generating  polynomial.  This 
is  achieved  using  the  following  code 
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# 

Dq2  :  =  factor  (Dq)  ; 

Dq3  :  =  q  qhat*DQ2  ; 

Dq4  :=  expand (Dq3 ) ? 

# 

reducer  proc  (t,  x,  mm)  local  d; 

d  :  =  degree (t,  x)  ? 

subs  (x  d  =  x  (  d  mod  ram),  t) ; 

end; 

# 

res  :=  map ( reducer , DQ4 , q, n) ; 
resfct  :  =  sort(res); 

Fq [ ic, jc]  :=  resfct; 

#  (43) 

where  Dq  designates  the  O (m,  m)  ^-dimension  produced  by  another  subroutine, 
qhat  =  R  as  given  by  Eq.  (30)  and  Dq3  is  the  O (m,  m)  generating  polynomial  (29). 
Following  the  factorization,  the  generating  polynomial  Dq3  is  expanded  and  reduced 
to  the  standard  form  (35)  with  the  help  of  a  procedure  called  “reducer.”  Note  that 
except  for  this  latter  procedure,  all  the  other  operations  (factor,  expand,  map,  and 
sort)  represent  the  standard  functions  in  MAPLE.  The  resulting  dimensionalities 
are  then  suitably  tabulated  or  may  be  used  by  other  programs. 

We  illustrate  the  results  in  Tables  I,  II,  and  III  giving  the  (^-multiplicities  for  all 
the  singlet  and  triplet  states  of  cyclic  polyenes  with  6,  10,  and  18  sites  (for  the 
singlets  of  benzene  see  also  Table  II  of  [31],  which  was  obtained  manually).  Other 
spin  multiplicities  or  polyenes  with  an  even  number  of  sites  ( n  =  2m),  or  their 
various  ions  (n  A  N)  can  be  obtained  using  the  same  code.  To  illustrate  the  efficiency 
of  MAPLE  we  present  some  typical  timings  in  Table  IV. 


Discussion 

The  results  presented  in  the  preceding  section  indicate  the  usefulness  of  the  sym¬ 
bolic  manipulation  languages,  such  as  MAPLE,  for  an  efficient  handling  of  rather 
complex  algebraic  expressions,  even  though  the  required  computational  time  in¬ 
creases  appreciably  with  the  increasing  polyenic  size.  Nonetheless,  a  good  deal  of 
useful  information  can  be  extracted  in  this  way  by  directly  exploiting  various  gen¬ 
erating  functions  mentioned  in  the  Introduction. 

We  also  wish  to  mention  that  although  the  Pariser-Parr-Pople  (ppp)  Hamiltonian 
[9,39]  is  not  invariant  with  respect  to  quasi-spin,  it  exhibits  the  so-called  alternancy 
symmetry  [40,41],  which  is  intimately  related  with  the  parity  of  quasi-spin  [31,38]: 
even  (odd)  quasi-spin  species  correspond  to  minus  (plus)  states  in  Pariser’s  classi¬ 
fication  [40].  Moreover,  even  the  ppp  Hamiltonian  spectral  properties  may  be  better 
understood  when  quasi-spin  is  employed  as  an  approximate  symmetry  [42].  Further, 
we  note  that  in  Ref.  [3 1]  we  also  presented  the  generating  polynomials  for  the  direct 
U(«)  \  Cn  subduction  in  which  ^-dimension  analogue  of  Weyl-Paldus  formula  (3) 
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Table  I.  Dimensions  of  various  subproblems  labeled  by  the  quasispin  Q  or  by  the  corresponding 
0(3,  3)  irrep  label  ( a0 ,  b0,  c0),  c0  =  3  -  Oq  -  b0  and  quasi-momentum  k(k  =  0,  1,  . . . ,  5)  for  both 
singlets  (a)  and  triplets  (b)  of  the  cyclic  polyene  with  n  =  N  =  6. 

(a)  Singlets 


(flo,  bo) 

lc 

(3,  0) 

(2,  0) 

(1,0) 

(0,  0) 

0 

16 

12 

4 

0 

1,5 

8 

16 

3 

0 

2,  4 

14 

12 

4 

0 

3 

10 

16 

2 

1 

Dim  r(a0,  ^o) 

70 

84 

20 

1 

Q 

0 

1 

2 

3 

(b)  Triplets 


(0o,  b0) 

k 

(2,  0) 

(1,2) 

(0,  2) 

0 

12 

16 

2 

1,5 

16 

14 

3 

2,4 

12 

16 

2 

3 

16 

14 

3 

Dim  r(ao,  bo) 

84 

90 

15 

Q 

0 

1 

2 

replaces  the  O (m,  m)  formula  (22)  or  (31).  Clearly,  this  formalism  can  be  employed 
in  the  absence  of  quasi-spin  symmetry. 

Let  us,  finally,  briefly  indicate  a  general  relationship  between  the  ^-dimension 
generating  functions  and  those  usually  encountered  in  combinatorial  type  ap¬ 
proaches  that  often  employ  Gaussian  polynomials  (see  Appendix  of  [31]  for  a  brief 
outline  of  ^-characters  and  ^-dimensions).  The  basic  quantity  in  the  ^-character 
formalism  is  the  g-Weyl  denominator  function  Pq, 

Pq=  n  (4“/2  -  <T“/2) ,  (44) 

where  $+  designates  the  set  of  positive  roots  of  the  simple  Lie  algebra  L .  For  the 
^-dimensions,  the  relevant  denominator  function  becomes 

PM=  n  -  fiM) .  (45) 
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Table  II.  Dimensions  of  various  subproblems  labeled  by  the  quasispin  Q  or  by  the  corresponding  0(5, 


5)  irrep  label  (do, 

(d)  Singlets 

bo,  c0),  c0  =  5  -  d0  ~  b0  and  quasi-momentum  k(k 
and  triplets  (b)  of  the  cyclic  polyene  with  n 

=  0,1,... 
=  N  =  10. 

,  9)  for  both  singlets  (a) 

(flo,  bo) 

k 

(5,  0) 

(4,  0) 

(3,  0) 

(2,  0) 

(1,0) 

(0,  0) 

0 

576 

876 

420 

74 

6 

0 

1,9 

534 

906 

405 

80 

5 

0 

2,8 

574 

876 

420 

74 

6 

0 

3,7 

534 

906 

405 

80 

5 

0 

4,6 

574 

876 

420 

74 

6 

0 

5 

536 

906 

405 

80 

4 

1 

Dim  r(tf0,  bo) 

5544 

8910 

4125 

770 

54 

1 

Q 

0 

1 

2 

3 

4 

5 

(b)  Triplets 


(cio,  b0) 


k 

(4,  0) 

(3,  2) 

(2,  2) 

(1,2) 

(0,  2) 

0 

876 

1396 

588 

96 

4 

1,  9 

906 

1376 

600 

93 

5 

2,8 

876 

1396 

588 

96 

4 

3,7 

906 

1376 

600 

93 

5 

4,6 

876 

1396 

588 

96 

4 

5 

906 

1376 

600 

93 

5 

Dim  T(oo,  bo) 

8910 

13860 

5940 

945 

45 

Q 

0 

1 

2 

3 

4 

where  (,)  designates  Weyl  group  invariant  bilinear  form  induced  on  the  weights  by 
the  Killing  form  [43].  Clearly,  the  factors  appearing  on  the  right-hand  side  of  Eq. 
(45)  give  rise  to  the  basic  factors  [r],  Eq.  (32),  in  the  generating  polynomials 
employed. 

It  is  straightforward  to  realize  that  the  (/-binomial  coefficients,  Eq.  (33),  appearing 
in  the  (/-dimension  formulas,  are  in  fact  very  closely  related  to  Gaussian  polynomials, 
which  are  defined  as  follows 


Mg! 

[r-s]G![s]G!’ 


0  <  s  <  r 


(46) 


Table  III.  Dimensions  of  various  subproblems  labeled  by  the  quasispin  Q  or  by  the  corresponding  0(9,  9)  irrep  label  (do,  b0,  Cq ),  c0  =  9  -  a0  -  b0  and  quasi¬ 
momentum  k(k  =  0,  1,  . . . ,  17)  for  both  singlets  (a)  and  triplets  (b)  of  the  cyclic  polyene  with  n  =  N  =  18. 

(a)  Singlets 
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where  again 


but  where  now 


Since,  obviously 


we  have  that 


WG[r-l]G---[l]G,  r>  1  , 

(47) 

.1,  r=  0, 

1 

II 

(48) 

[r]  =  -CMc , 

(49) 

1  =  , 

(50) 

UJ  L-yJc 

so  that  we  can  easily  express  our  generating  polynomials  in  terms  of  Gaussian-type 
quantities,  involving  integral  powers.  Thus,  for  the  U(«)  (/-dimension  (cf.  Eq.  (39) 
of  [31]) 

r  [b  +  1]  \n  +  l]\n  +  11 
D„[a,  b]  =  J  ,  (51) 

[n  +  1]  [  a  J[  c  vjj.' 

which  in  the  limit  q  -►  1  reduces  to  Eq.  (3),  we  find  that 

Dg[a,  b]  =  qPGDq[a,  b]G  (52) 

where  now 

O.I<.,Ok-&±#!["+ll[”+1l  .  (53) 


[n  +  1]g  I  a  U  c 


G  L  ^  J  G 


Pg  =  \b(n  -  c)  -  £ N(n  -  a) .  (5 

Similarly,  defining  the  Gaussian  analog  of  the  Q(m,  m)  q- dimension,  as  follows 


D°a[a0,  bo\( 


[60  +  l]G  \n  +  21  f  n  +  2 


[n  +  1  ]G  I  a0  Uoq  +  b0  +  1 


w  [«  +  2  -  aohln  +  1  -  a0~  b0]G  ~  cp  2*°  b°l^o  +  b0  +  1]gWg 
X  ln  +  2fG  ’  (55) 


we  get  for  the  generating  polynomial  F(ao>bo,Co)(q)  the  expression 


F(ao,bQ,coM)  ~  QVGF>°G[a 0,  bQ]G 


(56) 
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where 

Vg  —  \N(n  +  1)  —  (m  +  1)(<2  —  tfo)  +  cio{ao  +  ^o)  —  jbo(n  —  bo) .  (57) 

It  would  be  worthwhile  to  explore  this  form  of  generating  functions  using  both 
computational  as  well  as  analytical  approaches.  In  the  latter  case,  one  can  exploit 
the  fact  that  the  logarithm  of  a  Gaussian  polynomial  together  with  the  multiplicative 
Gaussian  factors  (48)  results  in  a  simple  sum  of  terms  of  the  type  log(l  -  x)  which 
when  expanded  can  provide  a  rather  simple  recursive  scheme  for  the  calculation 
of  the  desired  polynomial  coefficients  [15].  Concerning  the  computational  advan¬ 
tages  of  the  Gaussian  polynomial  form,  our  preliminary  tests  indicate  that  it  speeds 
up  the  execution  by  about  20%. 
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Abstract 

The  coupled  cluster  method  is  widely  recognized  nowadays  as  providing  an  ab  initio  method  of  great 
versatility,  power,  and  accuracy  for  handling  in  a  fully  microscopic  and  systematic  way  the  correlations 
between  particles  in  quantum  many-body  systems.  The  number  of  successful  applications  made  to  date 
within  both  chemistry  and  physics  is  impressive.  In  this  article  we  review  our  recent  extensions  of  the 
method  which  now  provide  a  unifying  framework  for  also  dealing  with  strongly  interacting  infinite 
quantum  lattice  systems  described  by  a  Hamiltonian.  Such  systems  include  both  spin-lattice  models 
(such  as  the  anisotropic  Heisenberg  or  XXZ  model)  exhibiting  interesting  magnetic  properties,  and 
electron  lattice  models  (such  as  the  t/and  Hubbard  models),  where  the  spins  or  fermions  are  localized 
on  the  sites  of  a  regular  lattice;  as  well  as  lattice  gauge  theories  [such  as  the  Abelian  t/(l)  model  of 
quantum  electrodynamics  and  non- Abelian  SU(n)  models].  Illustrative  results  are  given  for  both  the 
XXZ  spin  lattice  model  and  U(  1 )  lattice  gauge  theory.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

The  power  of  the  coupled  cluster  method  (CCM)  as  an  ab  initio  technique  in 
microscopic  quantum  many-body  theory  is  widely  known  and  well  documented 
in  quantum  chemistry  [1,2],  particularly  where  results  of  extremely  high  accuracy 
are  required,  such  as  for  the  calculation  of  molecular  energy  differences  of  chemical 
significance  or  for  the  calculation  of  parity  violation  in  atoms.  However,  the  CCM 
has  also  been  used  much  more  widely  in  quantum  many-body  theory  and  quantum 
field  theory,  where  it  has  now  become  what  is  probably  the  most  powerful,  most 
universally  applicable,  and  numerically  most  accurate  of  all  available  fully  micro¬ 
scopic  frameworks  for  describing  in  a  fully  rigorous  fashion  the  quantum  correlations 
between  systems  of  interacting  particles.  The  number  of  successful  applications  to 
date  within  both  chemistry  and  physics  is  truly  impressive.  In  most  cases  the  nu¬ 
merical  results  achieved  are  either  the  best  or  among  the  best  available.  A  recent 
review  for  a  quantum  chemistry  audience  of  the  applications  of  the  CCM  outside 
chemistry  has  been  given  by  one  of  the  present  authors  [3].  Bartlett  has  given  a 
comparable  review  [1]  within  chemistry.  In  addition,  there  have  been  many  recent 
reviews  of  the  method  itself,  including  those  given  in  Refs.  [3-12]. 

This  widespread  success  of  the  CCM  applications  to  both  finite  and  extended 
systems  in  continuous  space  has  naturally  led  to  the  method  being  applied  in  the 
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last  few  years  [13-24]  to  quantum-mechanical  systems  defined  on  an  extended 
regular  spatial  lattice.  Examples  of  such  applications  include:  (i)  spin-lattice  systems, 
such  as  the  solid  phases  of  3He  [13]  and  various  models  of  interest  in  magnetism, 
e.g.,  the  spin- 5  anisotropic  Heisenberg  (or  XXZ)  model  [14]  and  its  extension  to 
higher  spins  [15],  and  the  spin-1  Heisenberg-biquadratic  model  [16];  (ii)  models 
of  strongly  interacting  electrons  on  lattices,  such  as  the  Hubbard  model  [19-21]; 
and  (iii)  lattice  gauge  field  theories  [22-24]. 

The  recent  discovery  of  high-temperature  superconductors  has  played  a  large 
part  in  the  resurgence  of  interest  in  spin  and  electron  lattice  models.  For  example, 
among  the  various  models  proposed,  the  Hubbard  model  of  electrons  on  a  two- 
dimensional  (2D)  square  lattice  is  widely  believed  to  describe  the  essential  corre¬ 
lations  of  the  active  electrons  in  these  ceramic  cuprate  materials.  Furthermore,  the 
antiferromagnetic  Heisenberg  model  of  spins  on  a  lattice  can  be  derived  from  the 
Hubbard  model  at  half-filling,  and  is  itself  believed  to  describe  the  electronic  prop¬ 
erties  of  the  undoped  insulating  precursors  to  the  high -Tc  materials. 

We  also  note  that  even  quantum-mechanical  spin-lattice  systems  with  very  simple 
Hamiltonians  are  often  endowed  with  very  complicated  physical  properties  and 
intricate  phase  diagrams.  Their  behavior  is  often  quite  unlike  that  of  their  classical 
counterparts.  An  obvious  example  is  the  one-dimensional  (ID)  Heisenberg  model, 
which  is  fully  integrable  by  the  Bethe  ansatz  technique  [25]  in  the  spin-  \  case, 
from  which  it  is  known  to  have  a  gapless  excitation  spectrum,  but  which  is  con¬ 
jectured  [26]  to  have  a  nonzero  gap  in  the  nonintegrable  spin-1  case.  More  generally, 
such  quantal  spin  systems  often  exhibit  a  rich  variety  of  orderings  and  phases  as 
functions  of  the  combinations  of  interactions  (e.g.,  nearest-neighbor  plus  next- 
nearest-neighbor),  the  lattice  structure,  the  spin  of  the  particles,  the  number  of 
spatial  dimensions,  and  such  other  external  variables  as  the  applied  magnetic  field. 
A  single  system  can  thus  display  one  or  more  of:  phases  with  Ising-like  long-range 
order,  dimerized  or  trimerized  phases,  valence-bond  solid  structure,  and  much 
other  exotic  and  often  only  poorly  understood  structure.  In  the  light  of  both  this 
richness  of  behavior  and  the  existence  of  various  exact  solutions  for  several  models 
with  certain  values  of  their  parameters,  spin-lattice  models  provide  a  particular 
challenge  for  quantum  many-body  theory  to  provide  an  accurate  and  detailed  mi¬ 
croscopic  description  of  their  manifold  properties. 

A  comparable  challenge  is  provided  by  lattice  gauge  field  theory,  which  was 
originally  introduced  to  provide  a  cure  for  the  chronic  divergences  inherent  in 
comparable  field  theories  in  a  spatial  continuum.  Lately,  lattice  gauge  models  have 
become  of  interest  in  their  own  right,  partly  because  they  provide  an  especially 
severe  test  for  theoretical  techniques  due  to  their  intrinsically  nonperturbative  char¬ 
acter  and  the  limitations  imposed  by  the  requirement  to  impose  manifest  gauge- 
invariance  at  all  stages.  However,  lattice  gauge  models  share  many  of  the  features 
and  attributes  of  spin-lattice  models.  It  has  been  our  intention  to  exploit  this  sim¬ 
ilarity  and  to  develop  the  COM  as  a  unifying  framework  for  treating  all  infinite- 
lattice  Hamiltonians.  In  particular,  we  have  shown  how  various  CCM  lattice  ap¬ 
proximation  schemes  developed  for  the  spin  models  can  be  simply  modified  for 
use  with  both  Abelian  and  non-Abelian  lattice  gauge  theories. 
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The  remainder  of  this  article  is  organized  as  follows.  In  the  next  section  we 
describe  the  basic  elements  of  the  CCM  which  are  needed  for  the  analysis  of  a  general 
many-body  system.  In  the  third  section  we  consider  in  some  detail  the  XXZ  model 
Hamiltonian  as  an  archetypal  spin-lattice  problem  to  which  the  CCM  can  be  applied. 
Particular  attention  is  paid  to  the  development  of  suitable  approximation  schemes. 
We  also  show  that  these  lead  to  results  of  very  high  accuracy  at  relatively  low  orders 
of  implementation.  In  the  subsequent  section  we  demonstrate  how  analogues  of 
these  schemes  may  be  applied  with  comparable  success  to  lattice  gauge  theories, 
where  we  choose  the  U(  1 )  model  as  a  concrete  example.  Our  findings  are  sum¬ 
marized  in  the  final  section,  where  we  also  indicate  possible  further  extensions  of 
the  CCM  and  consider  its  potential  for  even  wider  applications. 

Basic  Elements  of  the  CCM  Formalism 

In  order  to  describe  quantitatively  the  multiparticle  correlations  in  a  quantum 
many-body  system  one  always  needs  a  model  (or  reference)  state  |  $)  against  which 
to  compare  them.  In  the  CCM  the  only  requirement  on  |  <t>)  is  that  it  forms  a  cyclic 
vector  with  respect  to  which  we  may  define  two  subalgebras,  namely  that  of  the 
multiconfigurational  creation  operators  { Cj}  and  that  of  their  Hermitian-conjugate 
destruction  counterparts  { Cj } .  Very  importantly,  both  subalgebras  are  required  to 
be  Abelian,  i.e.,  [C],  C =  0  =  [Cj,  Cj'],  where  each  set-index  /  labels  a  general 
multiparticle  cluster  configuration  with  respect  to  |  $),  which  itself  thus  plays  the 
role  of  vacuum  (or  reference)  state.  We  thus  require  that  the  operators  {C}}  and 
{ Cj }  are  complete,  in  the  sense  that  arbitrary  ket  and  bra  states  within  the  appro¬ 
priate  many-body  Hilbert  space  may  be  decomposed  as  the  respective  linear 
combinations, 

l*>=  2//3l$>;<'*rl  =  (i) 

J  J 

This  is  clearly  achievable  if  the  identity  operator  I  within  has  the  resolution, 

1=  2  CJ|*><*|G.  (2) 

J 

For  practical  purposes  it  is  also  convenient,  but  not  essential,  if  the  operators  form 
an  orthonormal  set,  in  the  following  sense, 

<*l  CjCl'\*y  =  5(J,J'),  (3) 

where  8(J ,  /')  is  some  (suitably  defined)  delta  function  between  the  set  indi¬ 
ces  {/}. 

As  an  example  we  consider  the  familiar  application  of  the  CCM  to  an  V-electron 
molecule.  Here  |  <£)  is  chosen  as  an  V-electron  Slater  determinant  formed  from 
the  lowest  N  levels  of  an  (orthonormal)  set  of  suitably  defined  molecular  (spin-) 
orbitals.  The  set-indices  in  the  fixed- N Hilbert  space,  J ->  {px ,  •  •  • ,  pm;  ,  •  *  • , 
«w},  now  represent  an  m-particle/m-hole  cluster  configuration,  where  the  single¬ 
particle  indices  { a, }  label  the  single-hole  states  that  can  be  created  by  removing 
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an  electron  from  |$),  and  the  single-particle  indices  {/?,  }  label  the  remaining 
unoccupied  or  particle  states. 

The  CCM  is  now  characterized  by  writing  the  exact  ground  ket  state  |  ^0)  of  the 
many-body  Hamiltonian  H  in  terms  of  an  exponentiated  cluster  correlation  opera¬ 
tor  S , 

l*o>  =  **!*>;  //|^>  =  £ol*o>,  (4) 

which  may  be  decomposed  wholly  and  exactly  in  terms  of  creation  operators, 

(5) 

j 

where,  by  definition,  the  prime  on  the  sum  over  configurations  {/}  excludes  the 
J  =  0  term  corresponding  to  the  identity  operator,  Cj  =  L  We  thus  have  the 
intermediate  normalization  condition,  |  ^0)  =  (4>  |  $)  =  1,  in  the  usual  scheme 
where  (  <£  |  C]  |  <l>>  =  0,  V/  +  0.  The  correlation  operator  of  an  N-body  system  may 
itself  also  be  decomposed  as  a  linear  superposition  of  «-body  partitions,  S  = 
Sn,  where  Sn  describes  the  configurations  formed  by  exciting  n  particles  from 
their  single-particle  states  occupied  in  |  <£) . 

The  reason  for  writing  the  correlation  operator  in  the  exponential  form  of  Eqs. 
(4)  and  ( 5 ),  rather  than  in  the  linear  form  of  Eq.  ( 1 )  which  is  typical  of  the  con¬ 
figuration-interaction  (Cl)  method,  lies  at  the  heart  of  the  CCM  [3,5,7] .  Particularly 
important  features  of  the  exponential  form  are:  (i)  its  proper  counting  of  independent 
excitations  (e.g.,  n  independent  pairs  of  particles  excited  from  |  T)  are  properly 
described  by  the  wave  function  ( 1  /n\)S2  I  $));  (ii)  its  automatic  incorporation  of 
the  important  linked-cluster  theorem  of  Goldstone;  and  (iii)  its  incorporation  of 
size-extensivity,  so  that  such  extensive  variables  as  the  ground-state  energy  Eq  scale 
properly  with  particle  number  in  the  infinite  limit  N  -►  oo ,  even  when  S  is  truncated. 

In  practice  the  CCM  is  implemented  by  restricting  the  cluster  amplitudes  {<£/} 
to  some  (finite  or  infinite)  subset  of  the  entire  set,  within  a  well-defined  hierarchy 
of  such  approximations.  Solutions  for  the  amplitudes  retained  are  then  found  by 
first  rewriting  the  ground-state  Schrodinger  Eq.  (4)  in  the  similarity-transformed 
form, 

(e~sHes  -  E0)\$)  =  0  ,  (6) 

which  underpins  the  CCM.  By  taking  the  inner  products  of  Eq.  (6)  with  the  complete 
set  of  states  comprising  |  <£)  itself  and  { C}  |  <£);  J  ¥*  0 } ,  we  find,  respectively, 

<$|e-5//e5|$>  =  E0)  (7) 

($\Cje-sHes\$)  =  0;  0.  (8) 

The  set  of  Eqs.  (8)  represents  a  coupled  set  of  nonlinear  equations  for  the  c-number 
coefficients  { &j} ,  equal  in  number  to  the  number  of  retained  configurations  {/} . 
In  terms  of  their  solutions,  Eq.  (7)  then  gives  the  corresponding  approximant  for 
the  ground-state  energy,  E0  =  E’ot^y] . 

The  well-known  expansion  in  terms  of  nested  commutators. 
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e-sHes  =  H+lH,S]  +  ^[[H,S],S)+  ■■■  ,  (9) 

together  with  the  fact  that  each  of  the  individual  terms  in  the  expansion  of  Eq.  (5) 
for  the  operator  S  commutes  with  all  others,  ensures  that  every  element  of  S  in  Eq. 
(5 )  is  linked  directly  to  the  Hamiltonian  in  each  of  the  terms  in  Eq.  (9).  No  unlinked 
(or,  equivalently,  disconnected)  terms  or  diagrams  can  arise,  and  each  of  the  coupled 
equations  (8)  is  of  linked-cluster  type.  Furthermore,  when  each  term  in  the  Ham¬ 
iltonian  H  in  second-quantized  form  contains  only  a  finite  number  of  single-particle 
destruction  operators  defined  with  respect  to  1 3>),  as  is  usually  the  case,  the  otherwise 
infinite  series  of  Eq.  (9)  will  always  terminate  after  a  finite  number  of  terms.  As  a 
consequence,  the  left-hand  sides  of  Eqs.  ( 7 )  and  ( 8 )  are  expressions  of  finite  order 
in  the  retained  coeflicients  {<£/},  and  no  further  (artificial  or  approximate)  trun¬ 
cation  is  required  for  their  evaluation.  The  CCM  thus  fundamentally  differs  from 
its  unitarity-transformation  counterpart  that  would  arise  in  a  standard  variational 
formulation,  in  which  the  bra  state  (  |  is  simply  taken  as  the  manifest  Hermitian 

conjugate,  (^le^,  of  |^0)- 

The  above  feature  is  very  important,  and  leads  naturally  to  the  biorthogonal 
description  of  states  within  the  CCM  (rather  than  the  more  usual  orthogonal  de¬ 
scription),  in  which  the  manifest  Hermitian  conjugacy  of  corresponding  bra  and 
ket  states  is  lost.  While  the  bra  states  are  not  needed  to  calculate  the  ground-state 
energy,  they  are  needed  to  calculate  such  other  ground-state  expectation  values  as 
order  parameters.  In  what  has  become  known  [  5  ]  as  the  normal  version  of  the 


CCM,  the  bra  ground-state  is  parametrized  in  the  form, 

<f0|  =<*!&-*;  <*0|tf  =  £0<*ol  ,  (10) 

S=i  +  'Z'fjCJ,  (ii) 

J 

with  explicit  normalization, 

(’J'ol’i'o)  =  <$l*o>  =  <$!$)=  1  .  (12) 

Equation  (10)  readily  leads  to  the  equations, 

($|,S(e-sHes  —  EoJCl  1$)  =  0,  V/,  (13) 

which,  by  making  use  of  Eq.  (7)  and  the  fact  that  [5,  Cj]  =  0  for  all  set-indices  J, 
may  easily  be  rewritten  as, 

<$ ISfe-5!//,  C}]e‘s|<i,>  =  0,  V/.  (14) 


Equations  ( 13)  and  (14)  have  the  form  of  coupled  but  linear  equations  for  the  bra- 
state  coefficients  { $j }  once  the  ket-state  coefficients  { <£/}  are  used  as  known  input. 
Once  again,  the  resulting  equations  contain  only  finite  powers  of  S  since  the  nested 
commutator  expansions  again  terminate  as  before.  It  is  not  difficult  to  show  that 
Eq.  (13)  has  the  formal  solution 
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(<t>\S  =  (<t>\  -  (<Z>\e-sHesQ(Qe-sHesQ~  E0)~'Q  ,  (15) 

e^/-i*><*i,  (16) 

in  terms  of  which  the  ground-state  expectation  value  of  an  arbitrary  operator  A  can 
be  written  as 

A-A[*A-<*o\A\*o'> 

=  <$|e-^es|$>  -  <$| e~sHesQ(Qe-sHesQ  -  £-0)-,Ge-^es|#>.  (17) 

We  note  that  Eq.  (8)  guarantees  that  11  =  E0,  as  given  by  Eq.  (7). 

Excited  states  { I'E*)}  are  described  within  the  CCM  [27]  by  linear  excitation 
operators  {Xh}  which  act  on  the  corresponding  exact  ground-state  wave  function 

l*o>, 

|'EX>  =  ZX|'E0>  =  Xiei||t>);  H\9X)  =  -  (E0  +  ex)l**>  ,  (18) 

where  Xx  is  again  decomposed  wholly  in  terms  of  creation  pieces  with  respect 
to|*>, 

Xx=2'%)Cj.  (19) 

J 

Hence  the  operators  S  and  Xx  commute.  The  excited-state  Schrodinger  Eq.  ( 18) 
may  be  combined  with  its  ground-state  counterpart  of  Eq.  (4)  to  give  the  CCM 
eigenvalue  equation, 

e-s[//;Xx]es|$)  =  £^x|$>,  (20) 

for  the  excitation  energy,  =  E>,  —  Eo,  directly.  Equation  (20)  may  be  equivalently 
written  as, 

(Qe~sHesQ  -  E0)QXx\<f> >  =  exQXx |4>>  ,  (21) 

from  which  we  see  that  the  excitation  energies  are  given  by  diagonalizing  in  the 
space  spanned  by  the  states  { C]  \  $> ;  J  *  0 }  the  same  matrix  ( Qe~sHes  -  Eq)  as 
needs  to  be  formally  inverted  in  Eq.  ( 1 5 )  to  find  S.  In  all  practical  implementations 
of  the  CCM,  this  space  is  truncated  to  some  (finite  or  infinite)  subspace  spanned  by 
a  subset  of  the  states.  The  operators  S,  S,  and  X x  given  by  Eqs.  ( 5 ) ,  ( 1 1 ) ,  and  ( 1 9 ) 
are  thus  approximated  by  restricting  the  sum  over  configurations  {T}  to  the  ap¬ 
propriate  subset. 

In  the  following  two  sections  we  show  how  the  above  general  CCM  scheme  can 
be  applied  to  both  spin-lattice  Hamiltonian  systems  and  Hamiltonian  lattice  gauge 
theories. 


Applications  to  Spin  Lattices 

We  illustrate  the  application  of  the  CCM  to  spin-lattice  problems  by  considering 
the  archetypal  XXZ  or  anisotropic  Heisenberg  model  as  a  specific  example.  This 
model  comprises  N  spins,  with  spin  quantum  number  5,  on  the  sites  {/}  of  a  d- 
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dimensional  regular  lattice,  which  interact  via  nearest-neighbor  couplings  described 
by  the  Hamiltonian 


2 
p= i 


A sfsf+p  +  -  (sfsi+p  +  Si  st+P)  , 


(22) 


where  the  index  /  runs  over  all  N(~>  oo )  lattice  sites  with  the  usual  periodic  boundary 
condition  imposed;  the  index  p  runs  over  all  z  nearest-neighbor  sites  (i.e.,  z  is  the 
coordination  number  of  the  lattice);  the  operators  sz  and  sf  =  sf  ±  is]  are  spin-5 
operators  which  obey  the  usual  SU{  2)  angular  momentum  commutation  relations; 
and  A  is  the  anisotropy  parameter,  which  in  the  special  case  A  =  1  gives  the  isotropic 
Heisenberg  model.  For  present  purposes  we  consider  only  bipartite  lattices,  where 
the  lattice  is  composed  of  two  equivalent  alternating  sublattices,  denoted  as  A  and 
5.  Bipartite  lattices  thus  include  the  important  ID  chain  and  the  2D  square  lattice. 
The  restriction  to  bipartite  lattices  excludes  such  lattices  as  the  triangular  lattice 
and  such  associated  effects  as  frustration.  We  also  restrict  attention  here  to  the  case 
A  >  0,  where  for  the  ID  chain,  for  example,  the  system  is  known  to  be  antiferro¬ 
magnetic  in  its  ground  state. 

In  order  to  perform  a  CCM  analysis  of  this  model  we  must  now  first  choose  a 
model  state  | $).  Although  there  are  many  alternative  choices  [16],  we  illustrate 
the  approach  here  with  what  is  possibly  the  simplest  and  most  obvious  choice, 
namely  where  |  4?)  is  chosen  to  be  the  (classical)  Neel  state  | N}  in  which  all  spins 
on  the^-sublattice  have  z-component  sz  =  —s,  say,  and  all  those  on  the  5-sublattice 
have  sz  =  +5.  This  state  thus  has  perfect  antiferromagnetic  long-range  order,  but 
is  an  eigenstate  of  the  quantum  Hamiltonian  of  Eq.  (22)  only  in  the  Ising  model 
(A  -►  oo )  limit.  The  single-spin  creation  operators  with  respect  to  \N)  are  clearly 
the  spin-raising  operators  {  sf  }  for  sites  i  on  the  “spin-down”  y4-sublattice  and  the 
spin-lowering  operators  { sj }  for  sites  j  on  the  “spin-up”  5-sublattice.  Each  of  these 
operators  can  act  a  maximum  of  2s  times  on  a  single  site  before  the  state  |  N )  is 
annihilated. 

The  XXZ  Hamiltonian  of  Eq.  (22)  is  easily  seen  to  commute  with  the  operator 
total  =  2 /=i  sf.  Hence,  the  antiferromagnetic  ground-state  |  ^0)  may,  by  definition, 
be  sought  in  the  subspace  where  5f0tai  =  0.  Equations  (4)  and  (5)  thus  become,  in 
this  case, 

N/2 

|*0>  =  e5|AT>;  5=2^,  (23) 

m=  1 


with  cluster  correlation  operators  S2m  expressed  with  a  somewhat  arbitrary  choice 
of  normalization  constant  as, 


(-1)' 


{m\) 


n2 


i\h'  ’  •  imJljl '  •  ’jmSi\Sh 


Si  Sj'S, 


(24) 


*  *  '  lm  J\>J2, 


Jm 


where  indices  { U }  and  { jm }  refer  to  sites  on  the  A-  and  5-sublattices,  respectively. 
The  equivalent  analogues  of  Eqs.  (10)  and  (11)  are, 
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N/2 

<*ol  =(N\Se~s;  5  =  1  +  Z 

m=  1 


(25) 


Slm  = 


(-1)' 

{m\)‘ 


2 


1*2 *  •  •im.hh- •  ■  imShS>2  ‘  *  ’  S'mShSh 


}  Jm  5 


(26) 


’  ’  ’lmJhJ2> 


Jm 


Clearly,  for  spin-5  particles  the  coefficients  . . .  jm  and  $ix . . .  imjx  ,..jm  vanish 

if  any  one  of  the  lattice  indices  appears  more  than  25  times. 

The  ground-state  energy  per  particle  may  simply  be  calculated  from  Eq.  (7).  We 
find, 

|?  =  -y(A  +  2 %),  (27) 


where,  as  before,  z  is  the  lattice  coordination  number,  and  b\  =  < fiti+p  is  the  nearest- 
neighbor  pair  correlation  coefficient.  We  note  that  b\  is  independent  of  both  the 
index  i  (by  lattice  translational  invariance)  and  the  index  p  (by  the  lattice  symmetries 
under  rotations  and  reflections).  We  also  note  that  Eq.  (27)  is  exact.  The  coefficient 
b\  is  itself  calculated  from  the  coupled  sets  of  equations  for  the  coefficients 
•  -wr  •  •./„}>  which  are  determined  as  in  Eq.  (8),  with  Cj  ->  sjx  •  •  • 
sjnsjx  •  •  •  sjn.  As  usual  the  number  of  such  equations  must  be  truncated  in  practice 
by  restricting  the  configurations  retained  in  Eqs.  (24)  and  (26),  and  then  solving 
Eqs.  (8)  and  (14)  without  further  approximations,  keeping  in  each  case  the  same 
configurations  for  these  coupled  sets  of  equations,  so  that  we  have  one  equation  in 
each  set  for  each  retained  ket-  or  bra-state  coefficient.  The  algebraic  form  of  the 
sets  of  Eqs.  (8),  for  example,  is  that  of  a  coupled  set  of  multinomial  equations, 
each  term  of  which  is  easily  seen  from  Eq.  (9 )  to  be  of  degree  no  higher  than  fourth, 
by  noting  the  bilinear  form  of  the  Hamiltonian  of  Eq.  (22)  and  the  specific  SU{  2) 
commutation  relations.  Their  explicit  evaluation  is  straightforward,  although  very 
tedious  and  time-consuming  if  done  by  hand  if  more  than  a  few  configuration 
coefficients  are  retained.  Nevertheless,  the  algebraic  reduction  is  very  amenable  to 
automation  using  computer-algebraic  techniques,  and  many  of  the  actual  results 
given  below  have  been  found  this  way. 

All  that  remains  at  this  point,  in  order  to  perform  a  CCM  calculation,  is  to  consider 
appropriate  truncation  hierarchies.  These  cannot  ever  be  uniquely  specified,  and 
must  always  be  motivated  on  physical  grounds  for  the  problem  at  hand.  We  discuss 
here  two  such  schemes.  From  previous  applications  of  the  CCM  to  systems  defined 
in  a  spatial  continuum,  the  most  well-known  truncation  hierarchy  is  the  so-called 
SUB n  scheme  defined  by  truncating  the  summations  in  Eqs.  (23)  and  (25)  so  that 
one  retains  only  up  to  «-body  correlation  coefficients, 

S+SgUto-ZSn  S  -*■  5SUBn  —  1  +  2  &  .  (28) 

k= 1  k= 1 


For  example,  the  ccm  coupled  equations  for  the  two-spin-flip  ket-state  coefficients 
c $ij  are  now  found  from  the  relation, 
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(N\sTsJe~S2Hesi\N)  =  0 . 

Straightforward  evaluation  of  Eq.  (29)  yields  the  coupled  equations, 


( 1  +  2A bt  +  2 b2t)Sp,r  -  4s(A  +  2 bt)br  +  4s2  2  br'b. 


r-r'+p 
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0,  (30) 


where  br  =  $ij  with  r  =  j-  i,  and  we  have  taken  advantage  of  the  translational 
invariance  of  the  lattice.  We  note  that  in  ID  r  is  simply  a  (positive  or  negative) 
odd  integer,  whereas  in  2D  r  is  a  lattice  vector  connecting  sites  on  opposite  sub¬ 
lattices.  Rotational  and  reflectional  symmetry  imposes  extra  relations  among  the 
coefficients  { br }:  for  example,  in  ID,  b-r  =  br.  The  bra-state  coefficients  br  = 
are  defined,  and  equivalent  relations  derived,  similarly. 

The  full  set  of  nonlinear  SUB2  Eqs.  ( 30)  can  be  solved  analytically  by  a  sublattice 
Fourier  transform  method  [14].  For  the  ID  chain,  for  example,  the  solution  is 

,  r,  r  dq  cos  rq  r - —2 - 

br  =  K  \ - (1  -  VI  -  k2coszq) , 

Jo  7r  cos  q 

1  1  +  2Abi  +  2b] 

*-5(a  +  (A  +  2  b~- 

The  coefficient  bx  is  thus  determined  self-consistently  by  putting  r  =  1  in  Eq.  (31 ), 
from  which  the  ground-state  energy  may  be  immediately  found  from  Eq.  (27). 
Results  for  the  ID  spin-  \  chain  and  the  2D  spin-  \  square  lattice  are  given  in  Tables 
I  and  II,  respectively.  Very  interestingly,  the  exact  SUB2  solution  of  Eq.  (31 )  predicts 
a  phase  transition  at  the  terminating  point  in  Eq.  ( 3 1 )  for  which  kc=  1 ,  and 
beyond  which  the  equation  has  no  real  solution.  This  terminating  condition  de¬ 
termines  a  critical  anisotropy  Ac,  such  that  no  real  SUB2  solution  exists  for  A  <  Ac. 
Numerical  solutions  yield  the  values  Ac  «  0.3728,  0.7985,  and  0.8872  for  the  spin- 
\  models  on  the  ID  chain,  2D  square  lattice,  and  3D  cubic  lattice,  respectively. 


Table  I.  Ground-state  energy  per  spin  as  a  function  of  A  for  the  ID  spin-5  XXZ  model.  Shown  are 
the  results  of  the  full  sub2  and  various  lsubw  schemes,  defined  in  the  text,  together  with  the  exact  results 

for  comparison. 


Method 

A 

0.0 

0.5 

1.0 

1.5 

2.0 

5.0 

sub2 

-0.5075 

-0.6079 

-1.2986 

LSUB2 

-0.2887 

-0.3412 

-0.5069 

-0.6076 

-1.2986 

LSUB4 

-0.3193 

-0.5195 

-0.6155 

-1.2995 

lsub6 

-0.3198 

-0.3730 

-0.5218 

-0.6167 

-1.2995 

lsub8 

-0.3196 

-0.4414 

-0.5226 

-0.6170 

-1.2995 

LSUBlO 

-0.3194 

-0.3745 

-0.4420 

-0.5230 

-0.6171 

-1.2995 

Exact 

-0.3183 

-0.4432 

-0.5234 

-0.6172 

-1.2995 
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Table  II.  Ground-state  energy  per  spin  for  the  2D  spin-  \  Heisenberg  model  (A  =  1)  on  a  square  lattice. 
Shown  are  the  results  of  the  full  SUB2  and  various  lsub(w,  d)  schemes,  defined  in  the  text,  together  with 
the  results  of  spin-wave  theory  (swt)  of  Ref.  [29],  and  from  the  Green’s  function  Monte  Carlo  (gfmc) 
calculations  of  Ref.  [28]  for  comparison. 


sub2 

lsub(6,6) 

lsub(8,8) 

lsub(10,  10) 

SWT 

GFMC 

-0.6508 

-0.6483 

-0.6531 

-0.6644 

-0.6580 

-0.6692 

We  have  shown  elsewhere  [14,15],  by  the  further  calculation  of  the  spin-spin 
correlation  functions,  the  staggered  magnetization,  and  the  low-lying  excitations, 
that  the  above  critical  point  really  corresponds  to  the  known  Ising-Heisenberg 
phase  transition,  which  is  known  to  occur  at  A  =  1  in  the  ID  spin- \  chain.  (We 
also  note  that  when  more  of  the  higher-order  multi-spin  correlations  neglected  in 
SUB2  approximation  are  included,  the  value  of  Ac  moves  closer  to  this  exact  value 
[14],  as  expected.)  For  example,  we  have  shown  by  calculating  from  Eqs.  (18)  and 
(19)  the  spin-wave-like  excitations  on  top  of  the  SUB2  ground  state,  that  the  low- 
lying  excitation  spectrum  has  a  finite  gap  for  A  >  Ac  and  that  this  gap  exactly 
vanishes  as  A  -►  Ac  for  lattices  of  all  dimensions.  Furthermore,  the  staggered  mag¬ 
netization  Mz,  defined  for  a  spin  on  the  site  j  on  the  (“up”)  ^-sublattice  as, 

Mz  =  -<s/>  =  -<$|&-ss|e's|$)  ,  (32) 

s  s 

can  easily  be  evaluated  in  the  SUB2  approximation  as, 

Mlm2  =  1  -  45  2  brbr  .  (33) 

r 

The  exact  solution  of  Eq.  (31)  for  {br}  and  its  counterpart  for  {br}  yield  that 
MIub2  0  as  A  -►  Ac  in  the  ID  case,  in  agreement  with  the  exact  result  [25]  at 
A  =  1,  but  that  Ms ub2  remains  nonzero  as  A  -►  Ac  in  2D  and  3D.  For  example, 
for  the  important  2D  spin- 5  model  on  the  square  lattice,  MIvb2  0.682  at  A  = 
Ac.  This  value  is  in  good  agreement  with  the  results  of  recent  large-scale  Monte 
Carlo  simulations  and  the  other  best  available  calculations  cited  in  Ref.  [14] . 

We  now  turn  our  attention  to  an  alternative  approximation  scheme  which  is 
specifically  and  closely  tailored  to  lattice  systems.  The  capability  of  predicting  phase 
transitions  in  the  SUB2  and  higher  SUB n  schemes  is  clearly  related  to  their  inclusion 
of  at  least  some  correlations  of  arbitrarily  long  range.  However,  if  we  wish  to  calculate 
only  the  energy  or  other  properties  of  the  ground  state  which  are  not  sensitive  to 
possibly  nearby  phase  transitions,  we  can  employ  a  very  efficient  alternative  scheme 
which  in  practice  is  capable  of  very  high  accuracy.  This  scheme,  denoted  as  the 
LSUB n  scheme,  takes  advantage  of  the  localized  nature  of  the  interactions,  so  that 
one  retains  at  a  given  order  of  the  approximation  only  those  local  configurations 
in  the  cluster  correlation  operators  S and  S  specified  by  the  index  n.  Very  specifically, 
for  the  problem  at  hand  the  LSUB«  approximation  retains  only  those  configurations 
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which  contain  any  number  of  flipped  spins  with  respect  to  the  Neel  model  state 
over  a  localized  region  of  at  most  n  contiguous  sites  on  the  lattice,  and  which  are 
compatible  with  the  condition  that  $  total  =  0. 

As  a  simple  example  we  consider  the  spin-  \  1 D  chain.  The  lsub2  scheme  retains 
for  the  operator  S  only  the  single  coefficient  b{  =  <£*w+1 ,  whose  determining  equation 
from  Eq.  (30)  is  given  by, 

1  -  2AZ>!  -  36?  =  0  ,  (34) 

with  (physical)  solution  b\  =  ( V A2  +  3  -  A)/ 3,  and  corresponding  estimate  for 
the  ground-state  energy  given  from  Eq.  (27)  as  E0/N  =  —  5/ 12  —0.4167  for  the 
isotropic  ( A  =  1 )  Heisenberg  case.  This  is  surprisingly  close  to  the  full  SUB2  result 
of —0.4186  for  this  case.  For  the  higher-order  lsub«  approximations,  the  number 
of  independent  configurations  retained  in  the  correlation  operator  S  or  S  grows 
rapidly  with  n  (e.g.,  for  n  =  4,  6,  8,  and  10,  the  corresponding  numbers  of  inde¬ 
pendent  configurations  are  3,  9,  26,  and  81,  respectively).  These  are  illustrated  in 
Figure  1  for  the  lsub6  case.  As  can  be  seen  from  Table  I,  we  obtain  ground-state 
energies  with  very  high  accuracy  in  this  LSUB n  scheme,  even  for  relatively  low 
values  of  the  truncation  index  n ,  and  even  for  values  of  A  <  1  for  which  the  exact 
ground  state  has  no  vestige  at  all  of  the  perfect  long-range  order  present  in  the  Neel 
model  state. 

Turning  to  the  2D  lattice,  it  is  clear  that  the  number  of  configurations  at  a  given 
LSUB??  level  is  considerably  higher  than  in  ID.  For  example,  in  LSUB4  approximation 
for  the  square  lattice  there  are  seven  independent  spin-flip  configurations.  However, 
we  can  introduce  two  further  parameters  in  an  attempt  to  categorize  the  relative 
importance  of  the  configurations  more  finely.  The  first  of  these  is  the  number  w  of 
“wrong”  bonds  produced  by  flipping  spins  with  respect  to  the  Neel  state.  Classically, 
the  breaking  of  each  Neel  “up-down”  bond  in  the  antiferromagnetic  regime  costs 
energy,  and  hence  configurations  with  the  smallest  values  of  w  are,  a  priori,  likely 
to  be  the  most  important.  Secondly,  an  extension  of  the  seemingly  important  concept 
in  ID  of  locale  size  leads  us  to  consider  the  length  d  of  the  “domain  boundary”  of 
a  given  configuration,  defined  as  the  number  of  lattice  bonds  crossed  by  a  minimal- 
area  circuit  (as  indicated  by  the  dashed  lines  in  Fig.  2)  enclosing  all  of  the 
flipped  spins. 

We  may  thus  introduce  an  lsub(  w,  d)  approximation  for  the  2D  lattice  in  which 
only  those  spin-flip  configurations  with  values  of  these  parameters  not  exceeding 

x - x  x - x - - - x - x  x - • - x - x - • - x 


x- 


X  Xr 


x  x - x - x- 


x 


X - X - X - X  X - X - - - - - X - X  X - X - X - X - X - X 

Figure  1.  The  nine  independent  configurations  of  the  LSUB6  scheme  for  the  ID  spin-£ 
XXZ  model.  The  crosses  indicate  the  flipped  spins  with  respect  to  the  Neel  model  state. 
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(8,10) 


(8,10) 


(10,10)  (10,10) 


(10,10)  (10,10) 

Figure  2.  The  eight  independent  configurations  of  the  LSUB(  10,  10)  scheme  for  the  2D 
spin -\XXZ  model  on  a  square  lattice.  The  weights  (w,  d)  defined  in  the  text  are  shown 
below  each  configuration. 


w  and  d ,  respectively,  are  retained.  For  example,  the  lsub(  10,  10)  approximation 
retains  the  8  configurations  shown  in  Figure  2.  These  include  all  7  of  the  2D  lsub4 
configurations,  plus  the  most  compact  6-spin-flip  lsub6  cluster.  Results  for  the 
ground-state  energy  of  the  2D  spin-  \  Heisenberg  ( A  =  1 )  square  lattice  are  shown 
in  Table  II.  We  observe  again  the  excellent  convergence  properties  of  our  approx¬ 
imation  scheme,  by  comparing  with  the  available  Green’s  function  Monte  Carlo 
(GFMC)  results  [  28  ] ,  and  note  the  marked  improvement  over  the  spin-wave  theory 
(swt)  results  [29].  It  is  interesting  to  note  that  just  as  in  the  LSUB«  scheme  for 
the  ID  model,  our  lsub(  w,  d)  scheme  consistently  gives  good  results  for  the  ground- 
state  energy,  for  all  values  of  A  ^  0.  We  also  point  out  that  our  local  CCM  approx¬ 
imation  schemes,  such  as  the  lsub n  and  lsub(w,  d)  schemes,  systematically  re¬ 
produce  the  various  terms  in  the  perturbation  theory  series  in  the  A  oo  limit 
[  14],  at  the  same  time  as  they  provide  an  effective  continuation  of  the  perturbative 
results  far  into  the  nonperturbative  regime.  This  same  characteristic  of  the  CCM 
schemes  will  be  emphasized  again  in  the  following  applications  to  lattice  gauge 
theory.  Finally,  at  a  rather  more  pragmatic  level,  it  is  also  gratifying  to  note  that 
the  LSUB(  w,  d)  scheme  provides  a  rather  effective  practical  selection  of  the  clusters 
of  greatest  significance,  to  the  extent  that  at  any  given  order  the  numerical  mag¬ 
nitudes  of  the  coefficients  of  the  neglected  clusters  are,  by  and  large,  small  in  com¬ 
parison  to  those  retained,  for  all  values  of  A  ^  0.  We  have  verified  this  observation 
both  by  comparison  with  the  A  =  1  results  of  Ref.  [17]  and  by  several  calculations 
of  our  own  carried  out  to  higher  orders. 
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Applications  to  Lattice  Gauge  Field  Theory 

We  turn  now  to  analogous  applications  of  CCM  techniques  to  lattice  gauge  theory, 
where  as  a  specific  example  we  shall  consider  the  Abelian  U(  1 )  model  of  quantum 
electrodynamics  in  two  discretized  spatial  plus  one  continuous  time  (2  +  1)  di¬ 
mensions.  The  t/(  1 )  group  element  on  a  link  /  originating  at  a  lattice  site  with 
position  vector  n  in  (positive)  direction  k  is  C4(n)  =  exp[/^(n)],  and  the  lattice 
Hamiltonian  is 


H  =  \  2  Eli n)  +  x  2  [1  -  cos  5*(n)] . 

^  k,n  k, n 


(35) 


where  ^(n)  is  the  electric  field  on  the  links  of  the  lattice  and  Bk(n)  is  the  magnetic 
field  defined  as  the  lattice  curl,  Bt( n)  =  eiJk[Ak(n  +  ey)  —  ^4^(n) ]  around  the  ele¬ 
mentary  square  plaquettes  defined  by  the  unit  lattice  vectors  e, .  Quantum  mechanics 
is  imposed  in  the  temporal  gauge  through  the  basic  commutation  relation,  [Ak(n), 
Ek'(n')]  =  idkk'dnn',  which  is  realizable  via  the  representation  Ek(n)  ->•  -id/8Ak( n). 
The  Hamiltonian  may  then  be  represented  wholly  in  terms  of  the  plaquette  variables, 
Bp  (=Bk(n)),  in  the  gauge-invariant  sector.  For  example,  in  2  +  1  dimensions,  we 
find 


H=  2 

p 


+  X(  1  -  cos  Bp) 


+ 


2 

<P*P'> 


a 2 

dBpdBp ,  ’ 


(36) 


where  the  second  sum  over  (/?,/?')  indicates  all  nearest  neighbor  pairs  of  plaquettes. 
Equation  (36)  thus  defines  an  infinite  many-body  problem  with  compact  vari¬ 
ables,  -  7T  <  Bp  <  7T. 

In  the  non- Abelian  SU{ N)  case  the  basic  variables  are  the  SU(N)  matrices  defined 
on  each  link  /  in  terms  of  the  N2  —  1  group  generators,  and  the  conjugate  “chro- 
moelectric”  fields  Ef  have  N2  -  1  components.  Thus,  for  577(2)  the  group  element 
on  each  link  has  the  general  form  U  =  d0 1  +  id  •  a,  where  the  uk  are  the  usual  Pauli 
matrices,  and  the  real  coefficients  da  lie  on  a  sphere  in  four-dimensional  Euclidean 
space,  do  +  dk  =  1.  The  potential  term  in  Eq.  (36)  generalizes  to  X  (N  — 
ReTrf/p),  where  Up  =  U{U2U\U\  for  the  plaquette  formed  from  the  four  links  /  = 
1,  2,  3,  4  in  cyclic  order. 

Our  main  goal  is  now  to  provide  a  CCM  parametrization  of  the  physical  (i.e., 
gauge-invariant)  ground  state  and  excited  (“gluebaH”)  states  of  the  above  Hamil¬ 
tonian  in  the  vacuum  sector  [22,24] .  We  thus  need  firstly  to  choose  a  model  state 
1 4>).  A  convenient  choice  is  the  so-called  electric  (X  0)  vacuum  |0),  for  which 
2s/|0)  =  0  for  all  /.  The  CCM  now  parametrizes  the  exact  ground  state  in  terms  of 
|  $)  1 0)  as  in  Eq.  (4),  with  a  cluster  correlation  operator  S given  by  a  sum  over 

all  possible  linearly  independent  linked  clusters  of  Wilson  loops, 


5=22  StAAp),  (37) 

r  P 

with  T  (=  {7,;  /  =  1,  2,  •**,«};«=  1,  2,  •  •  •)  and  p  denoting  the  geometry 
and  position  of  the  oriented  loop  clusters  respectively,  and  Ar  =  Ayt  •  •  •  AJn,  with 
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Ay  =  Tr[  £/*  •  •  U]y  for  each  closed  loop  y.  The  ground-state  energy  E0  and  cor¬ 
relation  coefficients  {<£r}  are  determined  as  in  Eqs.  (7)  and  (8),  respectively,  with 
Cj  A\(p),  and  where  the  inner  products  incorporate  the  appropriate  group 
measure.  Furthermore,  for  the  Hamiltonian  of  Eq.  (36)  and  the  correlation  operator 
of  Eq.  (37),  the  expansion  of  Eq.  (9)  terminates  at  the  term  of  second  order  in  S. 

Henceforth  we  focus  on  the  specific  U(  1 )  model  of  Eq.  ( 36)  in  2  +  1  dimensions. 
(In  earlier  work  [24]  we  have  studied  the  same  U(  1 )  model  in  1  +  1  dimensions, 
which  has  similar  qualitative  behavior.  By  contrast,  the  same  model  in  3  +  1  di¬ 
mensions  is  expected  to  show  marked  differences.)  The  electric  vacuum  state  is 
thus  simply  a  constant.  Furthermore,  the  operator  S  can  be  partitioned  into  k- 
plaquette  pieces,  S  -  Zfti  Sk,  where  Np  oo  is  the  total  number  of  elementary 
square  plaquettes  in  the  lattice.  We  have,  for  example, 

00  Np 

Si  =  2  2  &p(n)  cos  nB,, ;  (38) 

n~  1  p=  1 


oo  Np 

S2  =  2  2  '  «')  cos  nBp  cos  it' Bp' 

n,n'=  1  p,p'=\ 

+  <£$'(«,  n ')  sin  nBp  S^n  n'Bp’]  >  (39) 

where  the  prime  on  the  sum  in  Eq.  (39)  excludes  the  term  with  p  =  p' .  For  this 
case  we  have  *S't  =  S,  and  the  inner  products  in  the  coupled  equations  of  Eqs.  (7) 
and  ( 8 )  are  simply  the  multiple  integrals  over  all  plaquette  variables  {Bp}  within 
the  range  -tt  <  Bp  <  tt.  We  also  note  that  the  multi-plaquette  correlation  operators 
{Sk}  have  a  close  relation  to  the  usual  Wilson  loops  Ar  mentioned  above.  For 
example,  the  simple  trigonometric  relation,  2  cos  Bt  cos  B2  =  cos(i?i  +  B2)  + 
cos( B[  -  B2)  corresponds  to  the  relationship  between  Wilson  loops  illustrated  in 
Figure  3.  Our  particular  parametrization  exemplified  by  Eqs.  (38)  and  (39)  is  not 
only  complete  but  is  also  especially  useful  in  view  of  the  orthonormality  of 
the  basis. 

It  should  now  be  immediately  clear  that  the  multi-plaquette  correlations  in  the 
exact  ground  state  |  ^0)  can  be  approximated  by  truncating  the  operator  S  using 
the  analogues  of  the  schemes  described  previously  for  the  spin-lattice  systems.  Fur¬ 
ther  subtruncations  are  also  possible  here,  in  terms  of  the  numbers  of  modes  { nt } 
kept  in  the  summations  in  Eqs.  (38)  and  (39)  and  their  higher-order  counterparts. 
For  example,  the  LSUB«(m)  scheme  ignores,  firstly,  all  terms  in  S  between  k  >  n 
plaquettes  and  between  k  <  n  plaquettes  if  these  k  plaquettes  occupy  a  region  which 
cannot  be  delimited  by  at  most  n  contiguous  plaquettes  on  the  lattice;  and,  secondly, 
all  terms  in  the  resulting  LSUB«  correlation  operator  with  E,  nt  >  m.  The  mode 


+ 


DO 


Figure  3.  Graphical  illustration  of  a  relationship  between  the  U(  1 )  Wilson  loops. 
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numbers  {  nt }  clearly  bear  a  direct  relationship  with  the  winding  numbers  of  the 
equivalent  Wilson  loops  retained  in  the  sum  for  S.  Thus,  it  should  also  be  clear 
how  to  generalize  our  approach  to  the  non- Abelian  SU(N)  lattice  gauge  theories. 

The  ground-state  energy  for  our  U(  1 )  model  in  2  +  1  dimensions  is  shown  in 
Figure  4  and  Table  III  as  a  function  of  X,  as  obtained  in  the  SUBl  and  LSUB2(m) 
schemes.  The  full  SUBl  calculation  is  completely  equivalent  to  the  one-body  Mathieu 
problem,  and  the  SUBl  (m)  subtruncations  are  found  to  converge  extremely  rapidly 
with  increasing  m  even  for  large  values  of  X.  We  also  include  for  comparison  some 
results  of  «th-order  strong-coupling  (X  0)  perturbation  theory  (denoted  as 
pth(S));  lowest-order  weak-coupling  (X  oo)  perturbation  theory  (pt(IT));  the 
results  from  an  analytic  continuation  of  the  strong-coupling  perturbation  series  due 
to  Hamer,  Oitmaa,  and  Zheng  (hoz)  [30];  and  those  from  the  f-expansion  cal¬ 
culation  of  Morningstar  [31].  One  sees  that  our  lsub2(w)  results  also  converge 
quickly  as  m  increases;  and  that  the  lsub2(  10)  results  are  in  very  good  agreement 
at  this  quite  low  order  of  approximation  with  those  from  the  careful  analysis  of 
Morningstar,  even  for  large  values  of  X  well  into  the  weak-coupling  regime. 

Discussion 

It  is  interesting  to  note  that  for  the  XXZ  spin-lattice  model  discussed,  the  lsubt? 
approximation  exactly  reproduces  the  results  of  wth-order  perturbation  theory  in 


Figure  4.  Ground-state  energy  per  plaquette  of  the  U(  1 )  model  on  the  square  lattice. 
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Table  III.  Ground-state  energy  per  plaquette  at  several  values  of  X  for  the  £/(l)  model  in  2  -I-  1 
dimensions.  Shown  are  the  results  of  various  lsub2(«)  schemes,  together  with  the  full  SUBl  results  and 
the  results  of  both  strong  (X  -►  0)  and  weak  (X  oo)  coupling  limits  in  perturbation  theory.  We  show 
the  results  of  4th-  and  8th-order  strong-coupling  perturbation  theory,  pt4(S)  and  pt8(S)  respectively, 
and  of  lowest-order  weak-coupling  perturbation  theory,  pt(W).  The  results  from  an  analytic  continuation 
of  the  strong-coupling  perturbation  series  by  Hamer,  Oitmaa,  and  Zheng  [30]  (denoted  as  HOZ),  and 
from  the  /-expansion  calculations  of  Momingstar  [31],  are  also  included  for  comparison  purposes. 


Method 

X 

0.5 

i 

2 

3 

4 

5 

6 

8 

9 

SUBl 

0.4391 

0.7724 

1.2430 

1.5828 

1.8597 

2.1000 

2.3156 

2.6966 

2.8688 

lsub2(2) 

0.4386 

0.7652 

1,1468 

1.1280 

0.3019 

-2.8326 

-15.108 

— 

— 

lsub2(3) 

0.4387 

0.7681 

1.2216 

1.5371 

1.7691 

1.9282 

2.0153 

1.9720 

1.8424 

lsub2(4) 

0.4387 

0.7681 

1.2214 

1.5428 

1.7994 

2.0123 

2.1921 

2.4585 

2.5568 

lsub2(6) 

0.4387 

0.7681 

1.2217 

1.5453 

1.8099 

2.0404 

2.2482 

2.6188 

2.7886 

lsub2(8) 

0.4387 

0.7681 

1.2217 

1.5454 

1.8100 

2.0404 

2.2477 

2.6142 

2.7797 

lsub2(10) 

0.4387 

0.7681 

1.2216 

1.5452 

1.8095 

2.0393 

2.2456 

2.6096 

2.7734 

HOZ 

— 

— 

1.215 

— 

1.785 

— 

2.2 

— 

— 

PT4(S) 

0.4387 

0.7690 

1.3042 

2.2898 

4.8667 

10.632 

21.638 

— 

— 

pt8(S) 

0.4387 

0.7673 

1.1358 

-0.7375 

-20.873 

— 

— 

— 

— 

pt(W) 

0.5627 

0.8434 

1.2402 

1.5447 

1.8015 

2.0276 

2.2321 

2.5917 

2.7596 

Momingstar 

— 

0.7675 

— 

— 

1.796 

— 

— 

— 

2.763 

the  limit  A  -►  oo  [  14] .  The  strong-coupling  ( X  ->  0)  perturbation  series  is  similarly 
reproduced  from  our  lsub n  equations  for  the  U(  1 )  lattice  gauge  theory  [22,24]. 
Thus,  our  lsub«  approximations  effectively  comprise  a  well-defined  analytic  con¬ 
tinuation  or  resummation  of  such  perturbation  series  within  the  context  of  a  rather 
natural  and  consistent  hierarchical  scheme.  From  the  numerical  results  obtained, 
the  CCM  approach  is  seen  to  be  far  superior  in  this  sense  to  the  alternative  rather 
ad  hoc  approaches  for  extending  the  range  of  validity  or  the  accuracy  of  similar 
perturbative  sequences,  which  are  usually  based  on  generalized  Pade  approximants 
or  similar  techniques. 

From  the  results  obtained  to  date  we  hope  to  have  convinced  the  reader  that  the 
CCM  approach  indeed  provides  a  unified  many-body  theory  of  quantal  Hamiltonian 
lattice  systems.  Obvious  extensions  then  present  themselves  as  worthy  of  study.  For 
the  spin-lattice  systems  it  is  of  interest  to  investigate  other  models,  such  as  the  spin- 
1  Heisenberg-biquadratic  chain  [16],  or  spin- 1  chains  with  both  nearest-neighbor 
and  next-nearest-neighbor  couplings.  In  this  context  the  use  of  alternative  model 
states  |  <£)  is  of  particular  interest.  For  the  lattice  gauge  models  we  are  also  interested 
in  extending  our  very  preliminary  work  done  so  far  in  the  vacuum  sector  on  the 
U(  1 )  excitation  gap  (or  glueball  mass)  [24]  and  on  the  non-Abelian  SU(2)  model. 
In  principle,  the  work  described  above  for  the  vacuum  sector  can  also  be  extended 
to  the  charged  (or  “colored”)  sector  by  the  further  inclusion  in  the  correlation 
operator  S  of  sums  over  open  paths  on  the  lattice,  representing  tubes  of  electric 
flux  between  staggered  fermions,  as  well  as  the  sums  over  closed  paths  (i.e.,  the 
Wilson  loops)  already  considered. 
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Finally,  the  quality  of  our  results  obtained  so  far  convinces  us  that  the  extension 
of  the  current  CCM  approach  to  additional  Hamiltonian  lattice  systems  is  well  jus¬ 
tified.  An  obvious  example  is  the  class  of  strongly  interacting  electronic  lattice 
systems,  including  such  topical  models  as  the  Hubbard  and  tJ  models  which,  despite 
intensive  recent  study  by  various  theoretical  techniques,  are  still  only  partially  un¬ 
derstood.  Preliminary  work  in  this  direction  has  been  reported  [19-21],  and  we 
hope  to  present  results  of  our  own  soon. 
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Abstract 

In  this  contribution  we  are  concerned  with  the  role  of  rotational  motion  in  multiphoton  processes 
induced  by  a  laser  field  of  high  intensity.  We  use  the  pseudospectral  split  operator  method  for  the 
propagation  of  the  quantum  wavefunction.  The  rotation  is  treated  by  decomposition  of  the  HF  wave- 
function  in  its  angular  momentum  components.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

The  phenomenon  of  infrared  multiphoton  excitation  (mpe)  and  dissociation 
(mpd)  is  one  of  the  many  interesting  spectroscopic  discoveries  of  the  previous  de¬ 
cades.  Irradiation  of  the  molecule  with  an  intense  laser  field  generates  excitation 
through  discrete  levels,  and  then  on  into  the  quasicontinuum  and  continuum.  In 
the  latter  regime  the  transitions  are  resonant,  but  in  the  former  much  depends  on 
the  level  spacing  in  the  system.  The  spacing  of  the  individual  discrete  levels  deter¬ 
mines  the  fraction  of  molecules  to  be  excited  up  to  the  quasicontinuum,  and  it  is 
therefore  key  to  the  efficiency  of  the  overall  process. 

The  above  does  not  apply  directly  to  diatomics  [1],  where  one  is  taken  directly 
from  discrete  levels  to  the  continuum.  And,  indeed,  calculations  have  shown  that, 
even  with  extremely  intense  monochromatic  lasers,  dissociation  probability  is  gen¬ 
erally  low  in  diatomics.  This  is  due  to  the  field  frequency  becoming  completely  off- 
resonance  as  one  ladders  through  the  anharmonically  spaced  higher  levels.  Several 
calculations  have  shown  that  the  application  of  two  fields  [2-4]  or  the  use  of  chirped 
fields  [5]  alleviates  the  off-resonance  condition  and  is  capable  of  producing  significant 
excitation  and  dissociation  rates. 

The  present  contribution  is  motivated  by  the  observation  that  many  of  the  studies 
of  infrared  mpe  in  diatomics  are  based  on  a  description  of  the  molecule,  containing 
only  the  internuclear  vibrational  coordinate.  In  a  full  description,  which  includes 
the  angles  specifying  the  orientation  of  the  molecular  axis  with  respect  to  the  field, 
the  transitions  can  be  vibration-rotational  with  Aj  =  0,  ±  1 .  Thus  the  anharmonic 
shift  in  the  vibrational  levels  can  be  compensated  by  the  addition  or  removal  of 
rotational  energy,  in  order  to  sustain  near-resonance  in  a  sequence  of  transitions. 
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The  reduced,  purely  vibrational,  description  eliminates  the  rotational  splitting  from 
the  level  scheme  and  in  that  way  conceivably  impacts  significantly  on  the  mpe 
process.  Our  objective  is  to  find  out  whether  this  is  indeed  the  case. 

We  investigate,  in  both  descriptions,  the  mpe  of  hf  irradiated  by  an  intense 
infrared  (IR)  laser,  using  a  pseudospectral  method  to  compute  the  time  evolution 
of  the  quantum  wavefunctions.  In  the  next  section  we  give  an  outline  of  the  method, 
focusing  on  the  features  related  to  the  treatment  of  the  angular  coordinates.  The 
results  of  our  calculations  and  their  interpretation  are  presented  in  the  third  section. 


Theoretical  Background 

The  pseudospectral  approach  to  the  calculation  of  the  vibrational  motion  of  a 
diatomic  is  by  now  well  established.  Several  methods  are  available,  i.e.,  second 
order  differencing  [6],  split  operator  [7],  and  Lanczos  [8].  We  will  use  the  split 
operator  method. 

The  time  evolution  associated  with  the  vibrational  nuclear  motion  in  an  external 
field  is  obtained  from  the  time-dependent  Schrodinger  equation 

d'E  -  . 

/ — (. r ,  t)  =  rnmR,  t)  =  [T+  o  (i) 

at 


with  kinetic  energy  and  potential  operators 


1  d2 
2m  dR2  ’ 


(2) 


V  =  V0  -  fx(R)E  cos (fi/). 


(3) 


This  equation  is  formally  solved  by  applying  the  time  evolution  operator  to  the 
initial  condition 


no  =  u{u  t0)nto). 


(4) 


where  U  is  to  be  constructed  using  the  Dyson  time-ordering  operator 

U(t,  t0)  =  T  exp^-z  J  ds  H(s)^j  .  (5) 

The  time  ordering  is  required  because  of  the  presence  of  a  time-dependent  term  in 
the  Hamiltonian. 

The  key  concept  in  the  split-operator  method  is  to  split  the  total  time  interval 
into  short  time  slices  and  to  approximate  the  evolution  operator  over  each  time 
slice 


U(U  t  +  e)  =  exp^-z  ^  fj  exp|— z  J  ds  F(s)j  exp |-i  ^  fj  (6) 

This  decomposition  is  correct  up  to  second  order  in  e .  The  power  of  this  algorithm 
lies  in  the  splitting  of  the  potential  and  kinetic  terms.  The  action  of  the  first  is 
evaluated  while  the  wavefunction  is  in  a  coordinate  representation,  i.e.,  defined 
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by  its  values  on  a  grid  in  coordinate  space.  For  the  second,  one  transforms  the 
wavefunction  to  the  momentum  representation,  using  the  discrete  Fourier  transform 
associated  with  the  grid.  The  kinetic  energy  operator  is  multiplicative  in  that  rep¬ 
resentation,  so  that  its  action  is  easily  evaluated.  Afterwards  one  transforms  the 
wavefunction  back  to  the  coordinate  representation  with  an  inverse  Fourier  trans¬ 
form.  This  is  a  viable  approach  because  of  the  availability  of  fast  Fourier  transform 
(FFT)  software  that  performs  these  transformations  very  efficiently.  The  decom¬ 
position  outlined  above,  including  the  elimination  of  the  time-ordering  operator, 
is  correct  up  to  second  order  in  epsilon  provided  that  the  potential  has  a  well- 
behaved  time  dependence  [9].  Higher-order  approximations  to  the  propagator  are 
also  possible  (see,  for  example,  [10]).  Their  use  allows  one  to  extend  the  time  slice 
epsilon  based  on  the  higher-order  precision,  but  on  the  other  hand  involves  more 
operations  per  time  step  due  to  the  presence  of  more  than  three  factors  [as  in 
formula  (6)]  in  the  approximate  propagator.  This  is  a  trade-off  issue,  involving  cpu- 
time  consumption,  accuracy,  and  programming  effort,  in  which  we  have  chosen  to 
use  the  second-order  scheme. 

When  all  degrees  of  freedom  are  included  in  the  description,  i.e.,  ^  R ,  0, 

0,  t)  with  (0,  0)  defining  the  molecular  orientation  with  respect  to  the  axis  of  the 
linearly  polarized  field,  the  Hamiltonian  modifies  to 

V(t)  =  V0 (R)  ~  n{R)E  cos  0  cos(Of)  (7) 

because  of  the  angular  dependence  on  the  external  field.  One  now  has  to  make  a 
choice  concerning  the  computational  approach  to  these  angular  coordinates.  One 
possibility  is  to  extend  the  grid  representation  of  the  wavefunction  to  the  angular 
domain  [11]  and  to  use  the  corresponding  transform  when  evaluating  the  angular 
kinetic  terms.  Another  possibility  is  to  use  a  basis  set  expansion  for  the  angular 
variables,  i.e., 

*  =  2  tUR,  t)Ylm{6 ,  0)  (8) 

lm 

Each  \pim  represents  the  vibrational  motion  in  the  (/,  m)  angular  momentum  channel. 
We  will  pursue  the  latter  approach  because  it  has  some  distinct  advantages  in  the 
particular  case  of  a  molecule  in  an  external  dipolar  field. 

In  the  basis  set  representation  the  time-dependent  Schrodinger  equation  couples 
the  various  angular  momentum  components: 

/%=  (9) 

ut  i 

where 

=  Eix(R)  <  Ylm\ cos  8 \Yi-m  >  cosffi;). 

Thus  the  wavefunction  is  now  a  vector  with  various  /-components  and  H  a  matrix 


1  d2 


/(/  +  1) 


“  r—  TTT?  +  — +  Vo(R) 


2m  dR 


R' 
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operator  coupling,  the  components.  It  is  diagonal  in  m  and  couples  only  nearest 
neighbor  /-components  (i.e.,  |  /  - 1’\  =  1).  The  strength  of  the  coupling  does  depend 
on  m  and  is  given  by  the  following  expression  using  3y'-symbols  [12]: 

< 

a  i  a//  i  v 

\0  0  0/\ — m  0  m 

The  matrix  structure  of  the  Hamiltonian  requires  only  minimal  modifications  of 
the  split  operator  method.  Again  one  has 

>ir(*  +  €)  =  l/(U  +  e)W  (H) 

or,  componentwise, 


Ut  +  e)  =  2  t  +  eWr(t)  (12) 

iv 


Uu*(t ,  t  +  e) 


expf-z  ~  t 


)  exp(“'  I 


ds  V(s)  1  expl -  T 


The  kinetic  energy  is  represented  by  a  diagonal  matrix  in  the  /-vector  space  and  as 
a  consequence  the  above  simplifies  to 


Uir(t,  t  +  e) 


(14) 


Thus  we  continue  to  use  transformations  to  momentum  space  to  evaluate  the 
kinetic  energy  factors  in  the  propagation  formula.  The  only  new  feature  is  that  for 
the  potential  term  one  now  has  to  exponentiate  a  matrix.  A  similar  technique  is 
used  in  the  pseudospectral  calculations  involving  two  or  more  electronic  states  with 
nonadiabatic  coupling  [13-15]. 


Results  and  Discussion 

In  our  calculations  we  have  used  the  hf  potential  V0  given  by  Coxon  and  Ha- 
jigeorgiou  [16].  It  is  defined  by  analytical  expressions  in  the  regions  of  small  and 
large  R  and  by  tensioned  cubic  spline  fits  to  experimental  data  for  the  intermediate 
region.  The  dipole  function  fi  was  taken  from  Zemke  et  al.  [17].  It  is  linear  at  small 
distances,  zero  for  R  >  20  a.u.,  and  in  between  it  is  defined  by  a  tensioned  cubic 
spline  fit  to  ab  initio  data.  The  electric  field  strength  E  and  the  field  frequency  Q 
are  the  two  variable  parameters  in  our  calculation.  We  consider  a  value  of  E  cor¬ 
responding  to  a  field  intensity  of  43.7  TW/cm2  as  in  [18].  The  values  of  £1  are  taken 
in  the  range  relevant  for  multiphoton  resonances  in  the  vibrational  spectrum,  i.e., 
from  0.015  to  0.021  a.u. 
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Throughout  the  calculations,  we  have  used  the  same  initial  condition  ^0,  viz., 
the  hf  ground  state  calculated  with  the  renormalized  Numerov  method  [19].  In 
the  full  description,  i.e.,  angular  variables  included,  the  rotational  quantum  numbers 
for  are  /  =  0,  m  =  0.  As  explained  in  the  previous  section,  only  the  latter  is  good 
quantum  number. 

For  detailed  results  of  the  purely  vibrational  calculations  we  refer  to  [20,21]. 
Here  we  will  focus  on  the  angular  momentum  features  of  the  rovibrational  calcu¬ 
lation.  In  view  of  the  strength  of  the  external  field,  a  significant  occupation  prob¬ 
ability  for  a  number  of  /-channels  can  be  expected  a  priori.  As  a  first  step,  we  need 
to  determine  which  /-channels  are  to  be  included  in  the  calculation.  To  this  end, 
we  have  performed  a  calculation  with  channels  up  to  /  =  7  and  found  that  starting 
with  1  =  4  the  maximum  probability  per  channel  is  less  than  5%  at  any  point  in 
time  and  lower  than  2%  on  the  average.  At  /  =  7,  those  numbers  have  dropped  by 
an  order  of  magnitude.  Thus  we  have  used  /  =  3  as  a  cutoff  in  all  subsequent 
calculations.  The  predominance  of  the  lower  angular  momentum  components  is 
illustrated  in  Figure  1 .  It  displays  the  time-averaged  expectation  value  of  the  Z2 
operator.  This  quantity  tends  towards  a  numerical  value  of  approximately  3.0  in¬ 
dicating  a  limited  presence  at  /  =  2  [where  /(/  +  1)  =  6]  and  higher. 

The  relevant  field  frequencies  for  transitions  from  the  ground  state  are  the  mul¬ 
tiphoton  frequencies  £ln  =  (Ev=nj= o  —  Ev=o,i=o)/n,  i.e.,  the  frequencies  at  which  the 
^-photon  process  is  in  resonance  with  the  transition  from  v  =  0  to  v  =  n  vibrational 
states.  Figure  2  shows  the  vibrational  occupation  probabilities  Pv  =  2/  Pvi  = 
2/  |  | 2  at  the  four-photon  frequency  0*  =  0.01695  a.u.  Again  the  rovi¬ 

brational  stationary  states  are  determined  using  the  renormalized  Numerov  method. 
One  clearly  notices  a  significant  amount  of  probability  at  v  =  4  during  the  time 
intervals  that  v  =  0  is  depleted.  The  presence  of  nonnegligible  probability  in  other 
vibrational  channels  is  consistent  with  the  picture  of  a  higher-order  transition  process 
that  proceeds  in  a  sequential  manner  [22].  In  order  to  ascertain  the  strength  of 


Figure  1.  Time  averaged  (Z2)  versus  averaging  interval.  Q  =  0.01695  a.u.  Intensity  - 
43.7  TW/cm2.  Time  in  fs. 
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Figure  2."  Vibrational  occupation  probabilities,  =  0.01695  a.u.  Intensity  =  43.7  TW/ 

cm2.  Time  in  fs. 


multiphoton  effects  over  a  broader  range  of  field  frequencies,  we  have  calculated 
the  time-averaged  absorbed  energy  as  a  function  of  the  frequency  U  of  the  external 
field.  The  result  is  shown  in  Figure  3  and  should  be  compared  to  Figure  4  with  the 
same  quantity  calculated  in  a  purely  vibrational  approach.  In  the  latter  one  can 
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Figure  3.  Time  averaged  energy  versus  frequency  of  the  driving  field  at  intensity  of  43.7 
TW/cm2  for  the  full  description.  Energy  and  frequency  in  a.u. 


distinguish  the  multiphoton  transitions  from  v  =  3  to  v  =  8  at  frequencies  Q3  - 
0.01735,  Q4  =  0.01695,  =  0.01620,  Q7  =  0.01583,  and  128  =  0.01546  a.u.  In  the 

comparison  of  Figures  3  and  4  two  things  stand  out.  There  is  an  overall  decrease 
in  the  absorbed  energy  and  a  more  pronounced  structure  of  the  dominant  peaks. 
We  attribute  the  first  effect  to  a  stronger  effective  field  strength  in  the  purely  vibra¬ 
tional  calculation.  Indeed,  in  the  rovibrational  calculation,  the  molecule  will  ex¬ 
perience  different  orientations  with  respect  to  the  field,  at  which  there  is  lower 
effective  interaction  strength  than  in  case  it  is  aligned  with  the  field.  Thus  a  lowering 
of  the  absorbed  energy  is  to  be  expected.  It  is  not  clear  how  to  define  field  strengths 
that  are  comparable  in  this  situation.  The  center  of  gravity  of  the  groups  of  transition 
frequencies  defined  by  Qv/  =  (Ev/  -  Ev=0,i=o)/v  correspond  to  the  four-  and  five- 


Figure  4.  Time  averaged  energy  versus  frequency  of  the  driving  field  at  intensity  of  43.7 
TW/cm2  for  the  vibrational  description.  Energy  and  frequency  in  a.u. 
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photon  peaks  in  Figure  4.  A  deeper  understanding  of  these  peaks  requires  an  /- 
resolved  analysis  of  the  data,  which  is  currently  under  way. 

Conclusion 

In  this  contribution  we  have  investigated  the  use  of  a  hybrid  method  for  computing 
the  time  evolution  of  hf  in  an  external  dipole  field.  The  method  combines  dis¬ 
cretisation  on  a  grid  for  the  vibrational  coordinate  and  a  basis  set  expansion  for 
the  angular  coordinate.  The  resulting  multi-/-component  wavefunction  is  propagated 
with  the  split-operator  method,  where  the  interaction  is  now  an  (/,/')  matrix.  This 
straightforward  approach  is  possible  because  the  external  dipole  field  is  an  /  =  1 
rotation  tensor.  As  a  consequence,  its  matrix  elements  in  the  basis  set  are  known 
analytically.  The  viability  of  this  approach  is  due  also  to  the  fact  that  a  low  cutoff 
value  can  be  used  in  the  /-expansion.  This  is  demonstrated  in  our  calculations. 
Because  of  this,  the  extension  of  the  vibrational  description  to  include  the  rotational 
motion  does  not  introduce  an  excessive  computational  burden. 
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Abstract 

A  computational  study,  using  relativistic  effective  core  potentials,  is  presented  of  transition  metal- 
main  group  multiply  bonded  complexes,  of  interest  in  the  context  of  catalysis  and  chemical  vapor  de¬ 
position  of  tm/mg  materials.  Model  d°  transition  metal  complexes  chosen  are  of  the  general  form  C1„ME 
where  M  =  Zr  (n  =  2),  Ta  (n  =  3),  and  W  (n  =  4).  Main  group  elements  of  interest  are  the  tetrels  (E  = 
C,  Si,  Ge,  Sn),  pnictogens  (E  =  N,  P,  As,  Sb),  and  chalcogens  (E  =  O,  S,  Se,  Te).  A  comparison  between 
calculated  metric  data  and  available  experimental  data  for  a  wide  range  of  tm =mg  complexes  will  help 
in  further  assessing  efficient  computational  approaches  to  tm  complexes,  particularly  of  the  heavier  MG 
elements,  as  a  function  of  metal,  ligand  and  level  of  theory.  In  the  present  work  restricted  Hartree  Fock 
(RHF)  and  Moller-Plesset  second  order  perturbation  theory  (mp2)  wavefunctions  were  employed.  In  most 
cases  there  are  small  differences  between  rhf  and  mp2  calculated  geometries,  with  both  methods  showing 
good  agreement  with  experimental  data,  suggesting  these  approaches  will  be  suitable  for  the  study  of 
larger,  more  experimentally  relevant  models.  Changes  in  ZrE  bond  lengths  for  E  =  chalcogen  (upon 
going  from  rhf  to  mp2)  suggest  a  fundamentally  different  description  between  the  Zr-oxo  bond  and 
heavier  chalcogens,  a  result  supported  by  recent  experimental  data  for  a  series  of  Zr-chalcogenidos.  To 
date  no  examples  have  been  reported  of  arsinidene  and  stibinidene  complexes.  Computational  results 
show  similar  behavior  among  the  heavier  pnictogen  complexes,  i.e.,  L„M  =  EH  (E  =  P,  As,  Sb),  suggesting 
that  strategies  used  to  synthesize  phosphinidenes  may  be  suitable  in  the  search  for  the  first  LnM^  AsR 
and  LnM  ==  SbR  complexes.  Additionally,  calculations  suggest  that  design  of  ligand  sets  which  yield 
linearly  coordinated  phosphinidenes  (and  presumably  As  and  Sb  analogues)  will  lead  to  phosphinidenes 
with  stronger  metal-pnictogen  bonds  and  increased  thermodynamic  stability  versus  nonlinearly  coor¬ 
dinated  examples.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Complexes  with  a  multiple  bond  between  a  transition  metal  (tm)  and  main 
group  (MG)  element  (tm  =  MG)  are  of  considerable  importance  [  1  ].  These  complexes 
participate  in  a  variety  of  catalytically  important  reactions  such  as  oxo  (LnM=0) 
mediated  alkane  oxidations  [2]  and  methane  activation  by  imidos  (LnM=NR) 
[3].  Multiply  bonded  complexes  are  also  important  in  advanced  materials  research, 
e.g.,  as  intermediates  in  chemical  vapor  deposition  (CVD)  of  solid  state  nitrides, 
oxides,  and  carbides  [4].  The  majority  of  tm=mg  complexes  contain  MG  elements 
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from  the  first  main  group  row  (i.e.,  C,  N,  and  O),  although  a  growing  number  of 
examples  involving  heavier  MG  elements  have  been  recently  reported  [5-7].  Multiply 
bonded  complexes  of  the  heavier  tetrels  (Group  IV  A)  are  extremely  rare,  with  few 
known  examples  for  all  tms  and  no  structurally  characterized  examples  for  early, 
high-valent  tms  [5].  Silylidenes  (LnM  =  Si(R)R')  have  been  proposed  as  interme¬ 
diates  in  silane  polymerization  [5b].  Stephan  et  al.  have  recently  developed  synthetic 
routes  into  phosphinidenes  (LnM— PR)  of  Zr  and  demonstrated  their  ability  to 
activate  C-H  bonds  [6b].  Sulfidos  (LnM  =  S)  have  been  envisaged  as  intermediates 
in  xanthine  oxidase  [7d]  and  catalytic  hydrodesulfurization  of  petroleum  feedstocks 
[7e].  Thus,  in  addition  to  fundamental  interest  in  multiple  bonding  of  the  heavier 
MG  elements,  TM  =  MGheavy  complexes  represent  a  nearly  untapped  resource  for 
the  inorganic  chemist  searching  for  novel  reactivity  [5-7]. 

The  growing  importance  of  TM  =  MGheavy  complexes  provides  strong  motivation 
for  computational  studies  of  their  bonding.  Computational  chemistry  methods  have 
become  almost  routine  for  the  lighter  MG  elements  [8],  but  numerous  opportunities 
remain  in  the  development  of  efficient  approaches  to  the  chemistry  of  the  d-,  f-, 
and  heavier  sp-block  elements.  A  variety  of  methods,  ranging  from  Extended  Hiickel, 
to  other  semiempirical  methods,  to  density  functional  theory,  have  been  imple¬ 
mented  to  meet  the  challenges  of  computational  d-  and  f-block  chemistry  [9].  Ef¬ 
fective  core  potentials  (ecps)  represent  another  promising  technique  for  studying 
interesting  chemical  problems  across  the  entire  Periodic  Table  [9,10].  Combined 
with  the  use  of  modern  techniques  for  treating  electron  correlation  and  emerging 
technologies  such  as  parallel  computing  [11],  ecps  appear  to  be  a  very  powerful 
method  for  dealing  with  inorganic  systems. 

The  present  work  is  a  continuation  of  computational  studies  of  TM  =mg  bonding 
and  reactivity  [12].  Model  d°  complexes  are  of  the  general  form  ClnME  where 
M  =  Zr  (n  =  2),  Ta  (n  =  3),  and  W  (n  =  4).  Main  group  elements  are  the  tetrels 
(E  -  C,  Si,  Ge,  Sn),  pnictogens  (E  =  N,  P,  As,  Sb),  and  chalcogens  (E  =  O,  S,  Se, 
Te).  A  comparison  between  calculated  metric  data  and  available  experimental  data 
for  a  wide  range  of  tm=mg  complexes  will  help  in  further  assessing  effi¬ 
cient  computational  approaches  to  more  experimentally  relevant  TM=MGheavy 
complexes. 


Computational  Methods 

Three  main  challenges  arise  in  computational  d-  and  f-block  chemistry:  large 
numbers  of  electrons  (many  of  them  core),  importance  of  electron  correlation,  and 
increased  relativistic  effects  for  heavier  elements  [9].  The  effects  of  electron  corre¬ 
lation,  particularly  on  the  calculation  of  geometries,  is  mitigated  to  a  large  degree 
by  the  empty  d  shell  in  the  complexes  studied  here.  Our  main  approach  to  the 
challenges  of  computational  d-  and  f-block  chemistry  is  the  design,  testing,  and  use 
of  effective  core  potentials  [12].  Great  savings  in  time,  memory,  and  disk  space  are 
effected  by  replacing  the  many  core  electrons,  and  the  basis  functions  describing 
them,  with  a  small  number  of  potentials  [10].  ecps  indirectly  address  the  correlation 
problem  by  reducing  resources  needed  for  other  parts  of  the  computational  exercise 
so  that  more  can  be  directed  towards  electron  correlation.  Transition  metal  ecps 
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are  generated  from  all-electron,  Dirac-Fock  calculations,  and  thus  incorporate 
Darwin  and  mass  velocity  relativistic  effects,  while  spin-orbit  coupling  is  averaged 
out  in  potential  generation  [10a].  From  a  chemical  standpoint,  ECP  methods  afford 
great  leeway  in  the  choice  of  interesting  problems,  particularly  with  respect  to  as¬ 
sessing  the  role  of  the  central  metal,  since  calculations  on  congeners  within  a  triad 
are  carried  out  with  near  equal  facility.  In  our  experience  [12]  with  the  Stevens 
relativistic  ECP  scheme  [10a]  we  have  found  no  degradation  in  accuracy  as  one 
descends  a  TM  triad  towards  the  heavier  metals  (for  which  relativistic  effects  will 
be  most  important). 

Calculations  employ  the  GAMESS  program  [11].  Effective  core  potentials  (ecps) 
and  valence  basis  sets  are  used  for  all  heavy  atoms  and  the  —  31G  basis  set  for  H 
[10a].  ecps  replace  the  innermost  core  orbitals  for  the  tms  and  all  core  orbitals  for 
main-group  (mg)  elements.  Thus,  the  ns,  np,  nd,  (n  +  l)s,  and  (n  +  l)p  are  treated 
explicitly  for  the  d-block;  for  the  main-group  ns  and  np  are  treated  explicitly  [10a]. 
Transition  metal  valence  basis  sets  are  quadruple  and  triple  zeta  for  the  sp  and  d 
shells,  respectively,  while  main-group  elements  have  a  double-zeta  valence  basis. 
Basis  sets  for  heavy,  main-group  elements  are  augmented  with  a  d  polarization 
function  [8].  Geometries  are  optimized  using  restricted  Flartree  Fock  (rhf)  and 
Moller-Plesset  second  order  perturbation  theory  (mp2)  wavefunctions  [13]  for 
closed-shell  singlets.  Core  electrons  are  not  included  in  the  mp2  expansion  since 
most  (other  than  the  ns  and  np  for  the  tms)  are  replaced  with  an  ECP.  Bond  lengths 
and  angles  for  TM  =  MG  complexes  are  typically  predicted  to  within  1-3%  of  ex¬ 
perimental  models  using  the  present  computational  scheme  [12]  involving  com¬ 
plexes  in  a  variety  of  geometries,  formal  oxidation  states,  and  metals  from  the  entire 
transition  series.  It  will  be  seen  below  that  in  the  majority  of  cases  very  good  agree¬ 
ment  is  obtained  between  calculated  and  experimental  metric  data  (even  for  Ta 
and  W  complexes)  using  these  relativistic  ecps. 

Results 


Zirconium  Complexes 

Until  recently,  few  TM = MG  complexes  of  the  Ti-triad  (Group  IVB)  metals  could 
be  found  which  were  stable  enough  to  be  structurally  characterized  [1].  Parkin  and 
Howard  have  reported  a  series  of  terminally-bonded  Zr  and  Hf  chalcogenidos  [7a], 
while  Arnold  and  Christou  [7b]  have  reported  Zr-  and  Hf-telluridos.  Some  work 
has  been  done  on  pnictogens  (Group  VA),  primarily  for  nitrogen  [1].  A  series  of 
ECP  studies  of  Zr-imido  chemistry  have  been  reported  [14].  Stephan  et  al.  have 
recently  characterized  the  first  Zr-phosphinidene  [6b].  To  date,  no  LnM  =  AsR  or 
LnM  =  SbR  complexes  have  been  synthesized.  An  X-ray  analysis  of  a  terminally 
bound  tetrelide  has,  to  our  knowledge,  not  been  reported  other  than  the  Zr-alkylidene 
of  Fryzuk  et  al.  [15].  An  ecp  study  of  Group  IVB  alkylidenes  and  silylidenes  has 
been  reported  [16]. 

The  Zr  complexes,  1-3,  are  of  the  form  Cl2ZrEHn(n  =  0,  1,  2).  Geometry  opti¬ 
mizations  for  Cl2ZrE  are  done  under  C2v  symmetry  to  facilitate  comparison  among 
the  various  complexes.  The  tetrelide  (1),  imido  (2,  E  =  N),  and  chalcogenido  (3) 
complexes  are  C2v  minima.  The  Zr — Cl  bond  lengths  range  from  2.40  to  2.45  A 
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(rhf)  and  2.37  to  2.43  A  (mp2),  and  are  consistent  with  a  mean  literature  value 
of  Zr — Cl  =  2.47  ±  0.06  A  (coordination  numbers  from  six  to  nine)  [17].  The 
Zr — Cl  bond  lengths  shorten  only  slightly  (<0.05  A)  upon  the  addition  of  correlation, 
suggesting  that  ZrCl  bonds  are  well  described  at  the  Hartree-Fock  level. 

Tetrelide  Complexes,  1.  Metric  data  for  Zr  tetrelides  (1)  are  collected  in  Table  I. 
The  calculated  ZrC  bond  length  ( 1 .99  A,  rhf;  1 .98  A,  mp2)  is  in  excellent  agreement 


Table  I.  Cl2ZrE  complexes. 


M/E/levela 

ZrE 

(A) 

ZrCl 

(A) 

EZrCl 

n 

Exp.  ZrEb 

Zr/CH2/RHF 

1.99 

2.43 

113 

2.024(4)  A 

Zr/CH2/MP2 

1.98 

2.40 

112 

Zr/SiH2/RHF 

2.54 

2.42 

112 

2.815(1)  A 

Zr/SiH2/MP2 

2.52 

2.37 

112 

Zr/GeH2/RHF 

2.57 

2.42 

112 

Zr/GeH2/MP2 

2.54 

2.37 

113 

Zr/SnH2/RHF 

2.77 

2.42 

111 

Zr/SnH2/MP2 

2.73 

2.37 

113 

Zr/NH/RHF 

1.79 

2.45 

117 

aver.  =  1.85  A 

Zr/NH/MP2 

1.83 

2.43 

114 

Zr/PH/RHF 

2.25 

2.41 

120 

2.505(4)  A 

Zr/PH/MP2 

2.28 

2.40 

117 

Zr/AsH/RHF 

2.33 

2.41 

121 

Zr/AsH/MP2 

2.37 

2.40 

118 

Zr/SbH/RHF 

2.53 

2.41 

121 

Zr/SbH/MP2 

2.55 

2.38 

119 

Zr/O/RHF 

1.66 

2.44 

119 

1.804(4)  A 

Zr/0/MP2 

1.78 

2.42 

113 

Zr/S/RHF 

2.20 

2.41 

118 

aver.  =  2.32  A 

Zr/S/MP2 

2.23 

2.40 

114 

Zr/Se/RHF 

2.32 

2.40 

118 

2.480(1)  A 

Zr/Se/MP2 

2.37 

2.39 

115 

Zr/Te/RHF 

2.55 

2.41 

119 

aver.  =  2.69  A 

Zr/Te/MP2 

2.61 

2.39 

115 

9 

a  All  complexes  are  C2v,  see  1-3;  M/E/level  notation  denotes  the  transition  metal,  ligand,  and  level  of 
theory  (rhf  or  mp2),  respectively. 

b  Experimental  ME  bond  lengths,  where  available,  are  reported.  Where  more  than  one  bond  length  is 
available  the  average  is  reported.  Note  the  ZrSi  value  corresponds  to  a  single  bond.  See  text  for  further 
details. 
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with  the  value  of  ZrC  =  2.024(4)  A  for  a  structurally  characterized  Zr-alkylidene 
[15].  The  ZrSi  bond  in  Cl2ZrSiH2  (2.54  A,  RHF;  2.52  A,  mp2)  and  the  ZrGe  bond 
in  Cl2ZrGeH2  (2.57  A,  rhf;  2.54  A,  mp2)  are  considerably  shorter,  as  expected, 
than  the  ZrSi  and  HfGe  single  bonds  reported  for  Cp2Zr(Si(TMS)3)Me  (2.815(1) 
A,  [18])  and  Cp*Cl2HfGe(TMS)3),  (Cp*  =  v5  ~  C5Me5;  TMS=SiMe3),  (2.740(1) 
A,  [19]),  respectively.  The  ZrSn  bond  length  in  Cl2ZrSnH2  is  2.77  A  at  the  rhf 
level,  contracting  slightly  at  the  mp2  level  (2.73  A).  The  relative  bond  lengths 
for  heavier  tetrelides  mirror  the  differences  in  covalent  radii  (rcov(Si)  =  1.17  A; 
rCOv(Ge)  =  1.22  A;  rcov(Sn)  =  1.40  A  [20]).  However,  the  covalent  radius  of  C  is 
0.77  A,  and  so  the  ZrC  bond  in  the  alkylidene  is  expected  to  be  «0.40  A  shorter 
than  the  silylidene;  however,  the  difference  is  closer  to  0.5  A  (Table  I).  Similar 
comparisons  can  be  made  between  Cl2ZrCH2  and  its  germilydene  and  stannylidene 
analogues.  The  differences  in  ME  bond  lengths  for  the  heavier  tetrels  mirror  dif¬ 
ferences  in  covalent  radii.  These  data  suggest  that  Zr  d7r-C  px  bonding  is  consid¬ 
erably  stronger  than  in  the  heavier  congeners,  but  that  the  strength  of  metal-ligand 
x-bonding  is  similar  among  the  heavier  tetrelides. 

Pnictogen  Complexes,  2.  Geometric  data  for  the  C2v  structures  of  Cl2ZrEH  (E  = 
N,  P,  As,  Sb),  are  listed  in  Table  I.  The  mp2  values  calculated  for  ME  bond  lengths 
are  slightly  larger  than  the  RHF  values,  but  by  no  more  than  0.04  A.  The  lengthen¬ 
ing  in  ME  upon  going  from  RHF  to  mp2  contrasts  to  the  contraction  seen  for  the 
Zr-tetrelides  ( vide  supra),  but  the  differences  are  small  in  both  cases.  As  with  the 
tetrelides,  the  small  changes  in  ME  bond  lengths  upon  going  from  rhf  to  mp2 
wavefunctions  suggest  that  modeling  of  the  heavier  pnictogen  complexes  of  the 
form  LnM=ER  with  rhf  and  mp2  wavefunctions  is  appropriate.  The  calculated 
Zr-imido  bond  lengths,  1.79  A  (rhf)  and  1.83  A  (mp2),  are  in  good  agreement 
with  reported  data:  Zr-imido  =  1.826(4)  A  (Cp2Zr(THF)(  =  N — t — Bu)  [3d]  and 
1.868(3)  A  (Zr(NHAr)2(py')2(=NAr))  [3e].  The  calculated  ZrP  bond  length  in 
Cl2ZrPH  (2.25  A,  rhf;  2.28  A,  mp2)  is  much  shorter  than  the  Zr-phosphinidene 
bond  length  (2.505(4)  A)  in  Cp2Zr(PMes')(PMe3),  Mes'  =  2,4,6-tri—t— butyl- 
phenyl  [6b].  The  phosphinidene  in  Cp2Zr(PMes')(PMe3)  is  coordinated  in  a  bent 
fashion  (Zr=P— C  «  1 16.1(4)°)  despite  the  bulk  of  the  Mes',  suggesting  a  strong 
driving  force  for  a  ZrP  bonding  description  which  is  closer  to  a  double  bond  (with 
a  stereochemically  active  lone  pair  on  P).  The  C2v  symmetry  constraint  was  relaxed 
for  an  MP2  optimization  of  Cl2ZrPH.  A  Cs  structure  (5.0  kcal  moC1  lower  in  energy 
than  C2v — Cl2ZrPH)  is  found  with  a  ZrP  bond  length  of  2.38  A,  or  0.1  A  longer 
than  C2v — Cl2ZrPH,  thus  much  closer  to  experiment  [6b]. 

Chalcogenido  Complexes ,  3.  The  geometrical  characteristics  of  the  chalcogenidos 
Cl2ZrE  (3,  E  =  O,  S,  Se,  Te)  were  computed  (Table  I);  in  all  cases,  the  C2v  structure 
is  a  minimum.  The  predicted  ZrE  bond  lengths  compare  reasonably  well  with 
experimental  values  [7a].  The  oxo  provides  the  best  agreement  with  experiment 
[7a]:  ZrO  =  1.804(4)  A  (ZrO  =  1.66  A  (rhf),  1.78  A  (mp2)).  The  Zr  sulfido, 
selenido,  and  tellurido  experimental  bond  lengths  in  (Cp')2Zr(E)(py),  (Cp'  = 
7j5 — C5Me4Et),  are  2.334(2)  A,  2.480(1)  A,  and  2.729(1)  A,  respectively,  all  ap¬ 
proximately  7%  longer  than  the  rhf  values  (Table  I)  [7a].  A  related  zirconocene 
sulfido,  Cp*Zr(4 — t — Bu — py)(S)  [21],  has  a  similar  Zr-sulfido  bond  length,  2.316(1) 
A,  to  that  found  in  (Cp')2Zr(S)(py)  [7a].  However,  a  Zr-tellurido  without  Cp  ligands, 
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Zr(Te)(sitel)2(dmpe)2,  has  been  reported  [7b]  and  shows  a  ZrTe  bond  length  to  the 
terminal  tellurido  of  2.650(1)  A,  or  ?=*0.08  A  shorter  than  that  in  (Cp')2Zr(Te)(py) 
[7a].  Replacement  of  the  Cl  with  Cp  to  yield  Cp2ZrE  has  been  shown  to  lengthen 
Zr-chalcogenido  bond  lengths  by  5%  [22],  in  much  better  agreement  with  experiment 
[7a].  Thus,  it  must  be  noted  that  multiple  bonds  between  transition  metals  and 
main  group  elements  are  quite  sensitive  to  the  nature  of  the  tm’s  coordination 
environment. 

Tantalum  Complexes 

Multiply-bonded  complexes  of  Ta  occupy  a  special  place  in  the  history  of 
tm  =  MG  chemistry.  The  synthesis  and  characterization  of  a  Ta-alkylidene  by 
Schrock  is  the  first  example  of  a  d°  carbene  complex  [  1  ].  To  date,  no  heavier  tetrelide 
analogues  have  been  reported.  A  large  number  of  Ta-imidos  have  been  studied,  in 
part  due  to  their  intermediacy  in  cvd  of  TaN  from  organometallic  precursors 
[4c, 23].  Schrock  et  al.  have  reported  the  crystal  structure  of  a  Ta-phosphinidene 
[6c],  Ta(PCy)[N(TMS) — CH2CH2 — ]3N  (Cy  =  cyclohexyl,  TMS  =  SiMe3),  one  of 
only  five  structurally  characterized  phosphinidenes  [6b-f]  and  only  one  of  two 
examples  with  linear  PR  coordination  [6c, d].  Ta-oxos  are  extremely  rare;  the  com¬ 
plex  Ta(0)(N — i — Pr2)3  is  the  only  Ta-oxo  whose  structure  is  reported  in  the  com¬ 
pendium  by  Nugent  and  Mayer  [1].  A  few  examples  of  mono-sulfido  complexes 
of  Tav  have  been  structurally  characterized,  e.g.,  STa(S2CNEt2)3  [24].  Arnold  and 
Christou  have  reported  Ta(E)[N(TMS) — CH2CH2 — ]3N,  E  =  Se,  Te  [25]. 

The  Ta  complexes  studied  are  of  the  form  Cl3TaEHn(n  =  0,  1,  2),  4-6.  The 
geometry  optimizations  for  Cl3TaE  (6)  and  Cl3TaEH  (5)  complexes  are  done  under 
C3v  symmetry,  while  Cl3TaEH2  (4)  complexes  are  Cs  symmetry  (Table  II).  The 
Ta-chalcogenidos  and  Cl3TaNH  are  C3v  minima,  while  the  Ta-tetrelides  are  Cs 
minima.  For  the  tetrelides,  4,  Ta-Cla  bond  lengths  range  from  2.35  to  2.38  A 
(rhf)  and  2.30  to  2.32  A  (mp2).  The  Ta-Clb  bond  lengths  range  from  2.31  to  2.33 
A  (rhf)  and  2.27  to  2.29  A  (mp2).  The  Ta-Cl  bond  lengths  for  the  pnictogen  (5) 
and  chalcogenido  (6)  complexes  range  from  2.30  to  2.33  A  (rhf  and  mp2).  The 
mean  Ta-Cl  bond  length  for  83  reported  crystal  structures  of  six-coordinate  Ta 
complexes  (the  smallest  coordination  number  reported)  is  2.383  ±  0.055  A  [17]. 

Tetrelide  Complexes,  4.  The  metric  data  for  Ta-tetrelides,  4,  are  listed  in 
Table  II.  The  calculated  TaC  bond  length  in  Cl3TaCH2  (1.90  A  (rhf) 
and  1 .92  A  (mp2))  is  in  excellent  agreement  with  the  only  published  Ta-alkylidene 
bond  length  for  a  four-coordinate  Ta-alkylidene,  i.e.,  1.888(29)  A  for 
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Table  II.  Cl3TaE  complexes. 


M/E/levela 

TaE 

TaCla 

(A) 

TaClb 

(A) 

ETaCla 

n 

ETaClb 

(°) 

Exp.  TaEb 

Ta/CH2/RHF 

1.90 

2.35 

2.31 

106 

105 

1.888(29)  A 

Ta/CH2/MP2 

1.92 

2.32 

2.29 

105 

106 

Ta/SiH2/RHF 

2.41 

2.37 

2.32 

108 

103 

Ta/SiH2/MP2 

2.40 

2.31 

2.27 

109 

103 

Ta/GeH2/RHF 

2.46 

2.37 

2.32 

108 

102 

Ta/GeH2/MP2 

2.43 

2.30 

2.27 

110 

104 

Ta/SnH2/RHF 

2.67 

2.38 

2.33 

109 

101 

Ta/SnH2/MP2 

2.63 

2.30 

2.27 

112 

103 

Ta/NH/RHF 

1.72 

2.33 

108 

aver.  =  1.77  A 

Ta/NH/MP2 

1.78 

2.32 

106 

Ta/PH/RHF 

2.15 

2.32 

111 

2.145(7)  A 

Ta/PH/MP2 

2.21 

2.31 

108 

Ta/AsH/RHF 

2.23 

2.32 

111 

Ta/AsH/MP2 

2.31 

2.31 

108 

Ta/SbH/RHF 

2.43 

2.32 

112 

Ta/SbH/MP2 

2.53 

2.33 

108 

Ta/O/RHF 

1.67 

2.31 

108 

1.725(7) 

Ta/0/MP2 

1.74 

2.31 

106 

Ta/S/RHF 

2.12 

2.30 

109 

aver.  =  2.18  A 

Ta/S/MP2 

2.17 

2.30 

107 

2.330(1)  A 

Ta/Se/RHF 

2.24 

2.30 

110 

Ta/Se/MP2 

2.31 

2.30 

107 

Ta/Te/RHF 

2.47 

2.30 

110 

2.568(1)  A 

Ta/Te/MP2 

2.54 

2.31 

107 

a  The  pnictogen  (5)  and  chalcogen  (6)  complexes  are  C3v;  the  tetrelides  are  Cs  (4).  The  M/E/level 
notation  denotes  the  transition  metal,  ligand,  and  level  of  theory  (rhf  or  MP2),  respectively.  See  4  for 
definitions  of  Cla  and  Clb . 

b  Experimental  ME  bond  lengths,  where  available,  are  reported.  Where  more  than  one  bond  length  is 
available,  the  average  is  denoted.  See  text  for  further  details. 


Ta( = C(H)TMS)(CH2TMS)(0 — 2,6-di — t — Butyl — phenyl)  [1].  The  Ta-tetrelide 
bond  lengths  for  the  heavier  members  are  TaSi  =  2.41  (rhf),  2.40  (mp2);  TaGe  = 
2.46  A  (RHF),  2.43  A  (mp2);  TaSn  =  2.67  A  (RHF),  2.63  A  (mp2).  A  Ta-silyl  has 
recently  been  characterized  [26];  the  TaSi  bond  length  in  the  formally  single  bonded 
complex  is  2.680(15)  A,  providing  an  upper  limit  for  the  Ta-silylidene  bond  length. 
To  our  knowledge,  no  Ta=Ge  or  Ta  =  Sn  complexes  have  been  reported.  The 
mp2  calculated  TaE  bond  lengths  are  slightly  less  than  the  rhf  values,  except  for 
the  alkylidene.  Similar  behavior  (except  for  the  alkylidene)  is  observed  for  the  Zr 
complexes.  However,  the  variance  between  calculated  tm=mg  bond  lengths  at 
the  two  levels  of  theory  (Table  II),  going  from  the  alkylidene  (Arhf/mp2  =  0.02 
A)  to  the  stannylidene  (Arhf/mp2  =  0.04  A)  is  very  small,  suggesting  that  electron 
correlation  is  not  as  important  for  the  Ta-tetrelides. 

Pnictogen  Complexes,  5.  Metric  data  for  Cl3TaEH  (5),  are  collected  in  Table  II. 
Differences  between  rhf  and  mp2  bond  lengths  increases  as  the  pnictogen  becomes 
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heavier,  a  difference  much  greater  than  for  Ta-tetrelides.  The  Ta-imido  shows  the 
least  variance  (Arhf/mp2  =  0.06  A)  and  the  Ta-stibinidene  changes  the  most  upon 
the  inclusion  of  correlation  (Arhf/mp2  =  0. 10  A)  (Table  II).  The  results  demonstrate 
the  growing  importance  of  electron  correlation  for  heavier  MG  elements  when  de¬ 
scribing  a  series  of  related  bonds,  although  the  magnitude  of  the  lengthening  is  4% 
or  less.  The  calculated  Ta-imido  bond  lengths  (1.72  A  (rhf)  and  1.78  A  (mp2)) 
are  consistent  with  literature  values  for  four-coordinate  Ta-imidos:  1.77(2)  A 
(Ta(N — t — Bu)(NMe2)3),  1.763(6)  A  (Ta(N— t— Bu)(NTMS2)2C1),  and  1.776(8)  A 
(Ta(NAr)(NArC2H4Ph)(OAr)2,  Ar  =  2,6-C6H3Me2)  [1].  The  TaP  bond  length  in 
Cl3TaPH  (2.15  A  (rhf)  and  2.21  A  (mp2))  is  in  agreement  with  the  value  of 
2.145(7)  A  for  Ta(PCy)[N(TMS)— CH2CH2— ]3N  [6c],  which  is  linearly  coordi¬ 
nated,  Ta-P-C  ft*  170(1)°,  and  thus  closer  to  a  TaP  triple  bond  description.  The 
calculated  TaAs  bond  length  is  2.23  A  (rhf)  (2.31  A,  mp2),  and  the  TaSb  bond 
length  in  Cl3TaSbH  is  2.43  A  (rhf)  (2.53  A,  mp2).  The  differences  in  covalent  radii 
[20]  for  the  heavier  pnictogens  (rcov(As)  -  rC0V(P)  =  0. 1 1  A;  rcov(Sb)  -  rcov(As)  = 
0.31  A;  rC0V(Sb)  -  rcov(As)  =  0.20  A)  mirror  the  calculated  Ta-pnictogen  bond  lengths 
at  both  the  rhf  and  mp2  levels  (Table  II).  As  before,  this  suggests  that  the  description 
of  the  TM-pnictogen  bond  is  similar  among  the  heavier  pnictogen  complexes,  so 
that  potential  strategies  for  stabilizing  phosphinidenes  should  be  applicable  to  AsR 
and  SbR  analogues. 

Chalcogenido  Complexes,  6.  Calculated  metric  data  for  the  Ta-chalcogenides,  6, 
are  collected  in  Table  II.  The  Ta-oxo  complex  Cl3TaO  has  a  bond  length  of  1.67 
A  (rhf)  (TaO  -  1.74  A,  mp2)  in  agreement  with  the  value  of  1.725(7)  A  for 
Ta(0)(N — i — Pr2)3  [2].  The  TaS  bond  lengths  (2.12  A  (rhf)  and  2.17  A  (mp2)) 
correspond  well  with  the  average  of  the  eight  reported  terminal  TavS  bond  lengths, 
2.18  A  [24].  The  Ta-chalcogenido  bond  lengths  for  Cl3TaSe  (2.24  A,  rhf;  2.31  A, 
mp2)  and  Cl3TaTe  (2.47  A,  rhf;  2.54  A,  mp2)  are  in  agreement  with  those  for 
T a(E)[N(TMS) — CH2CH2 — ] 3N,  TaSe  =  2.330(1)  A  and  TaTe  =  2.568(1)  A  [25]. 
Experimental  Ta-chalcogenido  bond  lengths  are  closer  to  mp2  calculated  values 
(aver,  error  «  1%)  than  the  RHF  values  (aver,  error  «  4%),  suggesting  that  electron 
correlation  at  the  MP2  level  may  be  more  appropriate  for  these  chalcogenidos. 
However,  other  than  Ta(0)(N — i — Pr2)3,  all  experimental  Ta-chalcogenidos  have 
coordination  numbers  greater  than  our  four-coordinate  models.  Lengthening  of 
the  Ta-chalcogenido  bond  upon  going  from  an  rhf  to  MP2  wavefunction  is  ap¬ 
proximately  the  same  for  all  chalcogens  (Arhf/mp2  »  0.07  A). 

Tungsten  Complexes 

Multiply-bonded  complexes  of  the  Group  IVB  metals  are  well  known  [1].  High- 
valent  (d°)  tm  =  MG  complexes  are  somewhat  rarer  than  lower  valent  examples 
and  are  primarily  limited  to  the  first  row  main  group  elements  [1].  Trans- 
W(Te)2(PMe3)4  represents  the  first  terminal  tellurido  [27];  Parkin  and  Rabinovich 
have  subsequently  characterized  a  series  of  related  bis(chalcogenido)  complexes 
[27b].  An  interesting  series  of  d°  chalcogenidos,  C14WE  (E  =  O,  S,  Se),  have  been 
studied  by  Page  et  al.  [28]  using  gas-phase  electron  diffraction.  Mononuclear,  ter¬ 
minal  W  complexes  of  the  heavier  pnictogens  and  tetrels  are  very  rare  [6d-f]. 
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The  geometry  optimizations  of  C14WE  (7-9),  Table  III,  for  the  pnictogens  (8) 
and  chalcogens  (9)  are  done  under  the  constraints  of  C4v  symmetry.  The  chalco- 
genidos  (9)  and  C14WNH  are  C4v  minima.  The  tetrelides  (7)  possess  Cs  minima. 
Calculated  W— Cl  bond  lengths  range  from  2.28  A  to  2.45  A  (rhf),  and  from  2.28 
A  to  2.50  A  (mp2)  (Table  III).  The  average  value  for  all  W — Cl  bonds  (rhf  and 
mp2  values)  is  2.34  ±  0.05  A,  consistent  with  the  mean  experimental  value  of 
2.39  ±  0.03  A  for  W— Cl  (CN  =  5)  [17]. 

Tetrelide  Complexes,  7.  The  geometries  of  the  tetrelides  C14WEH2(E  =  C,  Si, 
Ge,  Sn),  7,  were  computed  (Table  III).  The  calculated  carbon-metal  bond  of  1.87 
A  (RHF)  is  in  quite  good  agreement  with  the  W-alkylidene  bond  lengths  of  1 .882(  1 4) 
A  and  1.942(9)  A  reported  for  five-coordinate  d°  complexes  [1].  The  WC  bond  in 
CI4WCH2  shortens  by  0.0 1  A  upon  going  from  the  rhf  to  mp2  level,  thus  remaining 
in  good  agreement  with  the  experimental  models.  To  our  knowledge,  no  structural 
analysis  of  a  W-silylidene  has  been  reported.  The  complexes  [W(CO)5]3E,  10,  (E  = 
Ge,  Sn)  have  been  structurally  characterized  [5a].  The  shorter  of  the  W-tetrel  bonds 
(denoted  with  a  double  bond  in  10)  are  2.5 1 1(1)  A  and  2.702(2)  A  for  the  Ge  and 
Sn  complexes,  respectively  [5a].  Calculated  values  for  C14WEH2  are  WGe  =  2.45 
A  (rhf)  and  2.40  A  (mp2),  and  WSn  =  2.70  A  (rhf)  and  2.56  A  (mp2),  in  agreement 
with  those  found  in  complexes  10.  In  both  cases  the  rhf  values  are  closer  to  ex¬ 
periment  [5a],  although  any  designation  of  bond  character  in  these  complexes  is 
tenuous.  The  differences  in  the  WE  bond  lengths  in  10  is  0.19  A,  a  value  which  is 
intermediate  between  the  difference  between  the  RHF  (Sn  -  Ge  =  0.25  A)  and  mp2 
(Sn  —  Ge  =  0.16  A)  WE  bond  lengths  for  C14WEH2  (Table  III).  Note  that  the  mp2 
difference  in  W-tetrelide  bond  length  is  closer  to  the  difference  in  the  covalent  radii 
of  Ge  and  Sn  (diff.  =  0. 1 8  A,  [20]). 

Pnictogen  Complexes.  The  calculated  WN  bond  length  in  C4v  C14WNH  (1.66  A, 
rhf;  1 .75  A,  Mp2)  is  in  agreement  with  the  range  of  bond  lengths  (WN  =  1.736(5)- 
1 .757(5)  A,  [  1  ])  for  square  pyramidal  W-imidos.  Cowley  et  al.  [6d]  have  characterized 
a  linearly  coordinated  (W— P— C  =  168.2(2)°)  W(IV)-phosphinidene  with  WP  = 
2.169(1)  A,  in  between  the  rhf  value  of  WP  =  2.08  A  and  the  mp2  value  of 
WP  =  2.20  A  for  C4v-C14WPH.  It  is  interesting  to  note  that  a  nonlinearly  coor¬ 
dinated  W(IV)-phosphinidene,  Cp2W(PMes'),  has  a  substantially  longer  WP  bond 
length,  WP  =  2.349(5)  A  [6f].  The  WE  bond  lengths  in  Cl4WAsH  and  Cl4WSbH 
differ  by  ^0.20  A  at  the  rhf  and  mp2  levels  (Table  III).  As  with  the  other  C4v- 
CI4WEH  complexes  the  WE  bond  lengthens  by  «0.12  A  upon  the  inclusion  of 
correlation  at  the  mp2  level.  Complexes  of  WAs  and  WSb  with  any  bond  order  are 
rare;  Huttner  et  al.  [29]  have  reported  [W(CO)5]3E2  (E  =  As,  Sb),  11.  Average  WE 
bond  lengths  in  11  are  2.83  A  (E  =  As)  and  3.02  A  (E  =  Sb).  The  assignment  of  a 
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(CO)5 

W 


w 

(CO>5 


;E- - w(co)5 


10 


E  =  Ge,  Sn 


WE  bond  order  of  one  to  11  is  not  clear,  but  the  bond  lengths  should  be  an  upper 
limit  to  the  W-pnictogen  bond  lengths  in  Cl4WAsH  and  Cl4WSbH.  Additionally, 
the  difference  in  W-pnictogen  bond  lengths  in  11  (WSb  —  WAs  =  0.19  A)  is  con¬ 
sistent  with  that  calculated  for  Cl4WAsH  and  Cl4WSbH,  further  proof  that  bonding 
descriptions  among  the  heavier  pnictogens  are  similar. 


Table  III.  C14WE  complexes. 


WCla 

WClb 

EWCla 

EWClb 

M/E/levela 

WE 

(A) 

(A) 

(°) 

(°) 

Exp.  WEb 

W/CH2/RHF 

1.87 

2.40 

2.28 

98 

106 

aver.  =  1.91  A 

W/CH2/MP2 

1.86 

2.35 

2.28 

101 

105 

W/SiH2/RHF 

2.37 

2.40 

2.30 

102 

87 

W/SiH2/MP2 

2.44 

2.50 

2.29 

124 

59 

W/GeH2/RHF 

2.45 

2.43 

2.29 

96 

94 

2.511(1)  A 

W/GeH2/MP2 

2.40 

2.35 

2.36 

102 

72 

W/SnH2/RHF 

2.70 

2.45 

2.28 

89 

106 

2.702(2)  A 

W/SnH2/MP2 

2.56 

2.28 

2.31 

83 

112 

W/NH/rhf 

1.66 

2.34 

104 

aver.  =  1.75  A 

W/NH/MP2 

1.75 

2.34 

102 

W/PH/RHF 

2.08 

2.35 

105 

2.169(1)  A 

W/PH/MP2 

2.20 

2.34 

103 

W/AsH/rhf 

2.17 

2.35 

105 

W/AsH/mp2 

2.29 

2.34 

103 

W/SbH/RHF 

2.37 

2.36 

105 

W/SbH/MP2 

2.50 

2.33 

103 

W/O/RHF 

1.63 

2.32 

104 

1.685(15)  A 

W/0/MP2 

1.73 

2.33 

102 

2.086(6)  A 

W/S/RHF 

2.07 

2.32 

105 

W/S/MP2 

2.16 

2.34 

102 

W/Se/RHF 

2.17 

2.33 

105 

2.203(4)  A 

W/Se/MP2 

2.30 

2.34 

102 

W/Te/RHF 

2.40 

2.33 

105 

W/Te/MP2 

2.53 

2.36 

102 

a  The  pnictogen  (8)  and  chalcogen  (9)  complexes  are  C4v;  the  tetrelides  are  Cs  (7).  The  M/E/level 
notation  denotes  the  transition  metal,  ligand,  and  level  of  theory  (rhf  or  mp2),  respectively.  See  7  for 
definitions  of  Cla  and  Clb. 

b  Experimental  ME  bond  lengths,  where  available,  are  reported.  Where  more  than  one  bond  length  is 
available,  the  average  is  given.  See  text  for  further  details. 
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The  Chalcogen  Complexes.  The  C14WE  complexes  (E  =  O,  S,  Se,  Te)  provide 
an  exciting  opportunity  to  compare  ECP  calculated  geometries  (rhf  and  mp2 
wavefunctions)  with  experimental  gas-phase  data  [28].  The  C14WE  complexes  are 
calculated  to  be  C4v  minima  which  supports  the  conclusions  of  Page  et  al.  based 
on  electron  diffraction  (ED)  [28].  The  WC1  bonds  lengthen  by  0.01  A  (oxo)  to  0.03 
A  (tellurido)  upon  going  from  an  rhf  to  MP2  wavefunction,  and  thus  remain  in 
excellent  agreement  with  experimental  values,  WC1  =  2.28  A  [28].  The  angle  between 
the  WE  and  WC1  bonds  contract  by  2°  (E  =  O)  or  3°  (E  =  S,  Se,  Te).  Experimental 
EWC1  bond  angles  are  102°  (E  =  O),  104°  (E  =  S),  and  104°  (E  =  Se)  in  good 
agreement  with  the  rhf  and  MP2  values.  The  reported  experimental  values  of  WE 
in  CI4WE  are  1.685(15)  A  (E  =  O),  2.086(6)  A  (E  =  S),  and  2.203(4)  A  (E  =  Se) 
[28].  The  WE  bond  length,  as  expected,  shows  the  greatest  change  upon  the  inclusion 
of  electron  correlation,  rhf  values  are  shorter  than  experimental  values  by  «0.05 
A.  On  average  the  mp2  calculated  WE  bond  lengths  (Table  III),  are  0.1 1  A  longer 
than  the  RHF  calculated  WE  bond  lengths,  so  that  experimental  W-chalcogen  bond 
lengths  in  C14WE  fall  in  between  the  rhf  and  mp2  calculated  values.  Thus,  the 
mp2  wavefunction  tends  to  slightly  overcorrect  for  WE  bond  lengthening.  However, 
the  differences  in  the  RHF  and  mp2  optimized  geometries  are  small  and  thus  both 
levels  of  theory  yield  good  agreement  with  the  ED  data  of  Page  et  al.  [28]. 


Summary  and  Conclusions 

Several  points  of  interest  were  noted  in  this  research  which  are  of  interest  in  the 
context  of  studying  tm=mg  complexes,  both  in  terms  of  future  directions  for 
experimental  and  computational  research.  The  most  important  points  are  sum¬ 
marized  below. 

As  expected  [12],  the  change  in  TM=MG  bond  lengths  upon  going  from  rhf 
to  MP2  (for  a  given  metal)  are  of  similar  magnitude  and  tend  to  increase  as  one 
goes  toward  the  heavier  MG  elements.  An  obvious  exception  is  the  Zr-chalcogenidos, 
Cl2ZrE  (E  =  O,  S,  Se,  Te)  (Table  I):  Arhf/mp2  =  0.12  A  (O),  0.03  A  (S),  0.05  A 
(Se),  and  0.06  A  (Te);  ZrE  bonds  are  longer  at  the  mp2  level  than  rhf.  It  is  of 
interest  that  inclusion  of  correlation  causes  the  greatest  change  in  the  Zr-oxo  bond 
length,  since  strong  metal  dTr-ligand  p?r  bonding  for  first  row  MG  elements  typically 
yields  the  best  rhf  description  in  a  series  of  related  tm  =  mg  complexes.  Based 
on  an  analysis  of  X-ray  crystallographic  data  for  Cp2Zr(E)(py)  [7a]  and  ECP  studies 
of  Cp2ZrE  models  [22]  it  was  concluded  that  the  description  of  the  Zr-oxo  linkage 
is  more  like  that  of  a  single  bond,  while  the  heavier  chalcogenido  complexes  were 
more  weighted  towards  a  double  bond  description  of  the  Zr-chalcogenido  bond.  A 
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triple  bond  description  of  the  Zr-chalcogenido  linkage  is  inconsistent  with  the  ob¬ 
served  reactivity  of  Cp2Zr(E)(py)  [7a].  A  doubly  bonded  structure,  with  a  greater 
number  of  covalent  bonds,  is  expected  to  be  better  described  at  the  Hartree~Fock 
level  than  a  singly  bonded  structure  in  which  TM  — MG  bonding  is  more  dative  in 
nature.  Taken  together,  experimental  and  computational  results  suggest  a  funda¬ 
mental  difference  in  the  bonding  description  between  oxo  complexes  and  heavier 
chalcogenidos  for  Group  IVB  complexes. 

Small  differences  are  found  between  rhf  and  mp2  geometries  in  the  overwhelming 
majority  of  cases.  The  C14WE  (E  =  O,  S,  Se)  complexes,  for  which  there  are  exact 
experimental  models  [28],  yield  rhf  determined  WE  bond  lengths  which  are  shorter 
than  experimental  values  by  «0.05  A;  whereas  the  mp2  calculated  WE  bond  lengths 
(Table  III)  are  longer  by  a  nearly  equivalent  amount.  When  compared  to  experiment 
[28],  the  mp2  wavefunction  tends  to  slightly  overcorrect  for  WE  bond  lengthening. 
Other  geometric  parameters  for  C14WE  show  minimal  changes  upon  the  inclusion 
of  electron  correlation.  Thus,  the  differences  in  the  rhf  and  mp2  optimized  ge¬ 
ometries  are  small  and  both  levels  of  theory  yield  reasonable  agreement  with  the 
ed  data  of  Page  et  al.  [28]  for  even  the  heaviest  member  of  the  series,  i.e.,  Cl4WSe. 
The  W-tetrelide  complexes  (Table  III)  show  some  of  the  largest  changes  in  geometry 
upon  the  inclusion  of  electron  correlation,  suggesting  that  future  studies  of  W- 
tetrelide  complexes  may  need  to  be  carried  out  using  more  sophisticated  and  com¬ 
putationally  intensive,  multiconfiguration  methods.  The  close  agreement  between 
rhf  and  mp2  calculated  geometries  for  the  remaining  models  supports  the  conten¬ 
tion  that  these  computationally  efficient  methods  can  provide  a  route  to  the  theo¬ 
retical  study  of  TM  =  MG  complexes  which  are  more  realistic  models  of  synthetic 
targets. 

The  ZrP  linkage  in  Cl2ZrPH  lengthens  by  0.1  A  upon  going  from  linear  (i.e., 
triple  bond)  to  bent  (i.e.,  double  bond)  PH  coordination.  Experimental  data  exists 
to  support  the  calculational  results.  The  TaP  bond  length  in  a  linearly  coordinated 
Ta-phosphinidene  [6c]  is  0.36  A  shorter  than  the  ZrP  bond  length  in  a  bent  Zr- 
phosphinidene  [6b];  this  difference  in  TM-phosphinidene  bond  lengths  is  much 
greater  than  the  difference  in  the  metallic  radii  of  Zr  and  Ta  (RZr  =  1.454  A, 
RTa  =  1.343  A,  diff.  =  0.1 1 1  A).  Also,  a  linearly  coordinated  W(IV)-phosphinidene 
has  WP  =  2.169(1)  A  [6d],  while  a  nonlinearly  coordinated  W(IV)-phosphinidene 
[6f],  Cp2W(PMes'),  has  a  WP  bond  which  is  0.18  A  longer,  WP  -  2.349(5)  A. 
Letting  the  WPH  angle  in  C4v-C14WPH  relax  from  180°  leads  to  a  Cs  minimum 
(15.8  kcal  mol-1  lower  in  energy  at  the  RHF  level)  with  WP  =  2.22  A,  or  0.14  A 
longer  than  in  C4v-C14WPH  (Table  III).  The  changes  in  M-P  bond  length  as  a 
function  of  M-P-H  angle  are  exciting  in  the  context  of  trying  to  make  stable 
phosphinidenes  since  a  linearly  coordinated  phosphinidene  should  be  of  greater 
kinetic  (by  tying  up  the  P  lone  pair)  and  thermodynamic  (by  strengthening  the  TM- 
phosphinidene  linkage)  stability.  The  metric  data  suggest  similar  descriptions  of 
the  TM  — MG  linkage  among  the  heavier  pnictogens  so  that  similar  proposals  for 
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LnM=AsR  and  LnM  =  SbR  complexes,  which  are  currently  not  known,  can  be 
made.  A  joint  theory-experiment  study  is  underway  to  test  these  hypotheses  and 
identify  synthetic  targets  (through  modification  of  R  and  Ln)  which  favor  linearly 
coordinated  PR,  AsR,  and  SbR  [30]. 
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Abstract 

A  series  of  complete  coupled-cluster  singles,  doubles,  and  triples  (ccsdt  )  calculations  have  been  per¬ 
formed  with  Hartree-Fock  (hf)  and  Brueckner  (b)  orbitals.  Calculations  have  been  performed  with  a 
double-zeta  plus  polarization  basis  set  on  the  H20,  SiH2,  NH2,  BeO,  C2,  CN+,  and  BN  molecules. 
Calculations  on  H20  and  SiH2  at  equilibrium  and  stretched  geometries  show  negligible  difference  between 
HF-  and  B-CCSDT  energies.  This  is  also  true  for  NH2 ,  except  when  the  bonds  have  been  stretched  to  twice 
their  equilibrium  values,  at  which  point  there  is  about  a  2.5  milli-Hartree  (m Eh)  difference.  Calculations 
on  the  isoelectronic  systems  BeO,  C2,  CN\  and  BN  were  performed  at  equilibrium  geometries.  Even 
though  these  systems  have  large  Tx  amplitudes,  the  difference  between  hf-  and  B-ccsdt  energies  is  only 
about  1  m Eh.  For  the  CCSD  method  and  the  CCSD(T)  method,  which  includes  triple  excitations  in  an 
approximate,  noniterative  manner,  however,  somewhat  larger  differences  are  observed  between  and  hf- 
and  B-CC  results.  Finally,  some  properties  of  BN  were  computed  using  HF-  and  B-CC  methods.  There  are 
quite  small  differences  between  the  hf-  and  B-CCSDT  results,  but  significantly  larger  ones  for  the  more 
approximate  CCSD  and  CCSD(T)  methods.  For  this  difficult  system,  where  the  CCSD(T)  approximation 
seems  to  be  inadequate  for  hf  orbitals,  the  use  of  Brueckner  orbitals  improves  the  agreement  of  CCSD<T) 
with  CCSD(T)  substantially  for  re  and  u>e,  although  the  difference  for  m  is  unaffected.  ©  1994  John  Wiley  & 
Sons,  Inc. 


Introduction 

The  subject  of  Brueckner  orbitals  has  a  long  history  [1-7].  These  orbitals  derive 
from  Brueckner’s  work  in  the  theory  of  nuclear  matter  [1].  Brueckner  orbitals  are 
eigenfunctions  of  an  effective  one-particle  operator,  and  thus  are  sometimes  referred 
to  as  the  orbitals  of  an  “exact  SCF”  theory  [3].  They  also  provide  the  single  deter¬ 
minant  with  the  maximum  overlap  with  the  exact  wave  function. 

Brueckner  orbitals  are  defined  to  be  those  orbitals  which  make  the  single  excitation 
amplitudes  C?  vanish  in  the  full  configuration  interaction  (FCl)  wave  function.  In 
practice,  of  course,  Brueckner  orbitals  are  obtained  for  some  approximate  method, 
such  as  truncated  Cl  or  truncated  coupled-cluster  (CC)  methods.  For  example,  Chiles 
and  Dykstra  [6]  obtained  Brueckner  orbitals  with  respect  to  the  CC  singles-and- 
doubles  (CCSD)  method,  and  they  termed  this  method  CCD  ( T\  =  0). 
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In  the  last  few  years  there  has  been  a  revival  of  interest  in  the  use  of  Brueckner 
orbitals  in  CC  methods.  Handy  et  al.  [  8  ]  implemented  a  scheme  which  they  termed 
“Brueckner  doubles”  (bd).  This  is  identical  to  the  CCD  ( Tx  =  0)  scheme  of  Chiles 
and  Dykstra  [6]  and  has  been  denoted  B-CCD  by  Stanton  et  al.  [9].  This  notation 
emphasizes  that  the  method  is  just  a  CCD  calculation  with  Brueckner  orbitals,  the 
latter  being  those  which  make  the  CCSD  T x  amplitudes  vanish.  Triple  excitations 
have  been  included  noniteratively  in  these  Brueckner  orbital  methods,  in  the  same 
manner  as  in  the  CCSD  +  t(CCSD)  [10]  and  CCSd(t)  [1 1,12]  methods.  Handy  et 
al.  defined  and  implemented  a  method  which  they  termed  bd(t),  while  Stanton 
et  al.  used  a  slight  extension  of  this  which  they  called  B-CCd(t)  [9,12].  Raghavachari 
et  al.  [13]  have  added  noniterative  triple  and  quadruple  excitation  corrections  to 
the  bd  model  to  give  the  bd(tq)  method,  which  is  the  Brueckner  orbital  analogue 
of  the  CCSD  +  tq*(ccsd)  [14]  and  ccsd(tq)  [13]  methods.  The  Brueckner  orbital 
variant  of  the  iterative  CCSDT  method  [15-18]  has  also  been  considered  [19].  As 
well  as  energies,  analytical  derivatives  have  been  implemented  for  the  bd  and  bd(t) 
methods  [20,21].  A  straightforward  determination  of  Brueckner  orbitals  requires 
a  series  of  CC  calculations,  with  orbitals  being  modified  after  each  cc  calculation 
[  6  ] .  However,  with  an  atomic  orbital  formulation  of  CC  theory,  it  is  possible  to 
perform  a  B-CC  calculation  with  about  the  same  computational  cost  as  a  single  CC 
calculation  [22]. 

Along  with  these  methodological  developments,  there  have  now  been  a  number 
of  B-CC  calculations,  particularly  comparisons  with  HF-CC  results  [23,24].  Some  of 
the  most  significant  recent  results  relate  to  symmetry  breaking  questions,  as  antic¬ 
ipated  by  earlier  studies  [  5  ] .  For  example:  ( 1 )  it  has  been  shown  that  with  a  Brueck¬ 
ner  reference  determinant,  the  distance  at  which  the  spin-restricted  solution  becomes 
unstable  is  greater  than  if  an  RHF  reference  determinant  is  used  [  8  ] ;  (2 )  it  has  been 
shown  that  in  some  cases  when  the  Hartree-Fock  orbitals  suffer  from  symmetry 
breaking,  the  Brueckner  orbitals  are  not  symmetry  broken  [  9  ] . 

While  there  have  now  been  quite  a  number  of  B-CC  calculations  incorporating  a 
noniterative  treatment  of  triple  excitations,  there  are  rather  few  in  which  triple 
excitations  are  incorporated  iteratively.  To  the  best  of  our  knowledge,  there  is  only 
one  study  in  of  this  type  [19].  However,  this  work  does  not  present  any  HF-CCSDT 
results  for  comparison,  and  it  uses  only  a  minimal  basis  set.  In  this  article,  we 
provide  a  comparison  of  Brueckner  and  Hartree-Fock  CCSDT  results  for  several 
difficult  examples,  using  double-zeta  plus  polarization  (dzp)  basis  sets.  We  also 
include  some  CCSD  and  CCSD(T)  results  for  comparison. 

Computational  Details 

All  of  the  calculations  have  been  performed  with  the  ACES  n  program  system 
[25,26].  Brueckner  orbitals  were  determined  in  the  manner  described  by  Chiles 
and  Dykstra  [  6  ] .  Thus,  one  begins  with  a  set  of  orthonormal  orbitals.  These  are  in 
principle  arbitrary,  but  in  this  work  are  Hartree-Fock  orbitals.  With  these  orbitals, 
a  CC  calculation  is  performed.  From  the  T j  amplitudes  and  the  initial  orbitals,  one 
generates  a  new  set  of  orbitals,  according  to  the  equations 
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c  -  r'(p)  4.  v  t?  r<°> 

fii  1  /  j  *'  i  pa 

a 

(i) 

r*  _  ^(0)  _  v  t? 

K-'fia  p.a  1 1  fii  * 

(2) 

Here  i  and  a  denote  occupied  and  virtual  molecular  orbitals,  respectively,  and  p 
labels  the  atomic  orbitals.  The  new  orbitals  are  then  orthonormalized  and  used  in 
another  CC  calculation.  This  generates  a  new  set  of  T i  amplitudes,  from  which  a 
new  set  of  orbitals  is  generated.  This  process  is  repeated  until  convergence,  at  which 
time  the  orbitals  are  Brueckner  orbitals.  Of  course,  this  procedure  requires  several 
CC  calculations,  and  is  not  recommended  as  an  efficient  method  for  generating 
Brueckner  orbitals  (cf.  Ref.  [22]),  but  is  adequate  for  testing  purposes. 

Several  systems  have  been  studied.  In  the  first  group  of  calculations,  the  H20, 
NH2,  and  SiH2  molecules  were  studied  with  a  double-zeta  plus  polarization  (dzp) 
basis  set  at  equilibrium  geometries  and  at  geometries  with  stretched  bonds.  The 
geometries  are  the  same  as  those  used  in  fci  calculations  [27-29].  The  O  and  N 
basis  sets  are  the  4s2p  contractions  of  Dunning  [30]  augmented  with  the  d  polar¬ 
ization  functions  of  Redmon  et  al.  [31].  The  H  basis  set  in  the  H20  and  NH2 
calculations  is  Dunning’s  2s  set  [30],  augmented  with  the  p  exponent  of  Redmon 
et  al.  (0.7).  In  the  SiH2  calculations  the  H  basis  set  is  the  same,  except  that  the  p 
function  has  an  exponent  of  1 .0.  The  Si  basis  set  is  the  6s4p  set  of  Dunning  and 
Hay  [32]  with  a  d  function  of  0.3  [29].  In  the  second  group,  single  point  energy 
calculations  were  performed  at  geometries  close  to  experimental  equilibrium  ge¬ 
ometries  [33]  of  the  lowest  closed-shell  states  of  the  isoelectronic  series  BeO,  C2, 
CN  +  ,  and  BN  with  a  dzp  basis  set.  For  B,  C,  and  N,  the  basis  set  comprised  the 
4s2p  set  of  Dunning  [30]  and  the  polarization  functions  of  Redmon  et  al.  (0.386 
(B),  0.654  (C),  and  0.902  (N)).  The  Be  basis  set  was  the  3s2p  set  of  Dunning  and 
Hay  [  32  ]  with  a  polarization  exponent  of  0.6  [  34 ],  and  the  O  basis  set  was  Dunning’s 
4s2p  set  [30]  with  a  polarization  exponent  of  0.85  [32].  The  bond  lengths  used  in 
the  calculations  are  1.331  A  (BeO),  1.243  A  (C2),  1.173  A  (CN  +  ),  and  1.28  A 
(BN).  The  final  group  of  calculations  involved  determination  of  some  properties 
of  the  a 1  2+  state  of  BN  using  the  same  dzp  basis  set  as  described  above.  Throughout 
this  work,  all  electrons  were  correlated  and  spherical  harmonic  d  functions  were 
used,  except  in  the  calculations  on  H20  which  used  Cartesian  d  functions. 

In  order  to  give  a  quantitative  indication  of  the  size  of  the  T \  CC  amplitudes,  we 
have  computed  the  “T i  diagnostic”  of  Lee  and  Taylor  [35],  which  is  defined  by 

T .  =  [^?(«)2  +  /?(^3)2]  (3) 

for  scf  orbitals  where  N  is  the  number  of  correlated  electrons.  As  Tj  is  a  measure 
of  the  difference  of  a  set  of  orbitals  from  Brueckner  orbitals,  it  is  expected  to  correlate 
with  the  difference  between  CC  results  computed  with  Brueckner  and  another  set 
of  orbitals  [23].  Contrary  to  the  opinion  of  some,  this  diagnostic  should  not  be 
used  as  a  measure  of  multi-reference  character,  as  it  ignores  the  amplitudes  of 
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double  excitations  which  have  a  dominant  role  in  expressing  that  aspect.  As  any 
set  of  orbitals  may  be  obtained  from  another  from  eTl,  the  size  of  Tj  is  an  orbital 
measure. 

We  also  report  the  total  CC  energy  and  the  cc  correlation  energy.  The  former  is 
the  sum  of  the  energy  of  the  reference  determinant  and  the  CC  correlation  energy 
for  that  determinant: 


EXot  =  ($0\e-THeT\$0)  =  EKt  +  Ea,n  (4) 

Ertf  =  <4>o|ff|#o)  (5) 

Eco„=($0\e-T(H-EKt)eT\$0)  (6) 

where  <£0  is  the  reference  determinant  and  T  is  the  cluster  operator. 

Results 


H20 

Symmetrical  bond  stretching  in  H20  has  many  times  [15]  been  used  as  a  test  of 
approximate  methods.  As  the  bonds  are  stretched,  the  single  determinant  reference 
function  provides  a  progressively  less  satisfactory  description,  which  is  reflected  by 
large  amplitudes  in  the  CC  wave  function.  Studies  have  shown  that  cc  methods  which 
include  triple  excitations  provide  excellent  agreement  with  FCI  at  the  equilibrium 
geometry  ( re )  and  also  when  the  bonds  have  been  stretched  to  one-and-a-half  times 
their  equilibrium  lengths  (\.5re).  At  2re,  the  performance  is  reasonable,  but  not  so 
good,  with  the  CC  energies  a  few  millihartrees  (m Eh)  below  the  FCI  energy. 

In  Table  1,  we  compare  the  CC  energies  obtained  using  both  restricted  Hartree- 
Fock  (rhf)  and  Brueckner  reference  determinants.  It  is  seen  that  the  hf-ccsdt  and 
B-CCSDT  energies  are  essentially  identical,  although  as  the  bond  length  is  increased, 
the  B-CCSDT  “correlation  energy,”  as  it  is  measured  relative  to  the  B  reference,  becomes 
significantly  larger  than  the  hf-ccsdt  correlation  energy  (as  the  T{  amplitudes  become 
larger,  the  Brueckner  orbitals  deviate  more  from  Hartree-Fock  ones).  The  small 
difference  in  the  CCSDT  energies  seen  here  is  another  example  of  the  well-known 
(approximate)  invariance  of  the  results  of  CC  methods  (particularly  those  having  the 
full  eTl  operator  in  the  wave  operator)  to  choice  of  reference  determinant  [36] . 


Table  I.  Comparison  of  hf-  and  b-ccsdt  energies  of  H20  at  equilibrium  and 
stretched  geometries,  dzp  basis  set,  atomic  units. 


Method 

Property 

re 

\.5re 

2re 

HF-CCSDT 

Ftot 

-76.27360 

-76.08727 

-75.97141 

Ecott 

-0.23289 

-0.28605 

-0.38850 

T, 

0.006 

0.022 

0.049 

B-CCSDT 

EtoX 

-76.27359 

-76.08716 

-75.97144 

Ecott 

-0.23395 

-0.29287 

-0.41332 
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Table  II.  Comparison  of  hf-  and  B-CCSDT  energies  of  SiH2  at  equilibrium  and 
stretched  geometries,  dzp  basis  set,  atomic  units. 


Method 

Property 

re 

1.5  r. 

2re 

HF-CCSDT 

£,ot 

£corr 

-290.19614 

-0.17920 

0.009 

-290.08138 

-0.20704 

0.020 

-289.99867 

-0.29152 

0.045 

B-CCSDT 

£tOt 

F 

A-^corr 

-290.19613 

-0.18098 

-290.08129 

-0.21400 

-289.99834 

-0.31509 

SiH2 

This  example  is  similar  to  H20,  but  in  fact  is  a  more  severe  test,  since  even  at 
equilibrium  more  than  one  determinant  contributes  significantly  to  the  wave  func¬ 
tion.  The  B-CCSDT  and  hf-ccsdt  results  are  shown  in  Table  II.  The  results  are 
basically  the  same  as  for  H20,  again  the  B-CCSDT  and  hf-ccsdt  energies  being 
essentially  identical. 

2a}  nh2 

A  number  of  FCI  comparisons  have  been  performed  for  this  system  [8,12,37]. 
They  have  demonstrated  that  whether  one  starts  from  a  UHF  or  an  ROHF  reference 
determinant,  it  is  extremely  difficult  to  reproduce  the  FCI  energy  well  at  2 re.  We 
have  found  that  the  complete  ccsdt  method,  with  both  uhf-  and  rohf  reference 
determinants,  is  able  to  provide  a  good  description,  but  this  is  not  true  of  the 
CCSD(t)  method  or  iterative  approximations  to  CCSDT  [12]. 

In  Table  III  we  show  the  B-CCSDT  and  hf-ccsdt  results.  We  see  that  at  re  and 
1.5 re  the  energies  are  virtually  identical,  but  that  at  2 re  there  is  a  2.5  m Eh  difference. 
Considering  that  the  B-CCSDT  results  have  less  spin  contamination,  one  might  imag¬ 
ine  that  they  would  be  closer  to  fci.  However,  our  previous  comparison  with  FCI 


Table  III.  Comparison  of  hf-  and  b-ccsdt  energies  of  NH2 .  2Aj  at  equilibrium 
and  stretched  geometries,  dzp  basis  set,  atomic  units. 


Method 

Property 

re 

1.5r, 

2  re 

HF-CCSDT 

£to. 

T^corr 

T, 

25+  1 

-55.70288 

-0.17693 

0.005 

2.0000 

-55.53192 

-0.20638 

0.023 

2.0001 

-55.42792 

-0.16713 

0.087 

2.0156 

B-CCSDT 

£tot 

Ecctx 

25+  1 

-55.70287 

-0.17753 

2.0000 

-55.53188 

-0.21003 

2.0000 

-55.42556 

-0.23152 

2.0074 

200 


WATTS  AND  BARTLETT 


[12]  showed  that  the  UHF-CCSDT  energy  was  about  1.5  m Eh  above  the  FCI  energy, 
so  the  B-CCSDT  energy  is  apparently  in  poorer  agreement  with  FCI . 

Four  Difficult  12-Electron  Systems:  The  Lowest  Singlet  States  of  BeO,  C2,  CN+, 
and  BN 

We  now  consider  four  closed-shell,  isoelectronic  systems  which  have  significant 
contributions  from  excited  determinants  at  their  equilibrium  geometries.  BeO  has 
quite  large  T\  amplitudes,  but  small  T2  amplitudes.  C2  has  smaller  T{  amplitudes, 
but  large  T2  amplitudes,  while  CN+  and  BN  have  large  T 1  and  large  T2  amplitudes. 
In  Table  IV  we  show  T j  along  with  the  largest  T \  and  T2  amplitudes.  We  note  that 
these  data  illustrate  our  earlier  statement  about  T 1  not  being  an  adequate  measure 
of  multi-reference  character.  Thus,  from  the  largest  amplitudes,  it  seems  unreason¬ 
able  to  say  that  BeO  has  more  multi-reference  character  than  C2,  even  though  T  I 
is  larger  for  BeO  than  for  C2 .  Tj  completely  follows  the  size  of  the  largest  tf .  When 
an  important  double  excitation  is  present,  we  expect  potential  errors  in  single  ref¬ 
erence  CC  methods  due  to  an  inadequate  description  of  the  multi-reference  character. 

We  have  performed  HF-CC  and  B-CC  calculations  on  these  systems  with  a  DZP 
basis  set  at  geometries  close  to  observed  equilibrium  geometries.  The  results  are 
shown  in  Table  V.  In  all  cases,  Brueckner  orbitals  increase  the  ccsd  energy  by  2- 
3  m Eh.  The  behavior  of  CCSD(t)  is  mixed,  but  correlates  with  the  value  of 
Thus,  for  the  systems  with  smaller  T  { ,  BeO  and  C2,  the  Hartree-Fock  and  Brueckner 
results  are  very  close,  while  for  CN+  and  BN  there  is  a  significant  difference.  For 
CCSDT,  the  differences  between  the  Brueckner  and  Hartree-Fock  references  are 
somewhat  smaller  than  for  CCSD. 

a1  BN:  Molecular  Properties 

The  lowest  singlet  state  of  the  BN  molecule  provides  a  considerable  challenge  to 
theoretical  calculations  since,  even  at  equilibrium,  there  are  significant  contributions 
from  singly  and  doubly  excited  determinants.  A  recent  study  by  Martin  et  al.  [  38  ] 
compared  the  results  of  a  variety  of  CC  and  multi-reference  Cl  methods.  It  was 


Table  IV.  Comparison  of  largest  Tx  and  T2  amplitudes  and  T ,  for  BeO,  C2,  CN+,  BN.  dzp  basis  set, 
geometries  close  to  experiment  (see  text). 


Method 

Property 

BeO 

C2 

CN+ 

BN 

CCSD 

Largest  ff 

-0.124 

0.085 

-0.179 

0.211 

Largest  tff 

-0.048 

-0.309 

-0.309 

-0.243 

T\ 

0.042 

0.031 

0.059 

0.065 

CCSDT 

Largest 

-0.139 

0.085 

0.187 

0.230 

Largest  tff 

-0.056 

-0.386 

-0.403 

-0.341 

T, 

0.045 

0.031 

0.062 

0.071 
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Table  V.  Comparison  of  hf-  and  b-cc  energies  of  the  lowest  singlet  states  of  BeO,  C2,  CN\  BN.  dzp 
basis  set,  atomic  units,  and  geometries  close  to  experiment  (see  text). 

Method 

Property 

BeO 

c2 

CN+ 

BN 

HF-CCSD 

L'tot 

-89.66581 

-75.72907 

-91.99553 

-79.20780 

Econ 

-0.24459 

-0.33994 

-0.37145 

-0.31561 

T  i 

0.042 

0.031 

0.059 

0.065 

B-CCSD 

£tot 

-89.66273 

-75.72602 

-91.99274 

-79.20469 

Tcorr 

-0.25202 

-0.35646 

-0.40461 

-0.34366 

hf-ccsd(t) 

£tot 

-89.67455 

-75.75555 

-92.02868 

-79.23585 

Ecott 

-0.25333 

-0.36642 

-0.39634 

-0.33377 

T, 

0.042 

0.031 

0.059 

0.065 

B-CCSD(T) 

Etot 

-89.67506 

-75.75581 

-92.02604 

-79.23319 

Ecott 

-0.26435 

-0.38625 

-0.42964 

-0.36228 

HF-CCSDT 

Etot 

-89.67591 

-75.75454 

-92.02447 

-79.23339 

f 

A-'corr 

-0.25470 

-0.36541 

-0.40039 

-0.34120 

T1 

0.045 

0.031 

0.062 

0.071 

B-CCSDT 

EXOi 

-89.67464 

-75.75391 

-92.02354 

-79.23203 

Econ 

-0.26532 

-0.38325 

-0.42647 

-0.36226 

suggested  that  for  this  system  the  considerable  multi-reference  character  made  a 
noniterative  treatment  of  triple  excitations,  as  in  ccsd(t),  insufficiently  accurate. 
In  particular,  significant  differences  between  CCSD(t)  and  CCSDT  results  were  ob¬ 
tained,  and  CCSD(t)  gave  a  smaller  bond  length  and  a  larger  frequency  than  CCSD, 
which  is  most  unusual. 

We  have  now  performed  a  series  of  CCSD,  CCSD(t),  and  CCSDT  calculations  of 
equilibrium  properties  using  Brueckner  and  RHF  reference  determinants  and  a  DZP 
basis  set.  The  results  are  shown  in  Table  VI.  The  Hartree-Fock  results  are  in  line 
with  those  of  Martin  et  al.  The  CCSD  and  CCSDT  re  and  are  similar,  while  the 
CCSD(t)  reis  smaller  and  the  co^is  larger.  When  one  uses  instead  Brueckner  orbitals, 
several  changes  take  place.  The  CCSD  bond  length  decreases  by  about  0.005  A,  with 


Table  VI.  Calculated  properties  of  1  2+  BN  from  hf-  and  B-cc  calculations,  dzp  basis  set. 


Method 

Energy/^ 

rj  A 

av/cm  1 

n/D 

T, 

HF-CCSD 

-79.20819 

1.2983 

1650 

1.922 

0.059 

hf-ccsd(t) 

-79.23595 

1.2892 

1686 

1.625 

0.062 

HF-CCSDT 

-79.23389 

1.3006 

1656 

1.702 

0.063 

B-CCSD 

-79.20492 

1.2937 

1712 

1.873 

B-CCSD(T) 

-79.23384 

1.3034 

1655 

1.584 

B-CCSDT 

-79.23252 

1.3001 

1668 

1.662 
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a  corresponding  increase  in  coe.  The  CCSd(t)  changes  are  in  the  opposite  direction, 
with  a  particularly  large  bond  length  change.  For  CCSDT,  the  changes  are  somewhat 
smaller,  but  again  there  is  a  significant  bond  length  decrease.  With  Brueckner  or¬ 
bitals,  the  CCSD(t)  results  are  much  closer  to  the  ccsdt  results  than  is  the  case 
with  Hartree-Fock  orbitals.  Judging  by  a  comparison  with  mrci  data  [38],  ccsdt 
seems  to  provide  a  reasonable  description  of  this  system,  although  it  is  likely  that 
connected  quadruple  excitations  [  39  ]  will  be  necessary  for  really  accurate  work, 
particularly  for  calculating  the  energy  difference  between  this  singlet  state  and  the 
ground  state  [  38  ] . 

It  follows  that,  for  this  system,  one  can  further  improve  the  performance  of 
CCSD(T)  by  using  Brueckner  orbitals,  which  is  indicative  of  important  higher  order 
T\  corrections. 


Conclusions 

The  present  calculations  are  the  first  comparisons  between  hf-  and  B-CCSDT 
results.  The  differences  between  the  two  sets  of  results  are  of  little  chemical  signif¬ 
icance.  For  the  bond  stretching  examples  considered,  the  use  of  Brueckner  orbitals 
does  not  lead  to  energetic  improvements,  although  the  spin  contamination  of  the 
uhf  results  is  reduced  slightly.  The  b-ccsdt  energies  of  BeO,  C2,  CN+,  and  BN 
are  all  no  more  than  about  1  kcal  mol-1  higher  than  the  hf-ccsdt  energies.  For 
the  BN  properties,  the  differences  between  the  HF-CCSDT  and  B-CCSDT  results  may 
be  significant.  At  present,  it  is  not  clear  which  results  are  better,  and  the  answer 
will  have  to  await  larger  basis  set  studies.  On  the  whole,  our  recommendation  is 
that  if  one  can  afford  a  full  CCSDT  calculation,  little  is  to  be  gained  by  obtaining 
Brueckner  orbitals.  A  possible  exception,  which  needs  to  be  studied  further,  would 
be  symmetry  breaking  problems,  for  which  there  are  some  clear  advantages  to  using 
a  set  of  orbitals  which  are  not  symmetry  broken  [  9  ] . 

Perhaps  the  most  interesting  aspect  of  this  study  is  the  observation  of  how 
Brueckner  orbitals  improved  the  CCSd(t)  results  for  BN.  If  this  is  a  general  pattern 
of  behavior,  it  will  be  of  some  value,  and  this  should  be  further  investigated. 
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Abstract 

Relativistic  pair  correlation  energies  of  Xe  were  computed  by  employing  a  recently  developed  relativistic 
coupled  cluster  theory  based  on  the  no-pair  Dirac-Coulomb-Breit  Hamiltonian.  The  matrix  Dirac- 
Fock-Breit  SCF  and  relativistic  coupled  cluster  calculations  were  performed  by  means  of  expansion  in 
basis  sets  of  well-tempered  Gaussian  spinors.  A  detailed  study  of  the  pair  correlation  energies  in  Xe  is 
performed,  in  order  to  investigate  the  effects  of  the  low-frequency  Breit  interaction  on  the  correlation 
energies  of  Xe.  Nonadditivity  of  correlation  and  relativistic  (particularly  Breit)  effects  is  discussed.  © 
1994  John  Wiley  &  Sons,  Inc. 


Introduction 

The  majority  of  contemporary  ab  initio  calculations  have  been  performed  on 
light  atoms  and  molecules  containing  them,  with  which  a  large  part  of  chemical 
research  is  concerned.  However,  important  physical  and  chemical  processes  involve 
heavy  atoms,  where  relativistic  effects  cannot  be  ignored.  Because  relativistic  and 
correlation  effects  play  an  essential  role  in  the  electronic  structure  of  heavy-atom 
systems,  relativistic  many-body  theories  have  become  the  subject  of  active  research 
interest  in  recent  years  [  1  ] .  Since  relativistic  effects  can  be  large  and  may  not  always 
be  treated  as  a  small  perturbation,  it  is  often  necessary  to  forfeit  the  Schrodinger 
equation  in  favor  of  the  Dirac  equation  in  order  to  describe  the  electronic  structure 
of  heavy-atom  systems. 

In  the  last  few  years,  relativistic  many-body  perturbation  theory  (mbpt  )  [2-5] 
and  relativistic  coupled  cluster  (CC)  theory  [6-9] ,  which  account  for  both  relativistic 
and  electron  correlation  effects,  were  developed  by  several  groups  using  basis  sets 
of  local  and  global  functions.  The  relativistic  mbpt  and  CC  algorithms,  based  on 
an  expansion  in  analytic  basis  functions  [  2-9  ] ,  have  the  advantage  of  providing  a 
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compact  and  discrete  representation  of  the  complete  Dirac  spectrum,  and  greatly 
facilitate  the  evaluation  of  the  many-body  diagrams  using  finite  summations  [  3  ] . 
Furthermore,  by  invoking  the  finite  basis  set  expansion  in  terms  of  Gaussian  spinors 
(G-spinors),  these  relativistic  many-body  theories  can  be  applied  to  molecular  elec¬ 
tronic  structure  problems  in  a  straightforward  way. 

In  a  previous  study  [10],  a  relativistic  mbpt  based  on  the  no-pair  Dirac-Coulomb 
(dc)  Hamiltonian  was  used  to  compute  the  pair  Coulomb  correlation  energies  of 
Xe  and  their  nonadditive  contributions  [10]  due  to  the  interference  between  the 
relativistic  and  Coulomb  correlation  effects.  The  valence,  core-valence,  and  core 
contributions  of  the  relativistic  pair  correlations  were  analyzed  in  detail.  The  results 
strongly  indicated  that  the  relativistic  and  Coulomb  correlation  effects  are  additive 
in  the  valence  shell  of  Xe.  However,  in  a  subsequent  study  on  relativistic  mbpt 
based  on  the  Dirac-Coulomb-Breit  (DCB)  Hamiltonian  [4],  the  low-frequency 
Breit  interaction  was  found  to  significantly  affect  the  electron  correlation  in  He  and 
He-like  ions.  In  that  study,  the  instantaneous  Coulomb  and  low-frequency  Breit 
interactions  were  treated  using  a  unified  formalism  in  both  the  construction  of  the 
Dirac-Fock-Breit  (dfb)  self-consistent  field  (SCF)  potential  and  in  the  evaluation 
of  mbpt  diagrams.  The  results  on  the  He-like  ions  indicate  that  relativity  and  electron 
correlation  become  strongly  nonadditive  when  the  Breit  interaction  is  included  as 
an  integral  part  of  the  two-electron  interaction. 

Recently,  we  have  developed  a  relativistic  version  of  the  multireference  valence- 
universal  Fock-space  cc  theory  based  on  the  no-pair  dcb  Hamiltonian,  and  im¬ 
plemented  it  by  employing  analytic  basis  expansion  in  terms  of  G-spinors  [11-13]. 
The  relativistic  coupled  cluster  algorithm  also  treats  the  instantaneous  Coulomb 
and  the  low-frequency  Breit  interactions  as  an  integral  part  of  the  two-electron 
interaction  in  both  the  SCF  and  relativistic  CC  procedures.  The  method  was  also 
applied  to  many-electron  systems,  and  yielded  accurate  ionization  potentials,  ex¬ 
citation  energies,  and  fine-structure  splittings  for  Au  [11],  for  highly  ionized  ions 
with  2-5  electrons  [12],  and  for  alkali  metal  atoms  [13].  In  the  present  study  we 
employ  this  relativistic  coupled  cluster  theory  based  on  the  no-pair  dcb  Hamiltonian 
[11]  to  compute  the  pair  correlation  energies  of  Xe.  The  main  objective  of  the 
present  study  is  to  perform  a  detailed  analysis  of  the  pair  correlation  energies  in 
order  to  investigate  the  effects  of  the  low-frequency  Breit  interaction  on  the  valence 
correlation  energies  of  Xe.  The  pair  energies  are  partitioned  into  core,  core- valence, 
and  valence  shell  contributions,  to  investigate  where  the  correlation  energy  and 
nonadditive  contributions  originate.  The  Xe  atom  has  54  electrons,  and  is  to  our 
knowledge  the  heaviest  atom  to  which  all-electron  cc  calculations  with  analytic 
basis  functions  have  been  applied.  The  next  section  outlines  our  relativistic  CC 
method  based  on  the  no-pair  dcb  Hamiltonian.  The  analysis  of  the  relativistic  pair 
correlation  energies  in  Xe  is  presented  in  the  third  section. 

Method 

The  relativistic  many-body  Hamiltonian  cannot  be  expressed  in  closed  potential 
form.  An  approximate  relativistic  Hamiltonian  most  commonly  used  is  the  time- 
honored  “no-pair”  DC  Hamiltonian  [14,15] 


RELATIVISTIC  PAIR  CORRELATION  ENERGIES  OF  XE 


207 


h+  =  ZWO  +  4  2  v0jC+,  (l) 

i  i<j 

where  (in  a.u.)  hD  is  the  one-electron  Dirac  Hamiltonian  with  the  nuclear  potential 
V 

'  nuc 

hD  =  ca-p  +  1 8c2  +  Knuc ,  (2) 

and  the  instantaneous  Coulomb  interactions  between  electrons  are 

V0  =  ry'.  (3) 

a  and  (3  are  the  Dirac  matrices.  The  operator  jC+  is  defined  as  the  product 
L+(  1 )  •  L+(2)  •  •  •  L+(n),  with  L+(i)  the  projection  operator  onto  the  space  spanned 
by  the  positive-energy  eigenfunctions  of  the  Dirac-Fock-Coulomb  (dfc)  operator 
[16].  In  this  form,  the  no-pair  Hamiltonian  is  restricted  to  contributions  from  the 
positive-energy  branch  of  the  dfc  spectrum.  The  no-pair  DC  Hamiltonian,  however, 
is  deficient  in  that  it  is  both  noncovariant  and  not  accurate  enough  for  high-precision 
calculation  of  fine  structure  separations  and  binding  energies  of  the  inner-shell 
electrons.  Adding  the  low-frequency  Breit  interaction, 

Bij  =  -  2-  [3,  •  3,  +  (3,  •  7,j) ■  (a;  •  ,  (4) 

to  the  instantaneous  Coulomb  operator  introduces  the  leading  effects  of  the  trans¬ 
verse  photon  exchange  and  remedies  the  lack  of  covariance  of  the  no-pair  DC  Ham¬ 
iltonian  [14,15].  Inclusion  of  the  Breit  interaction  results  in  a  Hamiltonian  which 
contains  all  effects  through  second  order  in  a,  the  fine-structure  constant  [15].  The 
no-pair  dcb  Hamiltonian  [15] 

H+  =  2  hD(i)  +  £+  2  (rj  +  B,j)£+  (5) 

i  i>j 

provides  a  satisfactory  starting  point  for  calculations  on  many-electron  systems  in 
the  sense  that  it  treats  the  electrons  relativistically,  treats  the  most  important  part 
of  the  electron-electron  interaction  nonperturbatively,  and  puts  the  Coulomb  and 
Breit  interactions  on  the  same  footing  in  dfb  SCF  and  relativistic  CC  calculations. 
In  order  to  study  the  electron  correlation  induced  by  the  Breit  interaction,  the 
instantaneous  Coulomb  and  frequency-independent  Breit  interactions  are  treated 
as  an  integral  part  of  the  two-electron  interaction  in  the  dfb  SCF  and  relativistic 
CC  calculations  [11]. 

In  ^-number  theory  the  Dirac-Coulomb-Breit  Hamiltonian  H+  is  rewritten  in 
terms  of  normal-ordered  products  of  the  spinor  operators,  {r+s}  and  {r+s+ut} 
[14,15,4], 

H  =  H+-(0\H+\0)=  2,frs{r+s}  2  (rs\\tu){r+s+ut)  ,  (6) 

rs  rstu 


where 


(rs\\tu)  =  ( rs\tu )  -  (rs\ut) 


(7) 
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and 


< rs\tu )  =  J  dxldx2^f(xl)^*(x2)(rij  +  512)^(xi)^u(x2)  .  (8) 

Here,y^  and  (rs\  \  tu )  are,  respectively,  elements  of  the  one-electron  Dirac-Fock 
and  antisymmetrized  two-electron  Coulomb-Breit  interaction  matrices  over  Dirac 
four-component  spinors.  The  effect  of  the  projection  operators  <£+  is  now  taken 
over  by  normal  ordering,  denoted  by  the  curly  braces  in  the  equations  above,  which 
requires  annihilation  operators  to  be  moved  to  the  right  of  creation  operators  as  if 
all  anticommutation  relations  vanish.  The  Fermi  level  is  set  at  the  top  of  the  highest 
occupied  positive-energy  state,  and  the  negative-energy  states  are  ignored. 

By  adopting  the  no-pair  approximation,  a  natural  and  straightforward  extension 
of  the  nonrelativistic  CC  theory  emerges.  The  correlated  wave  function  is  obtained 
from  the  uncorrelated  T'o  by  an  exponential  wave  operator, 

*  =  o,  SI  =  exp  {S'},  (9) 

where  S  is  the  normal-ordered  excitation  operator  [17] 

S  =  Si  +  S2  +  *  *  *  =  2  s;{r+s}  +  ^  2  Srtsu{r+S+Ut}  +  •  •  •  ,  (10) 

rs  rstu 

with  excitation  amplitudes  srs ,  etc.  The  expansion  of  S  is  truncated  here  after 

S2 ,  to  yield  the  CCSD  (coupled  cluster  with  single  and  double  excitations)  scheme. 

Once  the  iterative  coupled  cluster  equations  converge  and  give  the  excitation 
amplitudes,  the  correlation  energy  is  obtained  by 

E  =  2  Ssfrs  +  7  2  sr,i{rs\\tu)  .  (11) 

rs  ^  rstu 

Only  the  second  sum  appears  if  hf  (or  df)  orbitals  are  used,  which  is  the  case  in 
the  present  work.  The  correlation  energy  of  the  ij  pair  is  obtained  by  limiting  the 
hole  indices  tu  in  that  sum  to  ij,  and  summing  over  all  particle  indices  rs. 

Detailed  account  of  the  matrix  dfb  scf  formalism  and  its  applications  to  many- 
electron  systems  has  been  given  in  previous  publications  [4]  and  is  not  repeated 
here.  The  speed  of  light  was  taken  to  be  137.037  a.u.  throughout  this  work.  The 
nonrelativistic  limit  was  simulated  by  setting  the  speed  of  light  to  c  =  104.  Another 
set  of  SCF  calculations  was  done  without  the  Breit  term.  This  is  the  conventional 
dfc  SCF  scheme  based  on  the  no-pair  DC  Hamiltonian  [2-4]. 

A  basis  set  of  well-tempered  Gaussian-type  functions  reported  by  Huzinaga  and 
Klobukowski  [18]  was  used  in  contracted  and  uncontracted  form  [19].  The  non¬ 
relativistic,  DFC  and  DFB  SCF  calculations  were  done  with  the  uncontracted 
{2\s\9p\?>d)  basis  set.  In  a  previous  study,  the  (21sl9/?13d)  primitives  set  was 
contracted  to  generate  a  [14sl  3p  10  d)  contracted  set  for  mbpt  calculations  on  Xe 
[19].  This  was  done  by  grouping  the  functions  of  highest  exponents  (see  Table  II 
of  Ref.  [19]).  The  relativistic  correlation  energy  of  the  contracted  set  reproduces 
over  99%  of  that  of  the  uncontracted  basis  [19].  We  have  employed  the  same  set 
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of  contracted  functions  in  the  present  cc  calculations,  and  added  uncontracted 
Gaussians  up  to  Lmax  =  5  to  yield  a  [\4s\3  p  10  dl  f  6  g5h]  basis  set.  The  exponents 
for  the  high-/  functions  were  taken  from  the  same  well-tempered  series  [18],  The 
highest  exponent  for  the  /  and  g  sectors  was  1 10.24499,  and  for  the  h  sector — 
52.022218.  The  finite-nucleus  model  discussed  in  Ref.  [4]  was  employed  for  Vnuc 
of  Eq.  (2).  The  atomic  mass  used  in  the  calculations  was  13 1.30  amu.  Calculations 
were  done  on  the  IBM  RS6000/320  and  360  workstations  at  Tel  Aviv  University. 

Results  and  Discussion 

Table  I  displays  the  SCF  and  correlation  (CCSD)  energies  computed  with  the 
nonrelativistic,  dfc  and  dfb  wave  functions.  The  DFC  SCF  energy  obtained  with 
the  uncontracted  G-spinor  basis  set  is  higher  by  0.0 1 7  a.u.  than  the  DFC  energy  of 
-7446.9010  a.u.  computed  by  numerical  finite  difference  DF  [20].  We  have  com¬ 
puted  the  nonrelativistic  limit  £Hf  by  taking  c  =  104  in  the  DFC  SCF  calculations. 
This  gave  -7232.0717  a.u.  The  relativistic  energy  lowering,  which  is  the  differ¬ 
ence  between  the  total  DFC  SCF  energy  EDFc  and  the  nonrelativistic  limit  En F,  is 
214.83  a.u. 

The  inclusion  of  the  Breit  term  in  the  two-electron  interaction  raises  the  dfb  SCF 
energy  above  the  dfc  scf  energy.  The  variational  Breit  energy,  computed  as  the 
difference  EDFB  -  EDFC,  is  the  level  shift  in  the  total  SCF  energy  due  to  the  inclusion 
of  the  Breit  interaction  in  the  SCF  process.  This  is  5.77  a.u.  for  Xe.  The  energy  shift 
is  primarily  due  to  a  large  upward  shift  in  the  inner-shell  spinor  energies  [4]. 

The  nonrelativistic  (nr),  DC  and  DCB  correlation  energies  are  those  computed 
by  applying  the  CCSD  method  to  the  nonrelativistic,  DC  and  DCB  Hamiltonians, 
respectively.  The  difference  between  the  relativistic  (DC  and  DCB)  correlation  ener¬ 
gies  and  the  nr  correlation  energy  is  the  relativistic  correction  to  the  correlation 
energy.  This  correction  is  given  in  the  last  column  of  Table  I.  Comparing  the  results 
of  the  relativistic  calculations  yielded  by  the  DC  and  DCB  Hamiltonians  with  the 
nonrelativistic  results,  the  DC  Hamiltonian  is  seen  to  account  for  only  a  small 
fraction  of  the  relativistic  correction  to  the  correlation  energy,  0.027  a.u.  This  non¬ 
additive  contribution  is  due  to  the  interference  between  relativity  and  electron 


Table  I.  scf  and  correlation  energies  of  Xe  (in  a.u.).  The  scf  energies  are  given  for  the  uncontracted 
(2\s\9pl3d)  and  contracted  [14d3pl(W]  bases,  the  CCSD  correlation  energies  for  the  contracted  basis 

augmented  by  IjhgSh. 


SCF 

Uncontracted 

Contracted 

Correlation 

Nonadditive  energy2 

NR 

DC 

DCB 

-7232.0717 

-7446.8836 

-7441.1138 

-7232.0467 

-7443.3180 

-7437.5270 

-2.5926 

-2.6199 

-2.6956 

0.027 

0.103 

a  Relativistic  correction  to  correlation  energy. 
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Table  II.  Pair  correlation  energies  of  Xe  (in  millihartree).  [\4s\3p\0dljhg5h]  basis  set. 


Pair  energy 

Pair 

NR 

DC 

DCB 

5p5p 

-71.39 

Valence 

-71.10 

-71.08 

5p5s 

-26.29 

-27.25 

-27.23 

5s5s 

-6.02 

-6.05 

-6.05 

5p4d 

-114.20 

Core-Valence 

-117.53 

-117.46 

5p3d 

-11.06 

-11.63 

-11.64 

5p4p 

-17.60 

-17.95 

-17.97 

5p3p 

-3.42 

-3.45 

-3.48 

5p2p 

-1.16 

-1.22 

-1.29 

5p4s 

-3.37 

-3.22 

-3.22 

5p3s 

-0.84 

-0.79 

-0.79 

5p2s 

-0.28 

-0.28 

-0.29 

5pls 

-0.11 

-0.11 

-0.14 

5s4d 

-39.46 

-44.62 

-44.61 

5s3d 

-4.80 

-5.68 

-5.68 

5s4p 

-9.93 

-10.80 

-10.82 

5s3p 

-1.55 

-1.75 

-1.76 

5s2p 

-0.42 

-0.50 

-0.50 

5s4s 

-1.63 

-1.69 

-1.70 

5s3s 

-0.34 

-0.35 

-0.36 

5s2s 

-0.10 

-0.11 

-0.11 

5sls 

-0.03 

-0.04 

-0.06 

4  d4d 

-369.14 

Core 

-367.98 

-368.58 

4d3d 

-156.79 

-155.67 

-157.17 

4d4p 

-184.05 

-185.92 

-186.17 

4d3p 

-48.56 

-46.10 

-46.46 

4dtlp 

-16.89 

-16.47 

-17.19 

4d4s 

-46.87 

-46.73 

-46.82 

4d3s 

-12.02 

-10.72 

-10.78 

4d2s 

-4.14 

-3.80 

-3.98 

4d\s 

-0.19 

-0.16 

-0.19 

3d3d 

-367.74 

-368.86 

-375.35 

3d4p 

-73.50 

-77.78 

-77.83 

3d3p 

-197.60 

-197.65 

-200.61 

3  dip 

-121.33 

-118.59 

-123.27 

3d4s 

-22.81 

-26.24 

-26.23 

3d3s 

-49.23 

-50.24 

-50.84 

3d2s 

-30.97 

-28.20 

-29.37 

3d\s 

-1.23 

-1.06 

-1.23 

4p4p 

-53.24 

-53.84 

-53.96 

4p3p 

-30.69 

-31.21 

-31.53 

4p2p 

-11.48 

-12.09 

-12.77 

{Continued) 
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Table  II.  ( Continued ) 


Pair  energy 


Pair  NR  DC  DCB 


Core 


ApAs 

-26.40 

Apis 

-7.77 

Apis 

-2.74 

Ap\s 

-1.09 

3p3p 

-55.53 

3p2p 

-55.66 

3pAs 

-9.35 

3p3s 

-27.21 

3pls 

-12.95 

3 pis 

-5.71 

2p2p 

-131.34 

IpAs 

-2.99 

2p3s 

-11.53 

Ipls 

-44.12 

lp\s 

-31.09 

4s4s 

-4.64 

As3s 

-2.47 

Asls 

-0.73 

As\s 

-0.24 

3s3s 

-4.57 

3s2s 

-3.28 

3  sis 

-1.11 

Isis 

-7.45 

2  sis 

-5.31 

Isis 

-24.89 

-27.30 

-27.36 

-7.42 

-7.46 

-2.79 

-2.90 

-1.14 

-1.43 

-56.11 

-57.31 

-58.15 

-62.03 

-10.37 

-10.43 

-27.39 

-27.83 

-13.14 

-13.75 

-5.96 

-7.46 

-135.07 

-146.66 

-3.46 

-3.50 

-13.06 

-13.41 

-45.94 

-49.63 

-31.67 

-38.54 

-4.77 

-4.79 

-2.49 

-2.53 

-0.78 

-0.84 

-0.28 

-0.42 

-4.59 

-4.71 

-3.46 

-3.80 

-1.30 

-2.00 

-7.73 

-8.77 

-6.08 

-9.75 

-24.06 

-41.69 

correlation,  which  arises  from  the  use  of  relativistic  single-particle  states  but  the 
same  two-body  interaction,  i.e.,  the  instantaneous  Coulomb  interaction.  The  bulk 
of  the  relativistic  correction  to  the  correlation  energy  comes  from  the  inclusion  of 
the  Breit  interaction  in  the  two-body  interaction,  i.e.,  from  the  use  of  the  DCB 
Hamiltonian.  This  relativistic  correction  is  0.103  a.u.,  which  represents  4%  of  the 
total  correlation  energy,  —2.6956  a.u. 

In  recent  years,  quasi-relativistic  effective  core  potential  calculations  have  been 
performed  on  diatomic  Xe2  and  XeCl  [21,22].  In  these  calculations,  the  55  and 
5 p  orbitals  of  Xe  were  taken  to  be  the  valence  orbitals.  All  the  remaining  orbitals 
were  treated  as  core,  and  were  replaced  by  a  set  of  effective  core  potentials  for 
valence-only  calculations.  The  present  study  correlates  all  electrons.  If  the  55  and 
5 p  are  called  valence  spinors  and  the  rest  are  core  spinors,  the  total  of  66  pair 
correlation  energies  for  ground-state  Xe  may  be  partitioned  into  valence,  core¬ 
valence,  and  core  contributions.  Table  II  gives  the  pair  correlation  energies  parti¬ 
tioned  in  this  manner.  The  nr,  DC,  and  dcb  pair  energies  were  computed  from 
the  corresponding  wave  functions.  The  results  indicate  clearly  that  the  largest  cor- 
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relation  contributions,  both  relativistic  and  nonrelativistic,  come  from  the  4d4d 
and  3d3  d  pairs,  followed  by  other  pairs  which  involve  4 d  and  3  d  spinors  (e.g., 
4d4p,  4d3d,  3d4p,  3d3p).  This  may  easily  be  understood,  because  the  4 d  and 
3d  shells  each  include  ten  electrons,  and  the  dynamical  correlation  among  these 
electrons  is  not  accounted  for  in  the  SCF  step.  The  relativistic  corrections  to  the 
pair  correlation  energies,  i.e.,  the  difference  between  the  relativistic  and  the  non¬ 
relativistic  pair  correlation  energies,  are  primarily  associated  with  the  inner- 
shell  pairs. 

The  valence,  core-valence,  and  core  contributions  of  the  NR,  DC,  and  dcb  pair 
correlation  energies,  as  well  as  the  nonadditive  relativistic  corrections,  are  tabulated 
in  Table  III.  The  valence  and  core-valence  contributions  of  the  dcb  pair  energies 
are  4%  and  8%,  respectively,  of  the  overall  dcb  correlation  energy  of  Xe,  -2.6956 
a.u.  The  valence  contribution  of  the  nonadditive  relativistic  correction,  0.7  milli- 
hartree,  accounts  for  only  0.7%  of  the  overall  nonadditive  energy,  0.103  a.u.;  it  is 
also  0.7%  of  the  valence  pair  correlation  energy,  -0.1044  a.u.  This  shows  that 
relativity  and  correlation  effects  are  additive  in  the  valence  shell  of  Xe.  The  non¬ 
additive  contribution  comes  primarily  from  the  pair  correlations  in  the  inner-shell 


Table  III.  Core,  core- valence,  and  valence 
contributions  of  pair  and  nonadditive  energies 
(in  a.u.).  [\As\3>p\0dljhg5h\  basis  set. 


Pair  energy  Nonadditive  energy3 

Core 

NR 

-2.2786 

DC 

-2.2938 

0.0152 

DCB 

-2.3694 

0.0908 

Core- Valence 

NR 

-0.2103 

DC 

-0.2217 

0.0114 

DCB 

-0.2218 

0.0115 

Valence 

NR 

-0.1037 

DC 

-0.1044 

0.0007 

DCB 

-0.1044 

0.0007 

Total 

NR 

-2.5926 

DC 

-2.6199 

0.027 

DCB 

-2.6956 

0.103 

a  Relativistic  correction  to  pair  correlation 
energy. 
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core.  The  core  contribution  of  the  nonadditive  relativistic  correction  accounts  for 
88%  of  the  total  nonadditive  energy.  Although  the  relativistic  correction  of  the 
valence  pair  correlation  is  negligible,  the  inclusion  of  the  Breit  interaction  may  be 
important  in  accounting  for  fine-structure  splittings  of  open-shell  systems  [11-13]. 

Table  II  shows  that  the  nonadditive  relativistic  corrections  in  a  number  of  DC 
pair  energies  in  the  core  shell  are  of  comparable  magnitude  but  with  opposite  sign. 
Because  of  the  large  cancellation  of  the  nonadditive  relativistic  corrections  in  the 
core  shell,  the  overall  nonadditive  energy  of  the  DC  correlation  energy  remains 
small.  Because  of  the  absence  of  such  cancellation,  the  overall  nonadditive  relativistic 
correction  of  the  dcb  correlation  energy  is  an  order  of  magnitude  larger  than  that 
of  the  DC  correlation  energy. 
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Abstract 

Topological  properties  of  the  charge  density  p(7)  of  a  series  of  diatomic  molecules,  as  well  as  ethane, 
ethene,  and  acetylene  are  calculated  at  the  Hartree-Fock  level  employing  various  basis  sets,  and  by  the 
amI  method.  The  effect  of  the  core  orbitals  on  the  bonding  regions  in  these  molecules  is  examined.  The 
results  help  to  evaluate  the  utility  of  AMl  wavefunctions  for  analyzing  the  topological  properties  of  the 
charge  density.  ©  1994  John  Wiley  &  Sons,  Inc. 


The  Theory  of  Atoms  in  Molecules 

The  theory  of  atoms  in  molecules,  proposed  by  Bader  and  co-workers  [1],  has 
been  proven  useful  for  predicting  and  explaining  many  chemical  properties.  The 
theory  is  based  on  the  topology  of  the  electronic  charge  density  p(  r ),  stated  in  terms 
of  paths  traced  by  its  gradient  vectors  Vp(r).  These  vectors  point  in  the  directions 
of  the  steepest  ascent  of  p(r),  and  the  collection  of  gradient  paths  associated  with 
a  charge  distribution  is  called  the  gradient  vector  field.  The  gradient  vectors  partition 
the  molecule  into  subsystems  bounded  by  the  so-called  zero-flux  surfaces  S (r), 
which  satisfy  the  equation 

Vp(r)  •«(?)  =  0  V  r  E  S(7) ,  (1) 

where  n( r)  denotes  the  unit  vector  normal  to  the  surface  S(r). 

Smith  et  al.  [2]  and  Collard  and  Hall  [3]  have  shown  that  main  characteristics 
of  a  molecular  structure  are  exhibited  at  maxima  (peaks),  minima  (pits),  or  saddle 
points  (passes  or  pales)  of  the  charge  distribution.  At  these  points,  known  as  the 
critical  points,  the  gradient  vector  field  Vp(7)  vanishes.  Critical  points  may  be  clas¬ 
sified  according  to  their  rank  and  signature.  The  rank  of  a  critical  point  equals  the 
number  of  nonzero  eigenvalues  X,  of  the  Hessian  matrix  of  p( r)  at  that  point.  The 
signature  is  the  difference  between  the  number  of  positive  and  negative  eigenvalues. 
For  critical  points  of  rank  three,  designated  as  (rank,  signature ),  (3,  +3)  called  pits, 
(3,  +1)  pales,  (3,  -1)  passes,  and  (3,  -3)  peaks.  The  total  number  of  each  type  of 
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critical  point  for  a  system  of  a  finite  number  of  nuclei  is  described  topologically  by 
the  so-called  Poincare-Hopf  relation; 

n-b  +  r-  c=  1  ,  (2) 


where  n ,  b,  r,  and  c  denote  the  numbers  of  peaks,  passes,  pales,  and  pits,  respectively. 

If,  in  a  molecule,  p( f)  shows  a  saddle  point  when  viewed  in  every  plane  containing 
the  internuclear  axis,  then  there  is  a  critical  point  at  which  p( 7)  is  a  minimum  with 
respect  to  the  direction  along  the  line  of  the  nuclei,  and  a  maximum  with  respect 
to  all  directions  perpendicular  to  this  line  (i.e.,  a  pass).  Such  a  critical  point  is  called 
a  bond  critical  point . 

The  occurrence  of  a  bond-critical  point  in  the  region  between  two  centers  is  a 
necessary  condition  for  the  existence  of  a  bond  [4,5].  There  are  two  and  only  two 
gradient  paths  that  originate  at  the  bond  critical  point  and  terminate  at  the  neigh¬ 
boring  peaks.1  The  value  of  p(r)  along  this  path  is  maximal  with  respect  to  two 
independent  perpendicular  directions.  This  is  a  definition  of  a  bond  path.  The 
existence  of  a  bond  path  is  a  necessary  and  sufficient  condition  for  the  existence  of 
a  bond. 

The  bond  ellipticity  e  shows  the  extent  to  which  the  electronic  charge  is  prefer¬ 
entially  accumulated  in  a  given  plane  [6].  It  is  calculated  as 

*  =  Xj/X2  -  1  ,  (3) 


where  Xj  and  X2  are  the  magnitudes  of  the  negative  curvatures  along  the  axis  per¬ 
pendicular  to  the  bond  path,  which  indicate  the  rates  of  decrease  of  p(r). 

Associated  with  the  charge  density  is  its  Laplacian  V2p(r).  The  Laplacian  of  a 
function  is  a  measure  of  its  lumpiness  [7].  It  is  the  sum  of  the  curvatures  in  the 
electron  density  along  any  three  orthogonal  coordinate  axes  at  the  point  r : 


V2p(r)  = 


dV?)  d2p(7)  d2P<7) 

dx1  dy2  dz1 


(4) 


The  sign  of  V2p(?)  indicates  whether  the  charge  density  is  locally  depleted 
[V2p(7)  >  0]  or  locally  concentrated  [V2p(?)  <  0], 

The  location  of  the  bond  critical  point,  and  the  value  of  the  Laplacian  of  the 
charge  density  at  this  point  have  been  shown  to  be  useful  in  the  study  of  molecular 
electronic  structure  [1,8-11]. 


Valence-Orbital  Wavefunctions  in  AMl 

Topological  analysis  usually  uses  electron  densities  calculated  from  quantum 
mechanical  methods,  such  as  the  Hartree-Fock  method  (SCF).  Thus  it  naturally 
inherits  any  problems  associated  with  these  methods.  One  of  the  commonly  known 
drawbacks  of  ab  initio  methods  is  the  computational  time  and  memory  requirements 
and  hence  their  cost.  As  a  result,  despite  their  accurate  description  of  one  electron 


1  With  the  exception  of  structures  containing  bifurcation  or  conflict  catastrophe  points,  which  vanish 
upon  infinitesimal  geometrical  changes. 
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properties,  there  have  been  few  ab  initio  calculations  performed  on  large  molecules. 
On  the  other  hand,  although  there  are  many  calculations  at  the  semiempirical  level 
on  large  systems,  hitherto  no  one  has  used  wavefunctions  obtained  from  semi¬ 
empirical  calculations  to  study  topological  properties.  We  shall  consider  one  such 
method,  namely,  the  amI  method  [12], 

In  this  method,  energy  contributions  arising  from  core  electrons  and  their  inter¬ 
actions  with  valence  electrons  are  represented  only  in  a  parameterized  form.  As  a 
consequence,  no  explicit  charge  distribution  associated  with  these  electrons  can  be 
given.  The  resulting  amI  wavefunctions,  therefore,  contain  only  valence  orbitals. 
In  Figure  1,  the  charge  density  of  a  substituted  oxirane,  computed  by  the  amI 
method,  clearly  exhibits  local  minima  (i.e.,  pits)  at  the  positions  of  the  nuclei.  This 
replacement  of  a  peak  by  a  pit  at  the  site  of  every  heavy  atom  also  alters  the  number 
of  passes  and  pales  present  in  the  molecular  density.  For  example,  if  in  a  given 
density  one  peak  becomes  a  pit,  and  the  number  of  (3,  —1)  critical  points  is  to 
remain  constant,  the  Poincare-Hopf  relation  [Eq.  (2)]  forces  the  occurrence  of  two 
additional  pales.  If  more  atoms  without  core  are  present,  and  the  condition  of 
constant  number  of  passes  is  relaxed,  the  picture  becomes  rapidly  very  complex. 
A  detailed  topological  analysis  of  such  a  charge  distribution  is  not  warranted. 


Figure  1.  Charge  density  of  a  substituted  oxirane  computed  by  the  amI  method.  Note 
the  local  minima  at  the  positions  of  the  nuclei. 
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An  example  for  such  a  rather  complicated  pattern  of  peaks  and  pales  is  given  in 
Figure  1. 

This  exclusion  of  the  inner-shell  orbitals  raises  the  obvious  question  of  how  the 
topological  properties  of  the  molecule  are  affected.  To  our  knowledge,  there  have 
been  no  investigations  of  this  and  the  related  problem  of  wavefunctions  using  pseu¬ 
dopotentials.  The  impact  of  such  a  study  would  nevertheless  be  important,  since 
it  could  extend  the  molecular  size  for  the  topological  analysis.  The  average  bioorganic 
molecule  is  in  general  out  of  reach  of  ab  initio  calculations.  Unless  one  can  use 
am  1  or  other  semiempirical  wavefunctions  to  study  topological  properties,  the  theory 
of  atoms  in  molecules  seems  to  be  of  little  practical  use  for  organic  chemistry  at 
the  present  time. 

In  this  paper,  we  study  the  electronic  behavior  of  a  series  of  diatomic  molecules, 
as  well  as  ethane,  ethene,  and  acetylene  at  the  HF  level  with  the  STO-3G,  3-2 1G, 
6-3 1 G  basis  sets,  and  employing  the  am  1  method.  The  calculations  were  performed 
using  the  GAUSSIAN86  package  [13],  and  the  topological  analysis  was  done  with 
the  PROAIM  program  [14].  Both  programs  were  implemented  on  an  IBM  3081 
computer. 

The  wavefunctions  of  the  ground  states  of  the  diatomics  H2,  NH,  OH,  HF,  HC1, 
CH,  CN,  CO,  CF,  CS,  CC1  and  the  polyatomics  C2H6,  C2H4,  and  C2H2  were  cal¬ 
culated.  The  bond  distances  and  ground  state  configurations  of  the  diatomic  mol¬ 
ecules  and  the  polyatomic  molecules  shown  in  Table  I  were  taken  from  Hubert 
and  Herzberg  [15]  and  Herzberg  [16],  respectively. 

To  examine  fully  the  contribution  of  the  inner-shell  orbitals  to  the  bonding  region 
and  especially  to  the  bond  critical  point,  these  orbitals  were  removed  from  the  ab 
initio  wavefunctions  (by  deleting  the  MO  coefficients  of  the  “core  orbitals”  from 


Table  I.  Input  data  for  the  molecules  treated  in  this  paper. 


MO  ground  state 
configuration 

Bond  distance 
(«o) 

‘H2 

1 2? 

1.4010 

I4n‘h 

32" 

1.9581 

160'H 

Tl, 

1.8325 

'HI9F 

is+ 

1.7325 

1H35C1 

!2+ 

2.4086 

l2C'H 

2nr 

2.1163 

I2C14N 

2s+ 

2.2144 

12C160 

'2+ 

2.1322 

12C19F 

2nr 

2.4034 

12C32S 

*2+ 

2.9005 

12C35CI 

2n1/2 

3.1086 

c2h6 

'A|£ 

2.9026  (C— C) 

c2h4 

‘A* 

2.5303  (C— C) 

c2h2 

2.2828  (C— C) 
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the  total  wavefunctions).  The  topological  properties  of  the  remaining  wavefunctions 
were  then  compared  with  the  original  and  the  AM  1  wavefunctions. 


Results  and  Discussion 

The  numerical  results  of  the  topological  analysis  of  the  considered  molecules  are 
summarized  in  Tables  II- V.  The  tables  list  for  all  basis  sets  employed  in  the  HF 
framework,  as  well  as  for  the  AMl  description,  locations  of  bond  critical  points, 
charge  densities,  Laplacians  of  p,  and  the  eigenvalues  of  the  Hessian.  Ab  initio 
calculations,  from  which  the  core  contributions  have  been  removed,  are  denoted 
by  the  letter  C  added  to  the  basis  set  specification. 

In  general,  one  may  say  that  the  AMl  description  of  the  topological  properties 
in  the  bonding  region  is  quite  similar  to  those  of  the  ab  initio  calculations  (see,  e.g., 
Fig.  2).  For  the  region  behind  the  nuclei,  in  particular,  the  AMl  results  show  excellent 
agreement  with  the  6-3 1G  results.  This  region  behind  the  nuclei  is  responsible  for 
nonbonded  interactions,  which  are  of  essential  importance  for  conformational 
analysis. 

For  heterodiatomic  systems,  the  bond  critical  point  locations  found  from  the 
AMl  results  deviate  on  average  by  0.08  a0  from  those  found  by  6-3 1G  calculations 
(see  Tables  II-IV).  The  STO-3G  and  3-2 1G  results  agree  only  marginally  better  with 


z/ao 

Figure  2.  Charge  densities  along  the  H — Cl  bond  at  the  Hartree-Fock  level  with  various 
basis  sets,  and  employing  the  AMl  method.  The  z  axis  is  the  bond  coordinate  and  the 
origin  is  arbitrarily  set  at  the  center  of  mass.  Empty  circles:  HF/6-31G;  empty  squares: 
HF/3-21G;  triangles:  HF/STO-3G;  full  circles:  AMl. 
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Table  II.  Topological  properties  of  various  diatomic  hydrides.3 


Location  of 
Pb{a  o) 

Pb 

(ea^) 

V2p 

(ea05) 

A. 

(eao5) 

X2 

(eao5) 

a3 

(eao5) 

Hydrogen 

AMl 

0.7006 

0.25253 

-0.82702 

-0.96985 

-0.96985 

1.11268 

sto-3G 

0.7006 

0.25560 

-0.83708 

-0.98165 

-0.98165 

1.12622 

3-2 1G 

0.7006 

0.23854 

-0.67781 

-0.78780 

-0.78780 

0.89778 

6-3 1G 

0.7006 

0.24339 

-1.05082 

-0.86507 

-0.86507 

0.67931 

Nitrogen  hydride 

AMl 

1.2968 

0.32606 

-1.02751 

-0.97632 

-0.97632 

0.92512 

STO-3G 

1.2805 

0.31351 

-0.99884 

-0.91211 

-0.91211 

0.82539 

sto-3GC 

1.2803 

0.31348 

-0.99958 

-0.91162 

-0.91162 

0.82365 

3-2 1G 

1.3755 

0.29789 

-0.87221 

-1.01904 

-1.01904 

1.16588 

3-2 1GC 

1.3749 

0.29782 

-0.87498 

-1.01741 

-1.01741 

1.15984 

6-3 1G 

1.3703 

0.30694 

-1.23066 

-1.01650 

-1.01650 

0.80235 

6-3 1GC 

1.3701 

0.30692 

-1.23085 

-1.01624 

-1.01624 

0.80163 

Hydrogen  oxide 

AMl 

1.4654 

0.37380 

-3.09293 

-1.78480 

-1.73510 

0.42761 

sto-3G 

1.3636 

0.36608 

-2.18105 

-1.55627 

-1.47450 

0.84975 

STO-3GC 

1.3635 

0.36608 

-2.18085 

-1.55608 

-1.47430 

0.84953 

3-2 1G 

1.3697 

0.33559 

-1.35684 

-1.46282 

-1.39111 

1.49708 

3-2 1GC 

1.4637 

0.32551 

-1.35784 

-1.46237 

-1.39063 

1.49576 

6-3 1G 

1.4108 

0.34164 

-1.62624 

-1.53484 

-1.47405 

1.38266 

6-3 1GC 

1.4108 

0.34163 

-1.62639 

-1.53470 

-1.47390 

1.38221 

Hydrogen  fluoride 

AMl 

sto-3G 

3-2 1G 

b 

b 

1.3649 

0.35189 

-2.24320 

-1.76069 

-1.76069 

1.27818 

6-3 1G 

1.3979 

0.35177 

-1.85908 

-2.00307 

-2.00307 

2.14706 

Hydrogen  chloride 

AMl 

1.6018 

0.21156 

-0.34857 

-0.38588 

-0.38588 

0.43201 

STO-3G 

1.4755 

0.20647 

-0.37336 

-0.13998 

-0.13998 

0.00886 

STO-3GC 

1.4290 

0.20573 

-0.44237 

-0.22915 

-0.22915 

0.01593 

3-2 1G 

1.6510 

0.20772 

-0.36109 

-0.44593 

-0.44593 

0.53078 

3-2 1GC 

1.6578 

0.20714 

-0.36943 

-0.4405 1 

-0.44051 

0.51160 

6-3 1G 

1.6032 

0.20999 

-0.39463 

-0.43980 

-0.43980 

0.48497 

6-3 1GC 

1.5990 

0.20976 

-0.40702 

-0.43547 

-0.43547 

0.46393 

a  The  location  of  the  bond  critical  point  is  the  distance  from  the  heavier  atom. 
b  No  bond  critical  point. 


the  6-3 1G  results.  For  the  symmetric  bonds,  the  locations  of  the  bond  critical  points 
are  of  course  identical  for  AMl  and  the  ab  initio  ones.  The  AMl  values  of  p(j)  at 
the  bond  critical  point  vary  by  an  average  of  0.03  eao 3  from  the  corresponding  6- 
3 1G  calculations.  The  worst  case  is  acetylene  where  the  difference  is  0.07  eao 3  (see 
Table  V).  AMl  correctly  predicts  the  sign  of  the  Laplacian  of  the  charge  density  at 
the  bond  critical  points.  The  magnitudes,  however,  deviate  from  6-3 1G  results  by 
a  large  average  of  0.4  eao  \  where  the  worst  case  (OH,  Table  II)  shows  a  difference 
of  1.5  eao5.  The  topological  shape  of  the  charge  density  in  the  bonding  region 
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Table  III.  Topological  properties  of  heteronuclear  diatomics  containing  carbon  and  a  first-row  atom 

and  carbon  hydride.3 


Location  of 
Pb  («o) 

Pb 

(^o3) 

V2p 

(eao5) 

X, 

(ea05) 

X2 

(^o5) 

x3 

(ea05) 

Carbon  hydride 

AMl 

1.1977 

0.23955 

-0.52311 

-0.60319 

-0.45026 

0.53034 

sto-3G 

1.1861 

0.24256 

-0.62518 

-0.60831 

-0.44947 

0.43259 

sto-3GC 

1.1801 

0.24233 

-0.65427 

-0.60529 

-0.44329 

0.39431 

3-2 1G 

1.3391 

0.24516 

-0.74125 

-0.65273 

-0.57224 

0.48372 

3-2 1GC 

1.3359 

0.24488 

-0.74780 

-0.65013 

-0.56886 

0.47119 

6-3 1G 

1.3118 

0.24641 

-0.70185 

-0.67555 

-0.59111 

0.56848 

6-3 1GC 

1.3106 

0.24633 

-0.70782 

-0.67455 

-0.58977 

0.55650 

Carbon  nitride 

AMl 

sto-3G 

b 

1.4662 

0.36712 

1.17349 

-0.45775 

-0.45775 

2.08897 

STO-3GC 

3-2 1G 

b 

1.4392 

0.43301 

-0.22856 

-0.95677 

-0.95677 

1.68498 

3-2 1GC 

6-31G 

b 

1.4473 

0.43131 

-0.07886 

-0.87077 

-0.87077 

1.66269 

6-3 1GC 

Carbon  monoxide 
AMl 

sto-3G 

b 

b 

1.4323 

0.37405 

2.46638 

-1.14616 

-1.14616 

4.75872 

sto-3GC 

3-2 1G 

b 

1.3983 

0.44397 

1.03454 

-1.53316 

-1.53316 

4.10086 

3-2 1GC 

6-3 1G 

b 

1.4124 

0.45084 

0.87288 

-1.54174 

-1.54174 

3.95635 

6-3 1GC 

Carbon  fluoride 

AMl 

sto-3G 

b 

b 

1.6298 

0.23707 

1.45720 

-0.71725 

-0.18512 

0.23596 

STO-3GC 

3-2 1G 

b 

1.6054 

0.26082 

0.50262 

-0.75863 

-0.35414 

1.61539 

3-2 1GC 

6-3 1G 

b 

1.6153 

0.25738 

0.74929 

-0.81262 

-0.36168 

1.29235 

6-3 1GC 

b 

3  The  location  of  the  bond  critical  point  is  the  distance  from  the  heavier  atom. 
b  No  bond  critical  point. 


expressed  by  the  ellipticity  e  is  in  general  also  correctly  calculated  by  amI.  Note 
that  for  2  states,  e  is  zero,  and  for  II  states,  it  has  a  finite  value. 

As  one  may  easily  see  from  Figure  3,  the  removal  of  the  core  orbitals  does  not 
significantly  affect  the  electronic  description  at  the  bond  critical  point.  The  topo¬ 
logical  properties  of  CH,  OH,  HC1,  CC1,  and  C2H6  from  the  valence  ab  initio  wave- 
functions  are  very  similar  to  their  full  counterparts  (also  compare  Tables  II  and 
IV).  For  the  case  of  CN,  CO,  CF,  and  CS,  the  absence  of  the  core  orbitals  makes 
the  bond  critical  points  disappear  for  both  the  AMl  wavefunctions  and  the  valence 
ab  initio  ones.  This  problem  is,  however,  not  due  to  the  inability  of  AMl  to  calculate 
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Table  IV.  Topological  properties  of  heteronuclear  diatomics  containing  carbon  and  a  second-row 

atom.3 


Location  of 

Pb 

V2p 

A, 

X2 

x3 

Pb  (<Zo) 

(eao3) 

(eao5) 

(eao5) 

(eao5) 

(eaZ5) 

Carbon  sulfide 
AMl 


ST0-3G 

1.6503 

0.26185 

-0.61940 

-0.31901 

-0.31901 

0.01863 

STO-3GC 

b 

3-2 1G 

1.4503 

0.25840 

-0.68574 

-0.36361 

-0.36361 

0.04148 

3-2 1GC 

b 

6-3 1G 

1.1553 

0.24066 

0.20843 

-0.25107 

-0.25107 

0.71057 

6-3 1GC 

Carbon  chloride 

b 

AMl 

1.9048 

0.20166 

-0.23800 

-0.30518 

-0.20420 

0.27137 

sto-3G 

1.9609 

0.19964 

-0.31141 

-0.31928 

-0.18731 

0.19518 

STO-3GC 

1.9871 

0.19920 

-0.36976 

-0.32120 

-0.17376 

0.12520 

3-2 1G 

1.8186 

0.19761 

-0.28159 

-0.33790 

-0.28013 

0.33643 

3-2 1GC 

1.8201 

0.19692 

-0.29187 

-0.33589 

-0.27771 

0.32172 

6-3 1G 

1.7854 

0.20379 

-0.48777 

-0.34185 

-0.29363 

0.38671 

6-3 1GC 

1.7861 

0.20365 

-0.57724 

-0.34128 

-0.29288 

0.37643 

3  The  location  of  the  bond  critical  point  is  the  distance  from  the  heavier  atom. 
b  No  bond  critical  point. 


the  charge  density  in  the  bonding  region  of  these  molecules,  or  to  the  direct  con¬ 
tribution  of  the  core  orbitals  to  the  charge  density  in  this  region.  The  bond  critical 
point  is  the  minimum  in  the  charge  density  along  the  internuclear  path.  The  short 
bond  lengths  plus  the  high  atomic  charges  of  the  N,  O,  F,  and  S  atoms  suppress 
these  minima  in  CN,  CO,  CF,  and  CS  and  thus  pose  a  systematic  problem  for  the 
am  1  (Fig.  4),  as  well  as  for  the  valence  ab  initio  wavefunctions. 

Fortunately,  the  problem  does  usually  not  occur  in  all  heterodiatomic  systems 
with  short  bond  lengths  and  large  differences  in  atomic  charges.  For  instance,  in 
various  cases  of  hydrides  such  as  OH,  HC1,  CH,  and  the  CC1  molecule,  the  am  1 
results  are  in  excellent  agreement  with  the  6-3 1 G  Hartree-Fock  calculations  (see 
Tables  III  and  V).  To  further  assess  the  problem,  we  have  also  computed  the  to¬ 
pological  properties  of  various  polyatomic  molecules.  The  problem  seems  to  persist 
in  the  CO  fragment  in  methanol,  the  CO  double  bond  in  formic  acid,  and  the  CN 
triple  bond  in  hydrogen  cyanide.  On  the  other  hand,  systems  containing  COC 
fragments,  such  as  oxirane  or  diethyl  ether,  and  the  CN  single  bonds,  such  as  in 
methyl  amine,  show  perfectly  normal  behavior  at  their  am  1  bond  critical  points. 

A  possible  solution  to  the  problem  of  the  absence  of  some  bond  critical  points 
for  the  am  1  wavefunctions  is,  of  course,  to  add  the  missing  core  orbitals.  As  an 
example,  we  have  augmented  the  am  1  density  of  carbon  monoxide  with  the  C  and 
O  core  contributions  from  atomic  wavefunctions  provided  by  Clementi  and  Roetti 
[17].  The  result  in  Figure  4  shows  that  the  augmented  AMl  charge  density  is  much 
closer  to  that  of  the  6-3 1G  than  the  STO-3G  density.  Again,  core  orbitals  show 
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Table  V.  Topological  properties  of  C — C  in  ethane,  ethene  and  acetylene.3 


Location  of 

Pb  («o) 

Pb 

(■ eao 3) 

V2p 

(ea05) 

a. 

X2 

(<rao5) 

X3 

(eao5) 

Ethane 

AMl 

1.4513 

0.23644 

-0.58653 

-0.39990 

-0.39990 

0.21327 

sto-3G 

1.4513 

0.24177 

-0.65504 

-0.40540 

-0.40540 

0.15576 

sto-3GC 

1.4513 

0.24172 

-0.65744 

-0.40524 

-0.40524 

0.15304 

3-2 1G 

1.4513 

0.22065 

-0.54146 

-0.37476 

-0.37476 

0.20806 

3-2 1GC 

1.4513 

0.22035 

-0.54775 

-0.37346 

-0.37346 

0.19917 

6-3 1G 

1.4513 

0.22274 

-0.40251 

-0.37221 

-0.37221 

0.34190 

6-3 1GC 

1.4513 

0.22269 

-0.40513 

-0.37196 

-0.37196 

0.33879 

Ethene 

AMl 

1.2652 

0.29948 

-0.65734 

-0.52093 

-0.28900 

0.15257 

sto-3G 

1.2652 

0.31372 

-0.83554 

-0.54561 

-0.31807 

0.02813 

3-2 1G 

1.2652 

0.32302 

-0.99444 

-0.64214 

-0.51912 

0.16682 

6-3 1G 

1.2652 

0.32295 

-0.85465 

-0.64049 

-0.52448 

0.31031 

Acetylene 

AMl 

1.1414 

0.31549 

-0.34113 

-0.20498 

-0.20498 

0.06883 

STO-3Gb 

0.8668 

0.34422 

-0.15532 

-0.25744 

-0.25744 

0.35956 

3-2 1G 

1.1414 

0.39123 

-0.16063 

-0.63775 

-0.63775 

0.11486 

6-3 1G 

1.1414 

0.38807 

-1.18456 

-0.60201 

-0.60201 

0.05586 

a  The  location  of  the  bond  critical  point  is  the  distance  from  either  of  the  C  atoms. 
b  A  (3,  -3)  critical  point  is  found  at  the  bond  midpoint,  1.1414  a0  away  from  each  carbon  atom.  The 
(3,-1)  critical  point  is  reported  here. 


insignificant  effects  on  the  charge  density  in  the  bonding  region  as  well  as  in  the 
regions  behind  the  nuclei,  but  reintroduce  a  bond  critical  point,  and  thus  make  the 
AM  1  density  accessible  to  a  more  complete  topological  analysis.  Similar  behaviors 
are  found  with  the  augmented  am  1  densities  in  the  cases  of  the  CO  bond  in  methanol 
and  formic  acid,  the  CN  bond  in  CN  molecule  and  in  hydrogen  cyanide,  and  in 
the  CS  and  CF  molecules. 

The  exception  is  the  HF  molecule,  in  which  even  the  augmented  AM  1  density 
does  not  exhibit  a  bond  critical  point.  In  this  case,  core  orbitals  were  only  added 
to  the  fluorine  atom,  since  hydrogen  does  not  have  a  core.  The  problem  is  rather 
due  to  the  inappropriate  description  of  the  charge  density  in  the  vicinity  of  the 
hydrogen  nucleus.  For  other  hydrides,  this  insufficiency,  expressed  in  a  too  “flat” 
hydrogenic  density,  is  not  very  relevant  for  the  intemuclear  region.  For  HF,  however, 
the  electronegative  fluorine  atom  contracts  the  charge  to  such  an  extent  that  the 
hydrogen  peak  almost  degenerates  to  a  shoulder.  The  resulting  bond  critical  point 
is  therefore  quite  faint,  and  vanishes  altogether  if  the  basis  set  is  not  flexible  enough. 
This  phenomenon  occurs  as  well  in  the  HF/STO-3G  description,  although  the  latter 
includes  the  core  by  default.  The  fact  that  amI  and  HF/STO-3G  hydrogen  densities 
show  great  similarities  is  very  well  demonstrated  by  the  case  of  the  hydrogen  mol¬ 
ecule.  One  can  see  from  Table  II  that  the  topological  properties  at  the  bond  critical 
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z/a0 

Figure  3.  Total  (circles)  and  valence  (squares)  charge  density  along  the  C — C  bond  of 
ethane  at  the  HF/6-3 1 G  level.  The  z  axis  is  the  bond  coordinate  and  the  origin  is  arbitrarily 
set  at  the  bond  midpoint. 


z/a0 

Figure  4.  Charge  density  of  carbon  monoxide  along  the  bond  axis  at  the  HF/6-3 1G 
(empty  circles),  HF/STO-3G  (empty  squares),  AMl  (triangles)  levels,  and  at  the  AMl  level 
augmented  with  atomic  core  contributions  (full  circles).  The  z  axis  is  the  bond  coordinate 
and  the  origin  is  arbitrarily  set  at  the  center  of  mass. 
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point  from  the  amI  density  are  only  marginally  closer  to  the  6-3 1G  results  than 
those  of  the  STO-3G. 

In  the  case  of  C2H2 ,  Wiberg  and  co-workers  [  1 8]  reported  a  (3,  —3)  critical  point 
(i.e.,  a  local  maximum  in  the  charge  density)  at  the  center  of  the  CC  bond  for  the 
6-3 1G*  and  6-3 1G**  basis  sets  on  a  Hartree-Fock  level  Both  Wiberg  and  Gatti 
et  al.  [19]  showed  that  these  nonnuclear  peaks  disappeared  with  more  flexible  basis 
sets  and/or  electron  correlation.  We  found  that  the  maximum  also  occurs  with  the 
STO-3G  basis  set,  but  not  for  the  (medium-quality)  6-3 1G  or  3-2 1G  bases.  Again, 
the  amI  wavefunction  exhibits  similar  behavior  to  those  of  the  3-2 1G  and  6-3 1G 
split-valence  basis  sets.  According  to  the  Poincare-Hopf  relation,  the  peak  at  the 
bond  midpoint  causes  two  additional  passes  on  the  molecular  axis.  The  latter  are 
reported  in  Table  V.  For  a  more  recent  discussion  of  these  “pseudo-atoms”  in  other 
chemical  systems,  see,  e.g.,  [20]  and  references  therein. 

Conclusion 

In  summary,  results  from  various  systems  indicate  that  the  absence  of  the  core 
orbitals  results  in  relatively  small  discrepancies  in  some,  and  negligible  discrepancies 
in  others  with  respect  to  the  description  of  the  electronic  properties  in  the  bonding 
region,  as  well  as  the  region  behind  the  nuclei.  However,  the  systematic  problem 
in  the  topological  analysis  of  the  am  1  wavefunctions  for  several  mostly  7r-bonded 
systems  with  short  bond  lengths  and  large  differences  in  atomic  charges  is  the  absence 
of  bond  critical  points.  This  problem  may,  however,  be  resolved  by  adding  the 
missing  core  orbitals  from  atomic  wave  functions,  which  are  cheaply  and  readily 
available. 

The  topological  properties  resulting  from  the  amI  functions  are  quite  similar 
to  those  from  ab  initio  calculations  with  various  basis  sets.  In  fact,  the  amI  re¬ 
sults  seem  to  be  more  consistent  with  Hartree-Fock  calculations  in  a  split- valence 
(6-3 1G)  basis  than  the  corresponding  scf  functions  employing  a  minimal  (sto- 
3G)  basis. 

These  results  give  rise  to  some  optimism  concerning  the  applicability  of  topological 
analysis  to  large  molecules,  for  which  calculations  on  an  ab  initio  level  are  beyond 
the  limits  of  present-day  computing.  It  may  be  speculated  that  semi-empirical 
methods,  which  represent  the  valence  regions  of  molecules  with  acceptable  accuracy, 
may  be  used  in  combination  with  atomic  core  contributions  to  give  a  description, 
even  of  extended  systems,  that  allows  the  analysis  of  the  molecules  in  terms  of  the 
charge  density,  its  critical  points,  and  its  Laplacian.  This  would  open  a  wide  field 
of  possibilities  including,  under  favorable  conditions,  solid-state  systems,  where  the 
replacement  of  core  contributions  by  “pseudo-potentials”  is  very  common. 
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Abstract 

This  article  considers  2  X  2,  3  X  3,  4  X  4,  and  5X5  Gram  determinantal  inequalities  among  <>"), 
where  r  is  the  distance  from  the  atomic  nucleus.  The  elements  of  these  determinants  involve  n  -  -1,0, 
I ;  «  =  -1,  0,  1,  2,  3;  n  =  -1, 0,  1,  2,  3,  4,  5;  and  n  =  -1,  0,  1,  2,  3,  4,  5,  6,  7.  The  2  X  2,  3  X  3,  4  X  4, 
and  5X5  determinantal  inequalities  are  used  to  obtain  lower  bound  estimates  of  (1/r)  for  atoms  of 
spherically  symmetric  charge  distributions.  The  2  X  2,  3  X  3,  and  4X4  inequalities  have  been  applied 
to  He,  Ne,  Ar,  Kr,  Xe;  Li,  Na,  K,  Rb;  and  Be,  Mg,  Ca,  Sr.  The  5X5  inequality  has  been  applied  only 
to  He,  Li,  and  Be.  It  is  found  that  the  lower  bound  values  of  (1  /r),  for  all  atoms  considered,  appear  to 
converge  to  the  quantum  mechanical  values  of  Boyd,  who  calculated  them  with  the  Roothaan-Hartree- 
Fock  wavefunctions  of  Clementi  and  Roetti.  ©  1 994  John  Wiley  &  Sons,  Inc. 


Introduction 

The  possibility  of  deriving  a  bound  on  one  expectation  value  of  a  quantity  in 
terms  of  other  expectation  values  of  the  same  quantity  via  an  inequality  is  an 
intriguing  one.  The  establishment  of  inequalities  involving  (r"),  where  r  is  the 
distance  from  the  atomic  nucleus,  permits  one  to  pose  the  question:  To  what  ac¬ 
curacy  can  one  calculate  an  expectation  value  of  r  raised  to  a  certain  power  in 
terms  of  other  expectation  values  of  r  raised  to  other  powers? 

In  the  present  work  Gram  determinantal  inequalities  are  used  to  obtain  lower 
bounds  for  ( 1/r)  from  2  X  2,  3  X  3,  4  X  4,  and  5X5  determinants,  whose  elements 
are  (r").  The  2X2  inequality  involves  (r")  with  n  =  —  1,  0,  1 .  The  3X3  inequality 
contains  ( rn )  with  n  =  -1,  0,  1,  2,  3.  The  4X4  inequality  has  elements  (r")  with 
n  =  - 1,  0,  1,  2,  3,  4,  5,  while  the  5  X  5  inequality  involves  (r°)  with  n  -  —  1,  0,  1, 
2,  3,  4,  5,  6,  7. 

The  above  description  shows  that  (1/r),  a  quantity  whose  value  is  mainly  de¬ 
termined  by  distances  close  to  the  atomic  nucleus,  is  “synthesized”  by  a  number 
of  (rw)  values  which  involve  distances  far  from  the  atomic  nucleus.  The  question 
that  arises  is  the  following:  do  the  (1/r)  values  obtained  for  a  particular  atom  from 
2  X  2,  3  X  3,  4  X  4,  and  5X5  Gram  determinantal  inequalities  converge  to  the 
quantum  mechanical  values  of  (1/r)  calculated  by  Boyd  [1]  with  the  Roothaan- 
Hartree-Fock  wavefunctions  of  Clementi  and  Roetti  [2]?  The  answer  seems  to  be 
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in  the  affirmative  for  the  spherically  symmetrical  atoms  He,  Ne,  Ar,  Kr,  Xe;  Li, 
Na,  K,  Rb;  and  Be,  Mg,  Ca,  Sr. 

Determinantal  Inequalities 

The  2X2  and  3X3  determinantal  inequalities  with  elements  (r-1),  (r°),  (r) 
and  (r°),  (r),  (r2),  (r3)  were  obtained  by  Gadre  and  Matcha  [3],  while  the 
4X4  inequality,  with  elements  (r-1),  (r°),  ( r ),  (r2),  <V3),  <74),  and  (r5),  was 
obtained  by  Csavinszky  [4].  The  5X5  inequality,  with  elements  <V~l),  </°),  <r), 
(r2),  (r3),  (r4),  (r5),  (r6),  and  (r7),  is  given  in  the  present  article.  It  is  mentioned 
here  that  the  expectation  values  (r6)  and  (r7)  have  been  calculated  by  the  author 
by  making  use  of  the  analytical  approximations  for  D(r),  given  by  Boyd  [1].  The 
derivation  of  the  determinantal  inequalities  is  based  on  the  Gram  inequality  [5], 


fxixY^x)  dx 


>  0 


fi=  1,  2,  3, 


In  Ref.  [3 ],fn(x)  was  defined  by* 

fn(x)  =  4tt r2n~lp{r)m  n=  1,  2,  3, . . . ,  N , 

where  p(r)  is  the  (position)  electron  density. 

It  is  noted  that 


p(r)  =  (1/4 ir)r  2  D(r) , 

where  D(r )  is  the  (position)  radial  electron  density,  normalized  to  unity. 
The  expectation  values  of  powers  of  r  are  obtained  [1]  from 
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The  2  X  2,  3  X  3,  4  X  4,  and  5X5  determinantal  inequalities  are  given  below 


KO 


(5) 


|<'-> 

1 

<^> 


1  (r) 

<r>  <r2> 

<r2>  <r3> 


>0 


|<o 

i 


1  <r>  <r2) 

<r>  <r2>  <r3) 

<r2>  (r3)  <r<> 

<r>  (r4<>  <r5> 


>  0 


(6) 


(V) 


y-1) 

i 

<r> 

:<r2> 

<r3) 


1  <r>  <r2)  <r3> 

<r>  <r2>  <r3>  <r4> 

<r2>  <r3>  <r4>  (r5> 

<r3>  <r4>  <r5>  <r6> 

<r4>  <r5>  <r6>  <r2> 


>0. 


(8) 


*  There  is  a  misprint  in  Ref.  [3].  The  function  f„  is  given  in  Ref  [3]  as  fn  -  [4ir p(r)2n  *]1/2  ( n  -  1,  2. 
3, . . .).  The  correct  expression  is  displayed  in  Eq.  (2)  of  the  present  article,  and  in  Eq.  (2)  of  Ref  [4]. 
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Table  I.  Lower  bound  and  quantum  mechanical  (qm)  values  (in  a.u.)  of  (r  *)  for  the  noble  gas  atoms. 


Atom 

(r'> 

2X2 

3X3 

4X4 

5X5 

QM 

He 

1.078 

1.366 

1.487 

1.548 

1.687 

Ne 

1.267 

1.754 

2.064 

— 

3.111 

Ar 

1.120 

1.808 

2.204 

— 

3.874 

Kr 

1.372 

2.138 

2.552 

— 

5.079 

Xe 

1.382 

2.189 

2.680 

— 

5.886 

I 

Putting  the  2  X  2,  3  X  3,  4  X  4,  and  5X5  determinants  in  Eqs.  (5)-(8)  equal  to 
zero,  one  can  calculate  lower  bounds  to  (1/r)  arising  from  these  determinants.  The 
( 1  / r )  values  for  the  noble  gas  atoms,  the  alkali  atoms,  and  the  alkaline  earth  atoms 
are  displayed  in  Tables  I,  II,  and  III. 

Discussion 

A  look  at  Table  I  reveals  that,  for  the  noble  gas  atom  He,  the  lower  bound  values 
of  (1/r),  calculated  from  the  2  X  2,  3  X  3,  4  X  4,  and  5X5  Gram  determinants, 
show  an  increase  as  one  goes  in  succession  from  the  2  X  2  to  the  5X5  case.  Table 
I  shows  that,  for  the  noble  gas  atoms  Ne,  Ar,  Kr,  and  Xe,  the  lower  bound  values 
of  (1/r),  calculated  from  the  2  X  2,  3  X  3,  and  4  X  4  Gram  determinants,  also 
show  an  increase  as  one  moves  from  the  2  X  2  to  the  3X3,  and  then  to  the  4X4 
case.  For  all  atoms  in  Table  I,  the  calculated  lower  bound  values  of  (1/r)  appear 
to  converge  toward  the  quantum  mechanical  (qm)  values  of  (1/r),  calculated  by 
Boyd  [1]. 

Table  II  shows  that,  for  the  Li  atom,  the  lower  bound  values  of  (1/r)  increase, 
as  one  proceeds  in  succession  from  the  2  X  2  to  the  5X5  case.  The  same  is  true 
for  Na,  K,  and  Rb,  since  the  lower  bound  values  of  (1/r)  increase  as  one  goes  from 
the  2  X  2,  3  X  3,  and  then  to  the  4  X  4  case.  The  converging  trend  of  the  lower 
bound  values  of  (1/r),  for  all  atoms  in  Table  II,  to  the  QM  values  of  (1/r),  calculated 
by  Boyd  [1],  is  again  clearly  seen. 


Table  II.  Lower  bound  and  quantum  mechanical  (qm)  values  (in  a.u.)  of  (r  !)  for  the  alkali  atoms. 


Atom 

<'•-') 

2X2 

3X3 

4X4 

5X5 

QM 

Li 

0.598 

1.334 

1.456 

1.905 

Na 

1.014 

1.461 

1.679 

— 

3.221 

K 

0.977 

1.407 

1.740 

__ 

3.942 

Rb 

1.242 

1.732 

2.122 

— 

5.126 
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Table  III.  Lower  bound  and  quantum  mechanical  (qm)  values  (in  a.u.)  of  (r  ')  for  the  alkaline  earth 

atoms. 


Atom 

<r-’> 

2X2 

3X3 

4X4 

5X5 

QM 

Be 

0.653 

1.124 

1.429 

1.593 

2.102 

Mg 

0.979 

1.526 

1.679 

— 

3.327 

Ca 

0.941 

1.459 

1.791 

— 

4.008 

Sr 

1.195 

1.784 

2.170 

— 

5.173 

Table  III  shows  the  same  converging  trend  for  the  lower  bound  value  of  (1/r) 
for  Be,  as  one  goes  in  succession  from  the  2  X  2  to  the  5X5  case.  For  Mg,  Ca, 
and  Sr  the  lower  bound  values  of  (1  /r)  seem  to  converge  also  to  the  QM  values  of 
Boyd  [1],  as  one  proceeds  from  the  2  X  2  to  the  4  X  4  case. 

Conclusion 

On  the  basis  of  the  behavior  of  the  lower  bound  values  of  (1  /r),  displayed  in 
Tables  I,  II,  and  III,  the  author  expects  further  manifestation  of  convergence  to  the 
qm  values  of  (1/r),  should  one  extend  the  calculations  by  consideration  of  6  X  6, 
7X7,...  etc.  Gram  determinantal  inequalities. 

In  closing,  it  should  also  be  mentioned  that  Weinhold  [6]  dealt  with  the  application 
of  Gram  determinants  to  some  physical  quantities. 

Yet  another  comment  should  be  made  about  two  recent  articles:  one  by  Angulo 
and  Dehesa  [7],  and  the  other  one  by  Galvez  and  Porras  [8].  The  first-article  contains 
inequalities  for  <(r")  while  the  second  contains  the  more  complicated  in¬ 
equality  <  (2Z/k)  (r*-2),  where  Z  is  the  atomic  number  and  k  =  1,  2,  3. 

(For  even  more  complicated  inequalities  among  (rk).  Ref.  [8]  should  be  consulted.) 

Bibliography 

[  1  ]  R.  J.  Boyd,  Can.  J.  Phys.  55,  452  (1977). 

[2]  E.  Clementi  and  C.  Roetti,  At.  Data  Nucl.  Data  Tables  14,  177  (1974). 

[3]  S.  R.  Gadre  and  R.  L.  Matcha,  J.  Chem.  Phys.  74,  589  (1981). 

[4]  P.  Csavinszky,  Int.  J.  Quantum  Chem.,  Quantum  Chem.  Symp.  27,  377  (1993). 

[5]  R.  Courant  and  D.  Hilbert,  Methods  of  Mathematical  Physics  (Interscience,  New  York,  1955),  Vol. 
I,  p.  62. 

[6]  F.  Weinhold,  Adv.  Quantum  Chem.  6,  299  (1972). 

[7]  J.  C.  Angulo  and  J.  S.  Dehesa,  Phys.  Rev.  A44,  1516  (1991). 

[8]  F.  J.  Galvez  and  I.  Porras,  Phys.  Rev.  A44,  144  (1991). 


Received  March  1 1,  1994 


A  New  Representation  for  Ground  States 
and  its  Legendre  Transforms 

ANDRES  CEDILLO 

Departamento  de  Qui'mica,  Universidad  Autonoma  Metropolitana-Iztapalapa, 
Ap.  Postal  86-219;  Villa  Coapa,  DF 14391;  Mexico 


Abstract 

The  ground-state  energy  of  an  electronic  system  is  a  functional  of  the  number  of  electrons  ( N )  and 
the  external  potential  (u):  E  -  E[N,  v],  this  is  the  energy  representation  for  ground  states.  In  1982, 
Nalewajski  defined  the  Legendre  transforms  of  this  representation,  taking  advantage  of  the  strict  concavity 
of  E  with  respect  to  their  variables  (concave  respect  v  and  convex  respect  N),  and  he  also  constructed 
a  scheme  for  the  reduction  of  derivatives  of  his  representations.  Unfortunately,  N  and  the  electronic 
density  (p)  were  the  independent  variables  of  one  of  these  representations,  but  p  depends  explicitly  on 
N.  In  this  work,  this  problem  is  avoided  using  the  energy  per  particle  (e)  as  the  basic  variable.  In  this 
case  c  is  a  strict  concave  functional  respect  to  both  of  his  variables,  and  the  Legendre  transformations 
can  be  defined.  A  procedure  for  the  reduction  of  derivatives  is  generated  for  the  new  four  representations 
and,  in  contrast  to  the  Nalewajski’s  procedure,  it  only  includes  derivatives  of  the  four  representations. 
Finally,  the  reduction  of  derivatives  is  used  to  test  some  relationships  between  the  hardness  and  softness 
kernels.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 


Consider  an  iV-electron  system,  under  the  influence  of  an  external  potential, 
v(r).  For  this  system  the  Hamiltonian  has  this  form: 

Hn.v  =  +  2  «(n) ,  (1) 

1=1 


where 


jy(0) 

nN 


N 

2 

i=! 


N- 1  N 

+  22 

i=\  j=i+ 1 


In  -  rj\ ' 


(2) 


If  the  operator  HN  v  has  a  nondegenerated  ground  state,  then  the  ground-state  energy 
is 


E[N,  v]  =  min  { Ek\HN/i!k  =  Ek^k }  , 

k 


(3) 


where  { are  the  vV-particle  antisymmetric  eigenfunctions  of  HNv,  and  the 
ground-state  density  is  given  by: 
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p(r)  mN  J  xN)V(x,,  ...,xN)  da,  dx2  -  •  ■  dxN  ,  (4) 

where  ^  is  the  ground-state  wave  function,  and  x  =  { r ,  cr}  contains  the  spatial  and 
spin  coordinates.  If  one  assumes  that  E  is  differentiable  with  respect  to  N,  then  the 
differential  form  of  eq.  (3)  is  [1]: 

dE  -  fidN  +  J  p(r)5tf(r)  dr  ,  (5) 

where 


In  addition,  E  has  a  strict  concavity  respect  to  N  and  v .  From  the  experimental 
ionization  potentials  of  atoms  and  molecules  [2]  one  can  see  that  E  is  a  convex 
function  of  N,  while  E  is  a  concave  functional  of  t?(r)  [3] .  This  fact  allows  one  to 
use  Legendre  transformations  to  find  alternative  representations  for  ground  states 

[4]: 

F[N,p]-E-  f  p(  r)v(r)  dr 
till*,  v]  =  E  —  fiN 

R[p,p]=E- pN-  Jp(r)v(r)dr.  (7) 

Unfortunately  N  and  p  are  not  independent  variables: 

N  =  J  p(r)  dr  ,  (8) 

and  the  functional  F[N,  p]  is  not  a  functional  of  two  independent  variables. 

In  this  work,  this  problem  is  avoided  by  constructing  a  new  representation  for 
ground  states.  This  new  representation  turns  out  to  be  a  concave  functional  of  both 
of  its  variables  and  its  Legendre  transformations  do  not  present  the  former  problem. 

A  systematic  method  for  the  reduction  of  derivatives  is  also  developed  for  this 
kind  of  functionals  and  this  method  is  used  to  test  some  relationships  between 
chemical  reactivity  parameters. 

The  Energy  per  Particle 

The  main  problem  with  the  functional  F[N ,  p]  comes  from  the  Legendre  trans¬ 
formation: 

FIN,  p]  =  E[N,  v]-J  *  -  (9) 

which  replaces  the  dependence  in  v  with  the  derivative  of  E  with  respect  to  v  (in 
this  case,  p),  leaving  the  other  variable  (N)  unchanged. 
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Taking  into  account  that  the  density  depends  explicitly  on  N,  Eq.  (4),  then  one 
can  extract  N  from  the  density: 

p(r)  =  NP(r),  (10) 

where  P(r)  is  the  one-particle  probability: 

P( r)  =  J  ¥*(xi,  ..  ..XjvWxi,  ..  .,xN)dax  d\2- ■  -dxN,  (11) 

and  it  must  integrate  to  one. 

From  eq.  (5),  one  can  find  the  differential  form  of  the  energy  per  particle, 
c  =  E/N, 

de  =  Mdv  +  J  P(r)Sn(r)  dr  ,  (12) 

where 

i>  =  lnjV,  (13) 


and 


E  -  fxN 
N 


P(r) 


(14) 


Then  the  energy  per  particle  is  a  functional  of  v  and  v, 

e  —  e[v ,  u]  .  (15) 

Since  N  and  v  have  a  one-to-one  correspondence,  v  and  v  determine  the  ground- 
state  energy  and,  therefore,  the  energy  per  particle.  Then  Eq.  ( 15)  is  also  a  repre¬ 
sentation  for  ground  states. 

This  new  functional  also  has  strict  concavity.  In  Figure  1,  experimental  values 
of  e,  for  helium  to  argon,  are  plotted  versus  v,  and  one  can  see  that  e  is  a  concave 
function  of  p,  for  a  fixed  potential  and  v  >  In  2.  And  for  a  given  value  of  v  (and 
therefore  a  fixed  value  of  A),  e  is  a  concave  functional  of  v  [3]: 

e[v ,  (1  -  a)l>i  +  aV2 ]  >  (1  -  a)e[v,  X^]  +  ae[v,  V2]  .  (16) 

Then  e[v,  n]  has  the  same  concavity  with  respect  to  its  independent  variables.  In 
this  case  [5],  the  Legendre  transformations  of  ^  can  be  defined: 

<t>[v,  P]  m  c[v,  d]  -  J  F(r)t)(r)  dr 
oj [M,  t)]  =  e[y,  t)]  -  Mv 

-  J  P(r)v(r)  dr  , 


ir [M,  P]  =  e[v,  t)]  -  Mv 


(17) 
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Figure  1 .  Experimental  energies  per  particle  for  helium  to  argon. 


and  these  representations  can  be  related  with  those  from  Eq.  (7).  As  a  special  case, 


4>[v,  P ]  - 


E-  j  p(rMr)  dr 


F[N,  P] 

N 


(18) 


where  one  can  see  that  the  independent  variables  of  0  do  not  have  an  explicit 
dependence. 

These  new  functionals  are  also  ground-state  representations,  since  the  strict  con¬ 
cavity  assures  a  one-to-one  mapping  for  its  independent  variables  [  5  ] .  From  Eq. 
(12)  one  can  find  the  differential  form  of  the  new  representations: 


d(f>  =  Mdv  -  J  n(r)5F(r)  dr 
do)  =  -vdM  +  J  i>(r)6u(r)  dr 
dir  =  -  vdM  -  J  n(r)&P(r)  dx  , 


(19) 


and  each  of  the  four  new  representations  yield  a  Maxwell’s  relation,  see  Table  I. 


Reduction  of  Derivatives 

All  the  functionals  used  in  this  work  depend  on  a  scalar  variable  and  on  a  function. 
Let  x  be  the  scalar  variable  and  y(r),  the  function.  To  obtain  general  relationships 
among  quantities  from  any  two  representations,  it  is  convenient  to  work  with  an 
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Table  I.  Maxwell’s  relations  for  the  new 
representations. 


Representation 

Maxwell’s  relation 

Ml  | 

fw\  _  iam\ 

W,  l av  L 

[M,v]  , 

(  hv  \  /d/>(r)\ 

\  )v 

=  (Mr)\ 

l 

\SP(r)/r  \  dv  )p 

[M,P]  ! 

(  bv  \  Mr)\ 

arbitrary  quantity  A .  The  differential  form  of  A  can  be  written  in  both  represen¬ 
tations: 


dA 


dx  i  + 


dy{(r)  dr 


dA 


dA 


^2(r) 


5y2(r)  dr  , 


(20) 


where  {x*}  =  { v,  M)  and  {yt }  =  {v9  P} .  Also  dx2  and  dy2  can  be  expressed  in 
terms  of  dxx  and  dy\ ,  and  then  both  equations  become  identical.  By  equating  the 
coefficients  of  the  independent  variables,  the  following  relations  are  obtained: 


where  the  transformation  coefficients  are  second  derivatives  in  the  representation 
of  [xi,  y\]  and  they  must  be  expressed  in  terms  of  the  second  derivatives  of  the 
representation  of  [x2,  y2] .  These  reductions  can  be  made  by  replacing  Xi  and  yi(r") 
in  Eq.  (21).  The  rearrangement  of  the  resulting  equation  leads  to  the  following 
implicit  relations  for  the  second  derivatives  of  the  representation  of  [Xi  ,yi]: 


5(r  -  r")  = 


dr' 


(22) 
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From  the  first  equation  one  must  find  [by2/byx]Xx  by  solving  the  integral  equation. 
Using  this  solution,  the  other  two  derivatives  can  be  obtained  directly  if  one  takes 
into  account  that  [fo^/Syi]*,  and  [dy2/dx i]^  are  related  by  a  Maxwell’s  relation. 
With  these  derivatives,  the  reduction  of  derivatives  can  be  done  with  Eq.  (21 ). 

If  one  independent  variable  is  common  to  both  representations,  then  the  reduction 
equations  become  simpler.  There  are  two  different  cases — one  generates  algebraic 
relationships,  whereas  the  other  gives  functional  equations.  Both  cases  are  de¬ 
scribed  below. 

When  the  second  variable  is  the  same  in  both  representations,  y2  =  yx  =  y,  the 
reduction  equations  take  their  simplest  form.  These  representations  differ  in  their 
real  variable,  [*i,  y]  and  [ x2 ,  y],  and  the  reduction  equations  can  be  written  in 
the  following  form: 


dx2)y\8y(r)Jxi 


where  the  second  derivatives  of  [xx ,  y]  are  given  by 


The  relations  given  in  the  last  equations  correspond  to  the  well-known  chain-rule, 
inverse-derivative,  and  cyclic-rule  of  differential  calculus. 

The  other  case  occurs  when  x2  =  xx  =  x ,  in  which  the  derivatives  from  the 
representation  of  [x,  yx  ]  are  reduced  in  terms  of  those  from  [x,  y2]: 


1\  /&i\  f  /  M  \  (dy2( r) 

7„  \ dx !yi  J  Uy2(r)/X\  dx 

Syd t)L  J  {Sy2(r')j\dy,(r)j 


5(r  -  r")  = 


dydj) 

dx 


r(M TJ. 

J  \Syi(r) 
[(Me) 

J  [5y,(r': 


Sy  ,(ry 

Sy2(r’) , 

'  dyi(r’) 
dx 


In  this  case  [by2/byx]x  must  be  solved  from  the  functional  inverse-equation,  and 
Eq.  (25,  26 )  corresponds  to  the  functional  generalizations  of  the  algebraic  relations 
given  by  Eqs.  (23,  24). 
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These  three  cases  can  be  found  in  the  derivative  reduction  problems  and  in  the 
following  section  they  will  be  applied  to  test  some  relationships  between  some 
reactivity  parameters. 

Relationships  between  Reactivity  Parameters 
Density  Functional  Theory  (dft)  has  been  widely  used  to  study  the  chemical 
reactivity,  and  most  of  the  reactivity  parameters  provided  by  this  theory  are  deriv¬ 
atives.  Then  the  methodology  from  the  last  section  will  be  used  with  these  quantities. 

dft  is  based  in  the  Hohenberg-Kohn  theorems  [6] .  The  first  theorem  states  that 
the  ground-state  energy  is  a  functional  of  the  ground-state  density: 

E  =  £HK[p]  -  FHK[p]  +  J  p(r)u(r)  dr  .  (27) 

The  second  is  a  variational  principle  for  the  energy,  it  establishes  that  the  functional 
EUK[p]  takes  its  minimum  value  for  the  exact  ground-state  density.  In  the  variational 
procedure,  one  restriction  must  be  included,  namely  that  the  density  must  be  nor¬ 
malized  to  the  number  of  electrons: 

iV[p]  =  J  p(r)  dr  =  N .  (28) 

This  restriction  is  incorporated  through  a  Lagrange’s  multiplier,  p,  and  the  resulting 
Euler-Lagrange  equation  takes  the  following  form: 


— —  +  v(r)  =  p,  (29) 

8p(r) 

where  p  can  be  identified  with  the  chemical  potential  of  the  system  [1] . 

Since  most  of  the  chemical  reactivity  parameters  are  derivatives  involving  p,  N, 
v,  and  p,  at  first,  one  can  relate  the  representations  of  e[v,  v]  and  c c[M,  u].  The 
relationships  for  the  second  derivatives  of  both  representations  are 


dP(  r) 

MO 


dP(  r) 
dM 


dP(r') 

6M 


8P(r) 

dv(r') 


and  they  can  be  related  with  the  derivatives  of  the  original  representations,  Eqs. 

(sH&r 


fy>(r) 

5p(r') 


Sv(r)JN 


dp(r') 

dp 


dp(r) 

Sv(r’) 
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These  derivatives  have  been  previously  defined.  The  hardness  [  7  ] 


the  softness  [  8  ] , 


the  Fukui  function  [9], 


the  local  softness  [  8  ] , 


_  i  dfi\ 

l  d»  /„  ’ 

/(r)  (  m  ),  \5n(r)  )w  ’ 


J(r) 


_  / dp(r)\ 

l  h 


5N 

Sv(r) 


and  the  linear  response  coefficient  for  the  density, 

are  then  related  by  Eq.  (30).  This  equation  becomes: 

V  =  1  /s 
A  r)  =  s(r)/S 
s0 


X(r,  r') 


r)S(r')  /  Mr)  \ 

s  W')J  • 


and  these  relations  are  found  in  Refs.  [8,10], 

Another  quantity  of  interest  is  the  hardness  kernel  [10]. 


«r,  r')  - 


Sp(r)Mr') 


(32) 


(33) 


(34) 


(35) 


(36) 


(37) 


(38) 


It  can  be  written  in  terms  of  the  derivatives  of  the  representation  of  <j>[  v,  P] ,  since 
0  determines  e,  EHK[p[i>,  P]],  and  therefore  FHK.  Some  of  its  properties  [10-12] 
can  be  also  obtained  with  the  same  procedure: 

J  f  v(r,  r’)a(r')  dr'  /( r)  dr  =  ij  J  a(r)  dr 

f  ri{ r,  r')f(r')  dr' =  r, , 


(39) 
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where  a(r)  is  an  arbitrary  function  and  the  property  J/(r)  dr  -  1  was  used.  The 
softness  kernel  [10]  was  defined  as  the  functional  inverse  of  the  hardness  kernel: 

J  5j(r,  rW,  O  dr'  =  «(r,  r") .  (40) 

This  equation  can  be  solved,  in  terms  of  derivatives: 

s(r,  r')  =  af(r)f(rr)  —  X(r,  r') ,  (41) 

where 

a  s  J  j(r,  r')  dr'  dr  ,  (42) 

and  its  value  can  be  determined  by  substitution  in  the  inverse  equation,  Eq.  (40): 

<7  =  1  -S,  (43) 

V 

then 

s( r,  r')  =  Sf(r)f(r')  -  X(r,  r') ,  (44) 

and  the  softness  kernel  can  be  identified  with  [5p(r)/$u(r')]M  by  comparison  with 
Eq.  (37). 


Discussion 

The  energy  per  particle,  as  a  functional  of  v  and  v(r),  is  an  alternative  represen¬ 
tation  for  ground  states.  This  functional  has  a  strict  concavity  with  respect  to  its 
independent  variables  and  its  Legendre  transformations  can  be  defined.  These 
functionals  are  also  ground-state  representations. 

The  independent  variables  of  one  of  the  original  representations,  F[N,  p],  have 
an  explicit  dependence.  The  corresponding  new  functional  do  not  present  this 
problem. 

A  systematic  method  for  the  reduction  of  derivatives,  for  this  kind  of  functionals, 
was  described.  This  procedure  was  proved  by  testing  some  relationships  between 
the  reactivity  parameters  provided  by  Density  Functional  Theory. 

The  methodology  for  the  reduction  of  derivatives  presented  here  only  involves 
derivatives  from  the  four  new  representations,  in  contrast  with  Nalewaj ski’s 
procedure  [13],  which  involves  some  derivatives  that  do  not  belong  to  any 
representation. 
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Abstract 

In  this  work  we  present  a  mathematical  formulation  of  the  physical  fact  that  the  bound  states  of  a 
quantum  system  confined  into  a  box  ft  (with  impenetrable  walls)  are  similar  to  those  of  the  unconfined 
system,  if  the  box  ft  is  sufficiently  large,  and  it  is  shown  how  the  bound  states  of  atomic  and  molecular 
Hamiltonians  can  be  approximated  by  those  of  the  system  confined  for  a  box  ft  large  enough  (Dirichlet 
eigenproblem  in  ft).  Thus,  a  method  for  computing  bound  states  is  obtained  which  has  the  advantage 
of  reducing  the  problem  to  the  case  of  compact  operators.  This  implies  that  a  broad  class  of  numerical 
and  analytic  techniques  used  for  solving  the  Dirichlet  problem,  may  be  applied  in  full  strength  to  obtain 
accurate  computations  of  energy  levels,  wave  functions,  and  other  physical  properties  of  interest.  ©  1994 
John  Wiley  &  Sons,  Inc. 


Introduction 

In  the  past  several  authors  have  computed  the  bound  states  of  Schrodinger  op¬ 
erators  imposing  Dirichlet  boundary  conditions  [1-8].  By  this  we  mean  solving  by 
some  method  the  time-independent  Schrodinger  equation  corresponding  to  the 
quantum  system  within  a  box  12  with  impenetrable  walls  (Dirichlet  eigenproblem), 
and  it  has  been  observed  that  the  eigenvalues  and  other  expectation  values  of  the 
confined  system  converge  to  the  values  of  the  free  system  as  the  size  of  the  box 
12  grows. 

The  aim  of  the  present  work  is  ( 1 )  to  give  the  mathematical  results  that  support 
this  procedure  (Theorems  3  and  4),  (2)  to  show  how  these  results  explain  some 
results  found  by  other  authors  in  particular  cases,  (3)  discuss  how  to  solve  the 
Dirichlet  problem  by  standard  numerical  methods  (third  section)  and  (4)  to  discuss 
some  theoretical  advantages  of  this  procedure  to  compute  bound  states  of  atomic 
and  molecular  hamiltonians. 

The  mathematical  results  (Theorems  3  and  4)  that  support  the  convergence  of 
the  method  proposed  here  have  a  simple  physical  interpretation,  namely:  Taking 
a  sufficiently  large  box  12,  the  bound  states  of  the  free  quantum  system  are  slightly 
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perturbed  by  the  boundary  <30.  Therefore,  both  the  free  and  confined  systems  have 
similar  bound  states  for  large  enough  0. 

The  Method 

Consider  the  eigenproblem  associated  with  a  free  atom  or  molecule  in  the  Bohm- 
Oppenheimer  approximation. 

Hi (/)  =  -  A i<‘>  +  q(x)iw  =  Ewiw,  ( 1 ) 

with  A  the  37V-dimensional  Laplacian,  q(x)  the  potential  associated  with  iV particles 
interacting  with  each  other  by  Coulomb  forces  and  one  of  the  eigenfunctions 
associated  to  the  energy  level  E(i), 

We  will  show  how  the  bound  states  of  the  free  system  ( 1 )  can  be  approximated 
by  those  of  the  Dirichlet  eigenproblem  associated  with  the  system  inside  a  box  0„ 
with  impenetrable  walls  <30„,  or 

=  +  =  itt(dQ„)  =  0,  (2) 

as  grows  in  the  following  manner;  Namely,  0„  will  be  a  sequence  of  boxes  such 
that  0„  C  0„+1  and  grow  to  cover  the  whole  space  R3  as  n  -►  oo . 

Every  box  0„  defines  an  open  region  in  the  3A/-dimensional  configuration  space 
R3N,  which  we  denote  by  with  boundary  <30„,  abusing  the  notation.  The  operators 
H  and  Hiln  denote  the  self-adjoint  Schrodinger  operators  associated  with  Eqs.  ( 1 ) 
and  (2)  in  L2(R3N)  and  respectively,  and  whose  energy  functionals  are 

given  by  [9] 

h(u,v)  =  (Vw,Vv)  +  (qu,v}  for  u,v  E  WU2(R3N),  (3) 

hn„(u,v)  =  <Vw,Vu>ft„  +  (qu,v)Qn  for  u,v  E  Wl0’2(tin),  (4) 

with  <,),(,  )n„  the  usual  inner  products  of  L2(R3N)  and  L2(Qn)  and  the  following 
notation  is  used  (Vw,Vr)  =  2^  (du/dxk,dv/dxf). 

Convergence  to  the  Energy  Levels  of  H 

Let  a  be  the  bottom  of  the  essential  or  continuous  spectrum  of  H.  Then  any 
element  in  the  spectrum  a(H)  of  H  and  below  a  is  an  isolated  eigenvalue  with 
finite  multiplicity.  These  eigenvalues  as  well  as  those  of  HQfi  can  be  characterized 
with  aid  of  the  min-max  principle  [9]. 

Proposition  1.  The  number  ii{k)  given  by 

M(/°  =  sup  inf  {(h(ff) :  /  e  W'*(R3N)9  </,/>  =  1}  <  a 

Bk- 1 

is  the  kth  eigenvalue  of  H,  counting  multiplicity.  In  similar  way,  the  number 

^  =  sup  inf  {h^(fj) :  fs  W'o’2(Q„),  =  1} 

Ba,k-i  Ba  k_ , 

is  the  kth  eigenvalue  of  HQn,  counting  multiplicity. 
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Remark  2.  If  the  energy  level  Eu)  of  the  free  system  has  multiplicity  ra,  and  is 
ordered  according  to  E(l)  <  E{2)  <•••<«,  then  the  relation  with  the  eigenvalues 
li(k)  given  by  the  min-max  principle  is 

i- 1 

^(ki)  <  £(0  =  ^(^+1)  =  .  .  .  =  <  ^(ki+mt+l)  wjth  fa,  _  2  ffij, 

j=  1 

and  a  similar  result  holds  between  the  energy  levels  E\fn  and  eige values  fi(nk) 

of  Han. 

The  convergence  result  to  the  eigenvalues  of  H  is  the  following  one. 

Theorem  3  [7,8].  Let  be  the  eigenvalues  of  H,HQn  given  by  the  min-max 

principle.  Then 

li<k)  <;  mSwi  =£  and  lim  flifj  =  n<k>  formal.  (5) 

«— ►oo 

If  E(i)(~ix{ki+i))  has  multiplicity  m/5  kt  =  Sj=!  ny,  then 

lim  f. t$+j)  =  E{i)  holds  for  7  =  1, ...»  m,-.  (6) 

n~*  00 

In  particular,  if  EU)  is  non  degenerate  then  {/4£+1)}“=i  is  the  unique  sequence 
converging  to  it. 

Convergence  to  the  Wave  Functions  of  H 
The  quantity  to  verify  the  convergence  of  wave  functions  is  the  norm  ||  ,  ||  of 

L2(R ™):  ll/ll  =  </,/>1/2. 

Theorem  4  [7,8].  Let  Eu)  be  an  energy  level  of  H  with  miltiplicity  m,-,  and  let 
A(n°  =  {ii§f I,  . . . ,  fi^mi }  be  the  set  of  eigenvalues  of  H^n  converging  to  E0)  ac¬ 
cording  Eq.  (26).  Then  the  following  statements  are  true: 

(i)  If  Q(l)  and  Qq}  are  the  orthogonal  projections  on  the  eigenspaces  associated 
to  E(i)  and  then  Q^l  converges  in  the  operator’s  norm  to  Q{l)  as 
n  -►  00 . 

(ii)  If  \[/{l)  and  are  wave  functions  of  the  above  eigenspaces,  then 

lim  \\ip{i)  -  ||  =  0  and  lim  -  Q^IW  =  0,  (7) 

n-+  00  n-*-  00 

where  \pa}(x)  -  0  for  x  3 

(iii)  In  particular,  if  EU)  is  nondegenerate  and  ^(,)  and  are  normalized  and 
have  the  same  phase,  then 

lim  =  0  and  lim  ||V^^  "  V^(/)||  -  0.  (8) 

n-*-  00  n-*-cc 

Remark  5.  By  Theorems  3  and  4  the  convergence  of  the  set  A(nl)  to  Eu)  implies  the 
convergence  of  the  corresponding  eigenspaces.  But  the  orthogonal  projection  as¬ 
sociated  to  each  individual  eigenvalue  of  HQn  need  not  be  convergent  [7,8]. 
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Example:  Hydrogen  atom  in  boxes.  As  particular  cases  of  above  results,  we  have 
the  work  of  Ley-Koo  and  coworkers  [2,3],  who  computed  the  exact  solutions  in 
an  increasing  sequence  of  boxes  tin  with  different  shapes  for  hydrogen  and  molecular 
ion  Hj ,  showing  in  an  explicit  way  that  the  convergence  to  the  properties  of  the  free 
system  does  not  depend  on  the  shape  ofti„,  as  Theorems  3  and  4  establish.  We  now 
discuss  briefly  the  hydrogen  results. 

Ley-Koo  and  Rubinstein  [2]  considered  the  hydrogen  atom  inside  boxes  defined 
by  pairs  of  confocal,  coaxial  and  mutually  intersecting  paraboloidal  surfaces  which 
are  open  in  opposite  directions.  Introducing  the  parabolic  coordinates  for 

the  electron  in  terms  of  its  usual  spherical  coordinates 

£  =  r(  1  —  cos  0),  ij  =  r(  1  +  cos  0),  <p  =  <p  , 

the  shape  and  position  of  the  boxes  is  defined  by  the  surfaces 

(£2  =  const,  ri,  <p),  (£,  Vn  =  const,  <p ), 

where  the  center  of  mass  at  the  origin  is  the  focus  and  £°  and  rjn  are  the  respective 
focal  distances.  By  simplicity  we  consider  symmetrical  boxes  £°  =  Therefore, 
the  box  tin  limited  by  the  paraboloidal  surfaces  grows  as  £°n  oo .  The  Dirichlet 
boundary  conditions  are  given  by 

=  0,  ^(f,  £!,¥>)  =  0. 

The  wave  function  i ptfj  was  computed  using  functional  series. 

In  agreement  with  Theorem  3,  Table  I  shows  that  the  energy  ground  state  p{nx) 
tends  asymptotically  from  above  to  as  grows  -►  oo).  The  convergence 
of  wave  functions  was  verified  with  the  calculation  of  the  isotropic  and  anisotropic 
components  of  hyperfine  splitting  given  by 

Also  =  6ol'/4!)(0)|2,  A°3}  =  |(3z2  -  r2)r“5|V'^)). 

Table  I  shows  the  convergence  of  these  values  to  its  values  of  the  free  system  as 
£2-^00. 

Ley-Koo  and  Cruz  [3]  considered  the  hydrogen  atom  inside  ellipsoidal  boxes. 
The  nucleus  is  localized  at  one  of  the  focii.  Introducing  prolate  spheroidal  coor¬ 
dinates  for  the  electron  in  terms  of  its  usual  spherical  coordinates,  we  have 


Table  I.  Convergence  of  the  ground  state  energy  isotropic  and 
anisotropic  components  of  hyperfine  splitting  for  the  hydrogen  atom 
inside  symmetrical  boxes  with  paraboloidal  walls. 


A'ko/  bo 

A^/Uq 

1.4177 

6.250  00 

555.9673 

-38.4120 

3.1599 

-0.250  00 

115.1934 

-2.6441 

6.3707 

-0.961  17 

56.8796 

-0.1307 

11.5577 

-0.999  60 

50.8870 

-0.0014 

Distance  are  given  in  bohrs  and  energies  in  Ryd. 
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f  =  (ri  +  r2)/R9  y  =  (rx  +  r2)/R,  <p  =  <p, 

with  R  the  distance  of  focci  and  rx  ,r2  the  distance  of  the  electron  from  the  focci. 
The  ellipsoidal  boxes  are  defined  by  ( =  const,  rj,  <p),  and  hence  the  box  limited 
by  the  ellipsoidal  surface  grows  as  £*  -►  oo .  Clearly,  the  boundary  condition  is 
given  by  i'ti'Kiht  <P)  =  0.  In  agreement  with  Theorems  3  and  4,  Table  II  shows 
the  convergence  of  the  energy  ground  state  and  A0  and  of  the  confined  system 
to  their  values  of  the  free  system. 

Numerical  Implementation 

Apparently  the  approximation  by  confined  systems  replaces  the  original  problem 
( 1 )  by  another  one  (2)  whose  numerical  solution  is  as  complicated  as  that  of  the 
original  one  ( 1 ).  However,  the  Dirichlet  problem  in  boxes  has  a  set  of  properties 
that  are  crucial  for  the  convergence  and  stability  of  standard  numerical  methods 
such  as  projections  methods  (which  include  the  Ritz  method)  as  well  as  the  finite- 
element  and  finite-difference  methods.  These  properties  are  due  to  Sobolev  space 
W o’2(fi)  [the  domain  of  hQ,  Eq.  (4)]  and  do  not  occur  with  the  domain  WU2(R3N) 
of  h  [Eq.  (3)]. 

As  is  known  [9,10],  W o,2(fi)  and  WX'2(R3N)  endowed  with  the  inner  products 

</,S>i  =  <V/,V£>  +  (fg)  for  fg  E  WX2(R3N), 

(fg) i.n  -  <V/,Vg>„  +  </,*>0  for  fgG  Wb2(Q), 

are  themselves  Hilbert  spaces.  The  canonical  injection  between  the  Hilbert  spaces 
Wh2(R3N)  and  L2(R3N)  is  the  operator  i  defined  by 

i :  WU2(R3N)  L2(R3n)  :f->if=  /, 

and,  clearly,  by  \\if\\  <  \[f\\  i ,  /  is  a  bounded  operator.  In  similar  way  the  following 
operator  is  also  bounded: 


Table  II.  Convergence  of  the  ground  state  energy  /4°,  isotropic 
and  anisotropic  components  of  hyperfine  splitting  for  the  hydrogen 
atom  inside  boxes  with  ellipsoidal  walls. 


Mn* 

^iso/  bo 

^ii/b'o 

2.2298 

0.05 

111.2170 

2.2163 

3.0697 

-0.75 

75.2017 

0.0419 

5.1423 

-0.99 

52.8402 

0.0015 

7.9309 

-0.999 

50.7999 

0.0073 

Distances  in  bohrs  and  energies  in  Ryd. 
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The  fundamental  difference  between  i  and  iQ  is  that  iQ  is  a  compact  operator, 
result  due  to  the  properties  of  W}f2{Sl)  and  the  boundedness  of  12  [9,10].  This 
simple  fact  is  responsible  of  the  salient  numerical  properties  of  the  Dirichlet  eigen- 
problem  (2),  which  we  discuss  briefly  below. 

( 1 )  The  resolvent  Rn(z)  =  (H&  -  z)~l  is  a  self-adjoint  compact  operator  in  L2(Q) 
[8,9].  Therefore,  if  {</>«}  f  is  an  orthonormal  basis  of  L2(12)  and  Pm  denotes 
the  orthogonal  projection  on  the  m-dimensional  space  with  basis  { 4>n }  T  » then 
the  m  X  m  matrix  Rn(m,z)  =  PmRn(z)Pm  converges  in  the  operator’s  norm 
to  Hence  may  be  proved  the  following  result  [11]. 

Theorem  6.  Let  fiQk)(m)  be  an  eigenvalue  of  the  m  X  m  matrix  H^{m)  =  PmHnPm 
given  by  the  min-max  principle.  Then 

(i)  fiQk)(m)  converges  to  the  eigenvalue  of  Hn  as  m  ->  oo . 

(ii)  If  the  energy  level  E of  Hu  has  multiplicity  mf,  then  the  set  A\i\m)  = 
{p£&l){m), . . . ,  ptfi+mi\m)}  (m  >  m,  )  converge  to  it. 

(iii)  If  Q\P  and  Q^\m)  are  the  orthogonal  projections  on  the  eigenspaces  of 
,  A\p(m),  then  Q\P(m)  converges  to  Q\}]  in  the  operator’s  norm.  Fur¬ 
thermore,  if  and  yp\}\m)  are  wave  functions  in  these  spaces,  then 

lim  -  Qi°(/w)^if)||  =  0  and  lim  ||i pli\m)  -  QHtylP(m)  ||  =  0. 

m-*  oo  m-*’  oo 


(2)  A  similar  result  to  the  above  theorem  can  be  obtained  working  in  the  space 
W  o2(fi)  endowed  with  the  inner  product  (  ,  )fW2  given  by 

</,sVq  =  Mf,g)  -  Z(f,g)a  U  <  E(l)).  (9) 

In  this  case  the  first  derivatives  of  the  wave  functions  also  converge  since  they 
are  contained  in  the  inner  product  (  ,  )ivn  [8]. 

( 3 )  The  approach  by  confined  systems  appears  in  a  natural  way  when  the  schemes 
of  finite  element  and  finite  difference  are  applied  to  the  free  system,  since  these 
can  be  only  applied  in  bounded  regions.  In  these  methods,  compactness  ar¬ 
guments  together  with  the  coerciveness  of  [Eq.  (4)] 

hdtfj)>cAf\\\M-c2\\f\\l 

play  a  crucial  role  in  the  convergence  analysis  [10-12]. 

Remark  7.  Compactness  properties  allows  us  to  give  error  estimates  of  approximate 
eigenvalues  and  eigenfunctions  obtained  from  the  methods  mentioned  above  as 
well  as  to  prove  their  numerical  stability  [10-12]. 

We  now  discuss  two  simple  numerical  examples. 

Helium  atom  solved  vocationally  in  boxes .  A  simple  example  of  computation  of 
energy  ground  state  for  the  helium  was  given  by  Marin  and  Cruz  [4],  who  used 
the  functions 
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yp^\aruar2)  =/s(ari)fs(ar2)(R  -  n)(R  -  r2)  (10) 

to  approximate  the  Dirichlet  wave  function  \phl)  within  a  sphere  with  radius  R. 
The  function  fs{ar)  is  the  Is  hydrogenic  wave  function  of  the  free  system  and  the 
parameter  a  was  used  to  minimize  the  energy  of  the  confined  system.  Table  III 
shows  the  convergence  of  the  energy  ground  state  ( *S)  as  a  function  of  R  as  well 
as  the  values  computed  by  Ludeiia  [1] .  The  slow  convergence  is  explained  by  using 
a  single  base  function.  Now,  by  compactness,  one  can  expect  a  more  rapid  con¬ 
vergence  if  the  number  of  basis  functions  is  increased.  This  is  illustrated  below. 

Hydrogen  atom  solved  by  Ritz  method  in  spheres.  In  Ref.  [  5  ]  the  Ritz  method  was 
used  to  compute  accurate  Dirichlet  wave  functions  of  the  hydrogen  atom  whithin 
a  sphere  of  radius  R .  The  basis  functions  used  in  every  box  are  given  by 

<PRk(a,r)  =  e~ar(R  -  r)rk ,  k  =  1,  2,  3,  •  •  •  (11) 

with  a  an  arbitrary  parameter.  Table  IV  shows  the  fast  convergence  of  energy,  wave 
function  and  (r2)  of  confined  hydrogen  to  those  of  the  Is  state  of  the  free  system 
as  R  -►  oo .  By  compactness,  the  convergence  of  the  Ritz  method  in  boxes  does  not 
depend  on  a .  This  fact  was  used  in  Ref.  [5]  to  optimize  a  in  a  such  way  that  it 
gives  the  asymptotic  behavior  of  the  free  system  wave  functions  of  a  particular 
state.  In  Table  IV  we  have  the  computation  of  the  2p  and  3p  states  using  this 
procedure.  The  convergence  of  eigenfunctions  was  verified  using  the  Cauchy 
criterion. 

Comparison  with  the  Standard  Ritz  Method  in  L2(R3N) 

In  this  section  some  theoretical  advantages  of  the  approximation  by  confined 
systems  respect  to  the  Ritz  method  in  L2(R3N)  are  discussed.  To  this  end,  we  give 
a  summary  of  the  latter  one. 


Table  III.  Convergence  of  the  ground  state  energy  /4l)  for 
the  helium  atom  within  a  sphere  of  radius  Rn. 


Rn 

aa 

M(*1)b 

0.5 

0.7465 

22.9229 

22.79095 

1.5 

0.9339 

-1.8456 

-1.86422 

2.5 

1.1510 

-2.7273 

-2.76644 

4.0 

1.3701 

-2.8302 

-2.85852 

6.0 

1.4924 

-2.8426 

-2.86151 

a  Parameter  of  functions  (3.2). 

Energies  in  Ryd  and  distnces  in  Oq. 
b  Variational  calculations  with  a  single  function  (10), 
Ref.  [4]. 

c  scf  calculations  of  Ludena  [1]. 
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Table  IV.  Convergence  of  energies,  wave  functions  and  the  moment  (r2)  of  the  hydrogen  atom  within 
a  sphere  with  radius  R,  toward  those  of  the  free  system. 


[R,  m] 

m 

URm 

llfom  "  l/exacll 

<r2) 

Is  state  a  =  1.0 

[4,  4] 

-0.483 

0.165J0] 

2.27 

[11,7] 

-0.499  999  845 

0.482[—3] 

2.999  85 

[18,  10] 

-0499  999  999  999  46 

0.982[-6] 

2.999  999  998  8 

[25,  13] 

-0.500  000  000  000  00 

0.500[— 7] 

3.000  000  000  000 

[32,  16] 

Same 

0.500[— 8] 

Same 

[R\  m’] 

[R  m] 

2p  state  a  =  0.5 

W^R'm'-  '/'Rmll 

[64,  13] 

[84,  16] 

-0.125  000  000  000  00 

3d  state  a  =  0.333 

0.635[ — 7] 

30.000  000  000 

[64,  12] 

[84,  15] 

-0.055  555  555  55 

0.378[— 5] 

126.000  000  000 

m  denotes  the  number  of  base  functions  (3.3)  of  the  Ritz  expansion  with  exponent  a,  and  || ,  ||  denotes 
the  norm  of  L2(,  oo).  Atomic  units  are  used. 


According  to  the  Ritz  method,  the  bound  states  of  H  are  approximated  hy  those 
of  the  m-dimensional  matrix  //,„  =  PmHPm,  where  Pm  is  the  orthogonal  projection 
on  an  m-dimensional  space  with  basis  { <t>k }  7  and  { 4>t }  f  is  a  complete  basis  of 
W'-2(R2N).  As  is  known  [13],  with  such  a  basis,  the  nondegenerate  energy  levels 
of  Hm  converge  to  those  Eu)  of  H  for  i  <  /,  provided  that  EU)  is  the  first 
degenerate  energy  level. 

Let  i l/u)  and  V'm*  eigenfunctions  of  the  above  non  degenerate  energy  levels  Eu) 
and  E(J,\  with  their  phase  properly  chosen.  Then,  the  convergence  of  eigenvalues 
together  with  the  Eckart’s  inequality  and  its  generalizations, 


E$  -  £(/) 
Eu+ ‘>  -  EU) 


(12) 


implies  the  convergence  in  the  norm  of  to  the  exact  one.  Furthermore,  from 
the  inequality  h(fj)  +  C,  |[/']| 2  >  C2||V/|| 2  one  obtains,  as  in  Theorem  4  [Eq.  (8)], 
that 


lim  ||V^>-  V^(i)|l  =0. 

m-+co 

In  a  previous  section  we  discussed  the  role  of  the  compactness  arguments  in  the 
convergence  analysis  of  numerical  methods  for  solving  eigenproblems.  Now,  a  nec¬ 
essary  condition  in  order  to  have  operators  with  compact  resolvent,  is  that  their 
spectrum  consists  entirely  of  isolated  eigenvalues  with  finite  multiplicity  [9]. 
Therefore,  the  Hamiltonian  //associated  with  the  free  system  does  not  has  a  compact 
resolvent  R(z )  =  (H  -  z)"1 .  This  implies  the  following  facts: 

( 1 )  The  m  X  m  matrices  Rm(z)  =  PmR(z)Pm  can  not  converge  in  the  operators 
norm  to  R(z)  [if  this  were  the  case,  then  R(z)  would  be  compact].  At  this 
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point  the  approximation  by  confined  systems  is  salient  (see  the  paragraph  above 
Theorem  6). 

(2)  Compactness  arguments  permits  us  to  analyze  the  convergence  and  stability 
of  methods  for  solving  numerically  the  Dirichlet  problem  (2.2)  (Remark  7),  while 
the  non  compactness  of  R(z)  imposes  the  necessity  of  developing  new  alternatives 
to  make  such  analysis. 

( 3 )  The  convergence  of  the  Ritz  method  in  L2(R3N)  has  been  proved  for  simple 
states  [13].  But  even  when  the  convergence  to  degenerate  energy  levels  holds  true, 
Eckart’s  inequality  and  its  generalizations  can  not  be  used  to  analyze  the  convergence 
of  the  corresponding  eigenspaces,  since  such  analysis  requires  a  relationship  between 
the  orthogonal  projections  associated  to  the  exact  and  approximate  eigenspaces. 
Newly,  at  this  point  the  approximation  by  confined  systems  is  salient  (Theorems 
4  and  6).  Furthermore,  by  working  in  IFj’2(Q)  endowed  with  the  inner  product 
(9),  we  can  obtain  information  about  the  first  derivatives  of  the  eigenfunctions 
associated  to  degenerate  states. 

(4)  Respect  to  the  calculation  of  expectation  and  transition  values  different  from 
the  energy  levels,  there  is  an  important  difference  between  the  approach  by  confined 
systems  and  the  usual  Ritz  method  in  L2(R3N).  Klahn  and  Morgan  [14]  showed 
in  detail  how  the  Ritz  expansion  with  a  Gaussian  basis  set  for  the  hydrogen  atom 
wave  functions  gives  approximate  moments  which  diverge  or  converge  to  a  wrong 
limit,  even  when  the  Ritz  wave  functions  converge  correctly  in  the  norm  of  L2(R3), 
pointing  out  that  the  correct  calculations  depend  on  the  ability  of  finite  linear 
combinations  to  duplicate  the  nonanalyticities  (cusps)  in  the  exact  wave  function. 

In  boxes  the  calculation  of  expectation  and  transition  values  of  a  symmetric 
operator  is  quit  simple.  Consider,  for  example,  the  operator  S  ~  rk  (with  k  >  0) 
associated  with  the  moment  of  the  one-electron  density  of  confined  system. 

Since  S  is  a  bounded  operator  in  L2(0),  then  any  numerical  sequence  \ p^(m) 
converging  to  \[/^  in  the  norm  ||  ,  ||«  of  L2(Sl)  satisfies 

lim  (Vd\™)skVd\M)\  =  <*£0,rfy&°> a 


independently  on  the  local  peculiarities  of  .  It  remains  to  prove  that  the  expec¬ 
tation  value  converges  to  that  of  the  free  system  when  converges 

toward  \p{i)  as  U  grows  [Eq.  (8)].  This  result  was  rigorously  proven  in  Ref.  [15] 
for  a  wide  class  symmetric  operators,  including  rk  with  k  >  —2,  in  the  1 -dimensional 
case,  and  it  is  intuitively  obvious  in  several  dimensions  since  the  free  and  confined 
systems  have  bounded  states  with  similar  properties  when  &  is  large  enough. 

The  above  discussion  points  out  the  relevance  that  the  approach  through  confined 
systems  may  have  to  compute  with  accuracy  and  certainty,  the  most  significant 
quatities  in  quantum  chemistry,  avoiding  technical  difficulties  that  accompany  the 
Ritz  method  in  L2{R3N).  Furthermore,  it  shows  that  numerical  methods  (such  as 
the  finite-element  and  finite-difference  schemes)  that  have  been  scarcely  used  may 
be  applied  with  positive  results. 
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Abstract 

In  wave  mechanics  an  appropriate  description  of  a  system  under  the  influence  of  a  linearly  velocity 
dependent  frictional  force  can  be  given  by  a  nonlinear  Schrodinger  equation  (nlse)  with  logarithmic 
nonlinearity.  However,  the  particular  logarithmic  form  of  the  dissipative  nonlinear  frictional  term  in  the 
NLSE  is  connected  with  the  definition  of  the  momentum-  or  velocity-operator  in  position-space.  Therefore, 
in  momentum-space,  this  form  of  the  NLSE  is  no  longer  correct  to  describe  the  same  physical  situation. 
This  can  be  seen,  e.g.,  from  the  fact  that,  in  contrast  to  the  linear  case,  the  Fourier  transform  of  the 
solution  of  the  nlse  in  position-space  does  not  fulfill  anymore  the  logarithmic  nlse  in  momentum- 
space.  It  will  be  shown,  using  results  obtained  from  the  theory  in  position-space,  that  it  is  possible  to 
find  a  form  of  the  nonlinear  dissipative  frictional  term  which  is  valid  in  position-  as  well  as  in  momentum- 
space.  Using  this  form,  the  NLSE  looks  like  a  diffusion  equation  with  complex  diffusion  coefficient,  i.e., 
a  combination  of  a  diffusion  and  a  Schrodinger  equation.  The  solution  of  this  nlse  in  momentum-space 
will  be  discussed.  ©  1 994  John  Wiley  &  Sons,  Inc. 


Introduction 

Over  the  last  few  decades  many  approaches  were  introduced  to  describe  open 
quantum  systems,  i.e.,  systems  which  dissipate  mechanical  energy  and  show  an 
irreversible  evolution  in  time.  The  variety  of  attempts  can  be  divided  into  two  main 
categories:  they  either  use  the  system-plus-reservoir  approach  or  modify  the  quantum 
mechanical  equations  of  motion.  Among  the  second  category,  Hamiltonians  which 
are  explicitly  time-dependent  [1,2]  and  nonlinear  Schrodinger  equations  (nlse) 
[3-7]  have  been  proposed.  Most  of  these  approaches  were  criticized  because  they 
yield  some  dubious  or  even  unphysical  results  such  as  stationary  damped  states 
[3,4],  wrong  frequencies  for  the  damped  harmonic  oscillator  [3-5,7]  or  even  violation 
of  the  uncertainty  principle  [1,2].  However,  it  has  been  shown  [8,9],  that  all  the 
above-mentioned  shortcomings  can  be  overcome  if  time-symmetry  is  broken  on 
all  levels  of  the  equations  of  motion.  These  equations  are  the  ones  determining  the 
time-evolution  of  the  mean-values  of  position  and  momentum  (corresponding  to 
the  classical  equations  of  motion),  the  wave  mechanical  Schrodinger  equation  (SE), 
and  the  equation  for  the  related  density  function,  thus  turning  a  continuity  equation 
into  a  Fokker-Planck-type  equation.  This  results  in  a  nlse 
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ih  |  *(r,  t)  =  {—  ~  A  +  V(r)  +  ^W(r,  t)  ( 1 ) 

with  logarithmic  nonlinearity  in  position-space.  The  dissipative  nonlinear  term 

Ws  =  (In  *  -  (In  '!'»  (2) 

in  this  equation  corresponds  to  a  linear  velocity  dependent  frictional  force. 

However,  the  particular  logarithmic  form  of  the  frictional  term  Ws  is  connected 
with  the  definition  of  the  momentum-  or  velocity-operator  in  position-space.  (This 
definition  can  also  be  used  to  “rederive”  the  nlse  [10].)  This  can  be  seen,  e.g., 
from  Schrodinger’s  original  papers  [11,12],  where  he  used,  for  the  canonical  mo¬ 
mentum,  the  definition 


hV*  h 

=  ~  ~r-  =  —  V  In  =  rms . 

i  i 


(3) 


Obviously,  the  mean  value  of  p5 


<Pj>  =  J  '*'* 


h 


^  dr  =  \k*  —  V'F  dr 


is  equivalent  to  the  mean  value  of  the  quantum  mechanical  momentum-operator 
Pop  -  (h/i)V  in  position-space.  In  this  representation  the  negative  gradient  of  Ws 
has  the  form  of  a  frictional  force  linearly  proportional  to  the  velocity  \s,  —VWS 
=  “7  m\s. 

However,  in  momentum-space,  the  momentum-operator  is  simply  a  c-number, 
Pop  =  P,  whereas  now,  the  position-operator  is  connected  with  the  Nabla-operator 
in  momentum-space,  Vpi  via  rop  =  —(h/i)V p.  Therefore,  if  Ws  is  used  in  the  same 
logarithmic  form  as  in  position-space,  the  negative  momentum-space  gradient  of 
Ws,  —  VPWS,  no  longer  yields  a  frictional  force  which  is  linearly  proportional  to 
velocity  (with  a  negative  sign),  as  Vp  In  ^  is  linearly  proportional  to  position  (with 
a  positive  sign). 

So,  in  momentum-space,  this  form  of  the  NLSE  is  no  longer  correct  to  describe 
the  same  physical  situation.  This  can  also  be  seen  from  the  fact  that  the  Fourier- 
transform  (ft)  of  the  wave  packet  (wp)  solutions  in  position-space  are  (in  contrast 
to  the  linear  theory)  no  longer  solutions  of  the  nlse  in  momentum-space,  if  the 
form  of  Ws  remains  unchanged. 

In  order  to  find  a  form  of  the  nonlinear  frictional  term  which  is  valid  in  position- 
as  well  as  in  momentum-space,  results  obtained  from  the  NL-theory  in  position- 
space  can  be  used. 

This  new  form  shows  similarities  with  another  nonlinear  equation  which  was 
essentially  based  on  mathematical  arguments.  It  can  also,  at  least  formally,  be 
transformed  into  a  linear  differential  equation. 

Exact  analytic  Gaussian-shaped  wp  solutions  of  the  nlse  in  momentum-space 
will  be  given  and  their  properties  will  be  compared  with  those  of  the  corresponding 
wps  in  position-space. 
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To  an  Alternative  Form  of  the  Dissipative  Term 

Using  results  from  the  dissipative  theory  in  position-space  [  10, 1 3],  new  variables 
which  contain  the  same  information  on  the  wave-  and  particle-aspects  as  the  wave- 
function  of  the  time-dependent  SE  can  be  defined.  With  the  help  of  these  variables 
it  is  possible  to  express  the  nonlinear  dissipative  term  in  a  form  which  is  applicable 
in  position-  and  momentum-space. 

Analyzing  the  exact  analytic  Gaussian  wp  solutions  of  the  NLSE  (discussed  here 
for  the  damped  harmonic  oscillator  (HO)  in  one  dimension)  in  position-space 

>Fwp(*,0  =  N(t)  exp  i  y(t)x 2  +  ~  (p)x  +  K(t)  j  ,  (5) 

where  x  =  x  —  (x)  —  x  —  r\(t)  [the  explicit  form  of  the  N(t)  and  K(t)  is  not  relevant 
to  the  following  discussion],  it  is  found  that  the  maximum  of  the  wp  at  x  =  (x) 
(=  mean  value  of  position  x)  follows  the  classical  trajectory  (x)  =  i?(0,  which  obeys 
the  Newtonian  equation 

rj  +  7?)  +  =  0  ,  (6) 

thus  expressing  the  particle  aspect. 

The  wave  aspect,  connected  with  the  finite  wp-width  which  is  proportional  to 
the  position  uncertainty,  (x2)1/2  =  ((x2)  —  (x)2)1/2,  can  be  obtained  from  the 
imaginary  part  of  the  complex  coefficient  y{t)  via 


1  m 

y'  =  4<x2>  =  2W  ' 


(7) 


The  time-evolution  of  the  WP-width  is  governed  by  the  nonlinear  Riccati  equation 
for  the  complex  quantity  2hy{t)/m: 


+ 


T*  a?2  —  0  . 


(8) 


Additional  insight  into  the  dynamics  of  the  investigated  system  can  be  obtained  by 
linearizing  Eq.  (8)  with  the  help  of  the  substitution 


(9) 


introducing  the  new  complex  variable  X  =  u  4-  iz  -  a  exp (i<p  —  yt/2),  to  transform 
the  nonlinear  Riccati  equation  (8)  into  the  linear  Newtonian  equation  of  motion 

X  +  (*>2X  —  0  .  (10) 


It  can  be  shown  [10,13]  that  in  cartesian  coordinates,  z  is  directly  proportional 
to  the  classical  trajectory,  ( a0p0/m)z  =  <x)  =  r)(t\  thus  reflecting  the  particle  aspect, 
whereas  in  polar  coordinates,  the  absolute  value  of  X  is  proportional  to  a{t) 
=  (2 m(x2)lh)l/2,  as  defined  in  Eq.  (7),  thus  reflecting  the  wave  aspect  or  position- 
uncertainty. 
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The  desired  form  of  the  frictional  term  should  not  explicitly  depend  on  the  po¬ 
tential  of  the  problem,  but  somehow  be  connected  with  the  velocity  and/or  kinetic 
energy  of  the  system,  as  it  describes  a  velocity  dependent  frictional  force. 
Requiring  that  the  condition 

Ws*  wp=^wp  (ID 

has  to  be  fulfilled,  it  is  possible  to  express  Ws,  with  the  help  of  X,  in  terms  of  the 
operator  of  kinetic  energy  (and  its  mean  value): 

WD  =  t(t)  ^  (Pop  -  <Pop»  ■  (12) 

This  form  of  the  frictional  term  is  valid  in  position-  and  momentum-space.  It  has 
obviously  the  same  property  as  the  logarithmic  nonlinearity  Ws,  namely  a  dis¬ 
appearing  mean  value,  (Wd)  =  0.  Therefore,  WD  influences  the  dynamics  of  the 
observable  system  by  altering  the  wavefunction,  as  the  solution  of  the  NLSE  is  dif¬ 
ferent  from  the  solution  of  the  linear  SE,  although  WD  itself  is  not  observable.  This 
is  similar  to  the  situation  where  the  motion  of  a  Brownian  particle  is  affected  by 
the  interaction  with  the  environment,  although  the  motion  of  the  surrounding 
molecules  cannot  be  directly  observed. 


Comparison  with  Similar  Equations 

The  form  (12)  of  our  dissipative  nonlinearity  shows  similarity  to  an  attempt  by 
Gisin  [14-19]  to  describe  open  quantum  systems.  This  approach  is  based  on  the 
“assumption  that  quantum  systems  are  always  in  a  pure  state,  whether  exchanging 
energy  with  their  surrounding  or  not”  [18].  The  time-evolution  of  the  system  is 
supposed  to  be  compatible  with  the  Hilbert  space  structure  of  quantum  kinematics 
and  to  preserve  superpositions  in  a  certain  sense  (for  mathematical  details  see,  e.g., 
[17,19]).  It  can  be  shown  that  under  these  conditions  there  exists  a  contractive 
semigroup  of  linear  operators  Wt  =  eZt  with  a  generator  Z  which  can  be  written  as 

Z=-^H-B.  (13) 

h 


However,  a  corresponding  time-dependent  SE  of  the  form 


H- B 


m 


(14) 


would  have  the  “unpleasant  feature  that  it  does  not  preserve  the  norm  of  a  vector, 
which  is  the  usual  demand  for  the  evolution  equation  in  quantum  theory”  [20]. 
This  problem  can  be  solved  by  adding  the  mean  value  of  the  operator  B,  (B) 
=  J  ^*(t)B^(t)  dr/§  **(/)\k(0  dr ,  to  the  rhs  of  Eq.  (14).  Then  the  normalized 
solutions  \k(t)  satisfy  the  equation 
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which  is,  due  to  the  introduction  of  the  mean  value  (. B >,  a  nonlinear  differential 
equation. 

The  physical  meaning  of  the  operator  B  is  not  specified  in  Eq.  (15);  only  its 
mathematical  properties  are  used.  The  occurrence  of  (1?)  also  guarantees  the  prop¬ 
erty  mentioned  in  connection  with  our  dissipative  term  Ws  (which  is  also  valid  for 
WD\  namely  that  the  mean  value  of  the  term  additional  to  the  usual  Hamiltonian 
H  on  the  rhs  of  the  NLSE  disappears. 

According  to  Sz-Nagy’s  theorem  the  contraction  semigroup  Wt  can  be  dilated  to 
a  unitary  evolution  on  a  larger  Hilbert  space  [17-19]. 

It  was  also  shown  that  Eq.  (15)  may  be  derived  using  the  general  formalism  of 
master  equations  [16].  Due  to  the  formal  similarities,  the  same  should  also  apply 
to  our  NLSE. 

The  properties  of  Eq.  (15)  are  discussed  in  detail  in  [14];  here,  only  the  following 
will  be  mentioned: 


a)  Eq.  (15)  conserves  the  norm  of  'J'(J); 

b)  if  ^  is  an  eigenstate  of  B ,  then  Eq.  (15)  reduces  to  the  usual  SE; 

c)  the  evolution  of  the  mean  value  of  the  (time-independent)  operator  B  reads: 
(d/dt)(B)  =  -2((B2)  -  (B)2)  =  -2(B  -  (B))2  <  0  and  (d/dt)(B)  =  0  iff  ¥ 
is  an  eigenfunction  of  B. 


Especially  for  B  =  kH  this  leads  to 

|  (H)  =  -2 k((H2)  -  (H)2)  =  -2 k(H  -  (H)f  <  0  .  (16) 


Of  note  is  that  up  to  this  point  k  can  be  any  function  with  positive  valued  real  part 
in  order  to  guarantee  the  decrease  of  energy. 

Properties  a)  and  b)  are  also  valid  for  our  nonlinear  dissipative  terms  Ws  and 
WD.  However,  in  our  case,  the  physical  meaning  of  the  occurring  operators  is  spec¬ 
ified  and  related  to  some  frictional  force.  A  property  similar  to  c)  has  been  shown 
[9]  to  be  valid  also  for  the  logarithmic  dissipative  term  Ws. 

For  the  choice  B  =  kH ,  as  shown  in  Gisin’s  case,  the  decrease  of  the  mean  value 
of  energy  [see  Eq.  (16)]  would  be  proportional  to  (the  square  of )  H  =  T  +  V9  i.e., 
it  would  depend  explicitly  on  the  potential  V.  In  classical  mechanics,  however,  even 
for  V  ¥=  0  (e.g.,  as  for  the  damped  HO),  the  energy  dissipation  is  only  proportional 
to  the  kinetic  energy  T  according  to  Ec lass  =  —2y  Tclass.  Therefore,  a  form  of  B  which 
is  independent  of  V,  but  related  to  the  kinetic  energy  T  =  p2/2m,  seems  to  be 
reasonable.  This  condition,  however,  is  fulfilled  by  the  new  form  Wo  of  our  NL 
frictional  term.  So,  WD  can  be  considered  as  a  kind  of  physically-motivated  spec¬ 
ification  of  Gisin’s  formal,  mathematically-motivated  approach. 

Similarities  with  other  equations  can  be  found,  if  the  new  form  WD  in  our  nlse 
is  considered  in  position-space  (in  the  following  in  one  dimension), 


2m  dx 2 


2m  dx2 


(17) 
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The  second  derivative  with  complex  coefficient  can  be  combined  with  the  operator 
of  kinetic  energy  allowing  one  to  write  the  nlse  in  position-space  in  the  form 


with 


t:  *(x,t) 
at 


nx,t) 


At) 

At) 


(l+y  t\i  zt—  :  complex  “diffusion  coefficient” 
\  x/  2m 

~r  t  It~)  •  nonlinearity. 

ih  x  \2 mj 


(18) 


Formally,  g{t)  is  just  a  (complex)  time-dependent  function.  However,  in  order  to 
actually  calculate  the  mean  value,  the  solution  ^  of  the  nlse  has  to  be  known.* 
Considering  the  case  V  =  0,  i.e.,  free  motion  with  damping,  and  treating  g(t) 
formally  as  a  mere  time-dependent  function,  g(t)  can  be  removed  from  the  equation 
via  the  transformation 


^(x,?)  =  \k(x,0  exp]  - 
The  equation  for  ^(x,^). 


fW 

Jo 


Mxj)  -At)  £3  *(x,t)  =  o 


(19) 


(20) 


has  the  form  of  a  diffusion  equation  with  a  complex  time-dependent  diffusion  coef¬ 
ficient  f(t).  Keeping  in  mind  that  the  SE  for  the  free  motion  has  the  form  of  a 
diffusion  equation  with  a  purely  imaginary  diffusion  coefficient,  ih/2m ,  Eq.  (20)  is 
a  combination  of  a  reversible  linear  SE  and  an  irreversible  diffusion  equation.  For 
both  of  these  equations  the  superposition  principle  holds.  This  might  explain  why, 
for  the  damped  free  motion  in  position-space,  our  nlse  shows  a  kind  of  superpo¬ 
sition  property,  as  there  exist  plane-wave  type  solutions  which  can  be  superimposed 
to  form  a  Gaussian  WP-type  solution  of  the  nlse  [21]. 


Solutions  in  Momentum-Space 

In  the  following,  the  new  form  (12)  of  our  dissipative  frictional  term  shall  be 
considered  in  momentum-space  and  the  solutions  of  the  corresponding  nlse  (here 
for  the  HO  in  one  dimension)  shall  be  analyzed. 

Now  also  in  our  nonlinear  theory,  in  analogy  to  the  linear  theory,  the  solution 
in  momentum-space  can  simply  be  obtained  by  ft  of  the  solution  in  position- 
space.  So,  the  FT  of  the  wp-solution  (5), 


It  might  be  possible  to  apply  iterative  techniques,  e.g.,  as  in  the  Hartree-Fock  method. 
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*wp(P,0 


^wp(X0<?  ipx/h  dx 


leads  to  the  Gaussian  wp-solution  in  momentum-space, 


*wp  (P,t) 


1  x  f  \ia(t) 
MO  exp|  ‘  P 


&p  +  K(t) 

ft 


h 


with  p-  p-  (p)  and  (am/ih)  =  X/X. 

The  particle  aspect  is  again  expressed  by  the  fact  that  the  maximum  of  the  wp 
coincides  with  the  classical  momentum  (p)  =  mi 7,  where  the  classical  trajectory  y 
=  (x)  is  determined  by  the  Newtonian  equation  of  motion  (6).  ^ 

The  wave  aspect,  i.e.,  the  momentum-uncertainty  (p2)l/2  =  ((p2)  -  (p)2)17  or 
the  WP-width  in  momentum-space,  is  connected  with  the  real  part  of  the  complex 

coefficient  a(t)  via 


h  2 (p2)  ' 

The  time-evolution  of  the  WP-width  in  momentum-space  is  also  governed  by  a 
Riccati  equation,  now  for  the  complex  quantity  (amjih): 


As  can  be  seen  from  the  respective  definitions  (using  the  complex  variables  X 
and  X),  the  transition  from  position-  to  momentum-space  (in  order  to  obtain  the 
information  on  the  dynamics  of  the  wave  aspect)  does  not  actually  need  the  FT, 
but  simply  the  inverse  of  the  corresponding  quantity  has  to  be  calculated,  as 


am\  __  llh  r1  X 
ih)  U  /  X 


is  valid. 

The  nonlinear  complex  Riccati  Eq.  (24)  can,  as  well  as  the  Riccati  Eq.  (8)  in 
position-space,  be  linearized  to  the  complex  Newtonian  Eq.  (10).  The  connections 
among  these  equations  are  shown  in  Figure  1 .  Therefore,  with  the  results  obtained 
in  this  work,  analytical  expressions  for  the  time-dependence  of  position-  and  mo¬ 
mentum-uncertainty,  and  thus  of  the  uncertainty  product  for  dissipative  systems, 
can  be  determined  explicitly  simply  by  solving  the  complex  Newtonian  Eq.  (10) 
for  given  initial  conditions. 


Conclusions  and  Perspectives 

It  has  been  shown  that  it  is  possible  to  obtain  a  dissipative  wave  mechanical 
equation  with  a  nonlinear  dissipative  term  which  is  valid  in  position-  and  momen¬ 
tum-space.  This  form  of  our  nlse  shows  similarities  to  an  approach  by  Gisin  and, 
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complex  Ricattt  eq.  for  position-space 


complex  Ricatti  eq.  for  momentum-space 

Figure  1 .  Connections  among  the  equations  determining  the  time-dependence  of  position- 
and  momentum-uncertainties:  The  complex  variable  entering  the  nonlinear  Riccati  equation 
for  the  momentum  uncertainty  is  the  inverse  of  the  corresponding  variable  in  the  Riccati 
equation  for  the  position  uncertainty  and  both  are  linked  via  the  variable  A  in  the  complex 
linear  Newtonian  equation. 


for  the  damped  free  motion  in  position-space,  it  is  formally  a  combination  of  a 
reversible  SE  and  an  irreversible  diffusion  equation. 

The  nlse  in  momentum- space  possesses  exact  analytic  Gaussian  wp  solutions. 
The  maximum  of  the  WP  obeys  the  classical  equation  of  motion;  the  time-depen¬ 
dence  of  the  WP-width  is  determined  by  a  complex  Riccati  equation  and  provides 
the  time-dependence  of  the  momentum-uncertainty.  This  Riccati  equation  can  be 
obtained  from  the  corresponding  Riccati  equation  for  the  position-uncertainty  sim¬ 
ply  by  considering  the  inverse  variable,  not  needing  any  ft. 

In  a  continuation  of  this  work  it  shall  be  investigated  how  the  equations  of  motion 
describing  the  wave-  and  particle-aspects  in  momentum-space  are  connected  and 
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if  there  also  exists  a  dynamical  invariant  of  Ermakov-Lewis  type  similar  to  the 
dissipative  case  in  position-space  [10,13].  Furthermore,  the  question  of  the  super¬ 
position  properties  of  our  nlse  allowing  the  establishment  of  a  Green’s  function 
formalism  to  describe  the  propagation  of  a  dissipative  system  in  time  is  of  interest 
and  will  be  discussed  in  forthcoming  work. 
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Abstract 

Similarities  between  the  Franck-Condon  effect  and  the  concept  of  Squeezed  States  have  been  advanced 
as  a  conjecture  in  the  past,  or  noticed  as  a  marginal  fact,  without  giving  a  full  explanation  of  how  these 
two  important  concepts  are  connected.  In  this  work  we  outline  a  unifying  formulation  that  contains  as 
special  cases  both  phenomena.  The  power  of  the  new  approach  is  shown  in  a  general  formula  from  which 
all  particular  cases  can  be  derived.  ©  1994  John  Wiley  &  Sons,  Inc. 


Several  years  ago  we  started  working  the  derivation  of  recurrence  and  closed 
formula  for  Franck-Condon  factors  through  the  formalism  of  second  quantization 
[1].  Some  authors  had  tried  to  realize  this  goal  before  us.  The  pioneering  work  of 
S.  Koide  [2]  is  especially  worthy  of  mention,  because  he  reaches  a  closed  formulae 
first  derived  by  Ansbacher  [3]  by  means  of  analytic  methods. 

Other  authors  [4-8]  have  tried  in  some  way  to  give  a  complete  algebraic  for¬ 
mulation  for  this  problem.  Especially  important  is  the  work  of  Fernandez  [7]  and 
Ma  and  Rhodes  [9],  who  consider  the  problem  of  time-dependent  transition  prob¬ 
abilities  and  obtain  recurrence  relations  by  an  algebraic  formulation  which  is  in 
somehow  similar  to  the  one  we  worked  some  years  ago  [1,10, 1 1]. 

In  light  of  these  efforts  a  unique  algebraic  formulation  for  Franck-Condon, 
Squeezed  States,  and  the  Young’s  Double  Slit  experiment  seems  to  be  possible. 
This  will  reveal  the  mathematical  structure  and  clarify  the  physics  which  is  behind 
these  phenomena. 

In  this  article  we  are  going  to  give  a  common  algebraic  formulation  for  the 
Franck-Condon  and  the  Squeezed  States  based  on  the  approach  we  had  worked 
in  the  past. 

The  essence  of  the  Franck-Condon  Effect,  the  Squeezed  States,  and  perhaps  the 
Double  Slit  experiment  can  be  seen  as  distortions  and  translations  from  one  potential 
well  to  another.  This  fact  can  be  described  quite  well  by  a  mapping  operator 
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\n)G  =  0\n)E  (1) 

between  the  states  of  the  two  wells. 

The  operator  &  can  assume  different  forms  depending  which  particular  case  we 
are  interested  in,  but  the  most  general  form  which  covers  all  particular  cases  is 

®  =  :  eCa+a  :  (2) 

where  a+  and  a  are  the  well-known  creation  and  annihilation  boson  operators.  The 
notation  :  :  means  normal  order  for  boson  operators.  A  particular  realization  of 
this  operator  for  the  case  of  the  Franck-Condon  Effect  is  given  in  Eq.  (8)  of 
Ref.  [10]. 

It  is  possible  to  derive  a  closed  formula  for  intensities;  this  can  be  done  by  means 
of  the  Cauchy  relations  for  complex  variable 

(m\&\n)  =  f f  0  0  <0 1  exp  (sa)0  exp (ta+)  1 0)  (3) 

and  by  using  the  well-known  creation  and  annihilation  properties  of  boson  operators 
and  the  integral  representation  of  Hermite  polynomials  we  obtain: 

[m,n]  /  n  \  (  D  \ 

(m\&\ri)  =  mm\n\)'<2  2  Mmn (4) 


where 


Mmnk 


efcC  (iAmyn-k  ^E\ I2y-k 

k\  (m  —  k)\  (n  ~~  k)\ 


and  Hn  is  a  Hermite  polynomial  of  degree  n  and  Ji  is  a  normalization  constant. 


This  expression  is  a  generalization  of  that  obtained  for  the  first  time  by  Ansbacher 
[3]  for  the  case  of  Franck-Condon  factors,  afterwards  it  has  been  rederived  by 
several  authors  [1,6].  The  important  thing  we  want  to  point  out  here  is  that  this 
formula  can  be  used  in  the  calculation  of  intensities  for  Franck-Condon  factors, 
Squeezed  States,  and  perhaps  the  Double  Slit  Experiment.  In  other  words  the  al¬ 


gebraic  formulation  in  the  present  work  unifies  both  phenomena  and  gives  the 
mathematical  support  to  the  idea  advanced  some  years  ago  by  Wheeler,  about  the 
parallelism  between  these  three  phenomena  [12]. 

In  order  to  derive  special  cases  from  Eq.  (4)  we  need  to  give  the  values  of  the 
constants  appearing  in  it  (A,  B ,  C,  D,  E,  F). 


The  squeezed  states  are  obtained  if  we  make  the  next  identification 


FRANCK-CONDON  FACTORS 


263 


where  the  normalization  constant  is  given  by 

/-la  I2  .9tanh|z|\ 
ft  =  (sech|z|)1/2  exp^  2 - Za!  2|z|  ) 

where  a  and  z  are  the  well-known  parameters  appearing  in  the  definition  of  the 
squeezed  operator  [13].  The  case  of  Franck-Condon  can  be  obtained  by  identifying 
directly  Eq.  (2)  with  Eq.  (8)  of  Ref.  [10]. 

In  order  to  show  the  power  of  the  present  formulation  we  expose  here  the  exact 
evaluation  of  the  coefficients  c„(\)  which  appear  in  Ref.  [14]  in  a  sudden  approx¬ 
imation  description  of  squeezed  states.  For  that  purpose  we  consider  the  expression 

<0 1  H„{-h*a)Hm(ha+)  =  ^  J^<0|?1°>  (5) 


where 


5?  =  f(s,  t,  a,  a*,  y)  =  exp(-(2  -  2h*ta)  exp(-i2  +  liysa*) 


it  can  be  easily  seen  that  Eq.  (5)  is  a  particular  case  of  Eq.  (3).  After  some  algebraic 
manipulations,  this  expression  can  be  evaluated  exactly,  giving 


(0 1  HJ,—iy*a)Hm(iya+)  |  0) 


^H\y\2k  H„-k(0)  H„ 
n\m\  2j 


-*(  0) 


k\  ( n  -  k)\  (m  -  k)\ 


where  [m,  n]  denotes  the  smaller  of  n  and  m,  in  fact  this  is  the  formula  which  gives 
the  Franck-Condon  overlap  [15]  for  squeezed  states  with  zero  displacement 
(a  =  0). 

Another  particular  case  of  our  formulation  is  the  case  treated  in  Ref.  [16],  we 
obtain  their  basic  Eq.  (4)  making  the  following  identifications 


n  =  0,  ft  = 


ifs 


s+  1 


1/2 


1  -  s 
2(1  +  s) 


B  =  - 


25a 
1  +  5 


from  our  Eq.  (4). 

In  summary  we  have  shown  here  that  there  exist  a  unifying  algebraic  formulation 
for  the  Franck-Condon  Effect,  the  Squeezed  States,  and  perhaps  the  Double  Slit 
experiment.  The  parallelism  between  these  phenomena  was  advanced  some  years 
ago  by  Wheeler  [12].  We  have  shown  here  that  they,  at  least  for  the  fcf  and  ss, 
can  be  obtained  as  particular  cases  of  a  more  general  algebraic  formulation.  Although 
we  have  shown  results  only  for  FCF  and  SS,  work  for  the  case  of  Young’s  Double 
Slit  experiment  is  in  progress. 
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Abstract 

The  contribution  of  the  electron  correlation  effects  to  the  calculated  dipole  polarizability  and  hyper¬ 
polarizability  of  the  first-row  atomic  anions  is  calculated  and  analyzed.  It  is  shown  that  the  total  correlation 
contribution  to  the  dipole  hyperpolarizability  is  extremely  large  with  the  Hartree-Fock  model  accounting 
for  only  a  small  fraction  of  the  accurate  result.  The  linear  and,  more  pronounced,  the  nonlinear  response 
of  atomic  anions  to  the  application  of  an  electric  field  emphatically  shows  the  effects  of  the  correlated 
motion  of  the  electrons.  ©  1994  John  Wiley  &  Sons,  Inc. 


The  static  dipole  polarizability  is  the  primary  response  of  an  atomic  or  molecular 
system  to  the  application  of  a  weak  static  electric  field  [1].  As  such  it  relates  to  a 
great  variety  of  physical  properties  and  phenomena  [1,2] .  In  recent  years  with  the 
increasing  accuracy  of  ab  initio  many-electron  calculations  there  has  been  a  growing 
interest  in  calculating  atomic  static  [3,4]  and  dynamic  [5]  dipole  polarizabilities. 
Since  the  last  10  years  or  so  one  has  accumulated  accurate  theoretical  predictions 
of  the  static  dipole  polarizabilities  of  neutral  ground  [3-5]  and  excited  [6]  states 
of  atoms.  Generally,  for  neutral  ground  state  atoms  10%  of  the  polarizability  value 
is  due  to  the  electron  correlation  effects  [3,4] .  It  is  only  in  more  recent  years,  that 
negative  ions  started  to  receive  more  systematic  attention  [7-19].  In  the  large 
majority  [7-17]  these  have  been  limited  to  closed-shell  H~  [1 1,15,16],  Li~  [12- 
14,17],  and  F~  [7-10]  systems.  The  dipole  polarizabilities  of B ~ ,  C~ ,  and  O  have 
recently  been  calculated  [18].  The  electron  correlation  effect  is  large  and  very  im¬ 
portant  for  these  anionic  systems.  For  the  dipole  hyperpolarizability  of  anions,  very 
few  theoretical  investigations  have  been  made  [14,16,17,20].  One  may  add  to  this 
that  experimental  values  for  the  dipole  polarizabilities  and  hyperpolarizabilities  of 
anions  are  generally  not  available.  In  fact,  there  is  only  little  chance  for  direct 
measurements  on  the  free  atomic  anion.  It  is  even  possible  that  the  free  atomic 
anion  is  unstable  towards  electron  detachment  as  Be~,  N~,  and  Ne~,  for  instance 
[21-23] .  These  prompt  the  necessity  of  well-defined  theoretical  procedures  to  pro¬ 
vide  accurate  numerical  results  and  for  understanding  the  contributing  terms  to 
the  dipole  polarizabilities  and  hyperpolarizabilities  of  stable  atomic  anions. 
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In  this  work  we  shall  report  that  for  some  atomic  anions  the  electron  correlation 
contribution  to  the  hyperpolarizability  may  gross  to  80  or  90%.  That  is,  the  true 
correlated  value  is  several  times  larger  than  the  Hartree-Fock  scf  value.  This  ex¬ 
treme  effect  is  demonstrated  for  the  stable  first-row  atomic  anions,  in  special  H~, 
C~,  and  F~,  using  many-body  perturbation  theory  (mbpt  ),  coupled-cluster  (cc), 
and  configuration  interaction  (ci)  models  [3],  within  the  finite-field  approach  [3,4] 
using  a  five-point  quartic  fit  of  the  energy.  All  calculations  were  made  using  the 
Gaussian-90  program  [24]. 

There  has  been  some  very  recent  interest  on  the  electron  correlation  effects  of 
H~  [11,16,22].  H~  is  a  particularly  interesting  system  to  analyze  in  more  detail 
the  role  of  the  electron  correlation  effects  to  the  dipole  polarizability  and  hyper¬ 
polarizability.  First,  it  is  the  only  negative  ion  where  the  exact  nonrelativistic  limits, 
both  for  the  energy  and  polarizability  (but  not  for  the  hyperpolarizability)  are 
known  [25].  Second,  there  are  no  systematic  analysis  of  the  low-order  correlation 
contributions.  Third,  as  H~  is  a  two-electron  system,  it  is  amenable  to  a  full  ci(fci) 
calculation  thus  allowing,  by  comparison  with  the  exact  result,  to  minimize  basis 
set  deficiencies  and  to  get  a  clear  picture  of  the  correlation  effects  and  the  accuracy 
of  low-order  theoretical  methods.  Also,  the  dipole  hyperpolarizability  of  H~  is  still 
subjected  to  large  uncertainties  [16]  and  the  precise  role  of  the  electron  correlation 
effects  are  not  known.  We  have  thus  first  calculated  the  dipole  polarizability  and 
dipole  hyperpolarizability  of  H~ .  The  basis  set  was  optimized  for  obtaining  the  fci 
dipole  polarizability  of  H~  within  the  exact  nonrelativistic  limit.  The  final  basis 
consists  of  125-8/7  5d2f  totally  uncontracted  gaussian-type  functions.  The  FCI  en¬ 
ergy  is  only  0.22  mhartree  above  the  exact  result  [25,26]  and  0.39  mhartree  below 
the  previously  best  fci  result  [22] .  Another  measure  of  the  accuracy  of  the  present 
results  for  H~  can  be  obtained  from  the  calculated  electron  affinity  value  of  0.749 
eV.  This  is  better  than  the  best  basis-set-based  result  available  (0.740  eV)  [22]  and 
comparable  with  the  near-exact  135-term  Hylleraas  [25]  value  of  0.755  eV  or  the 
recommended  exact  value  of  0.754  eV  [27].  The  calculated  polarizability  and  hy¬ 
perpolarizability  of  H~  are  given  in  Table  I.  The  SCF  result  for  the  polarizability  is 
in  sharp  agreement  with  the  exact  coupled-Hartree-Fock  result  and  it  is  only  44% 
of  the  fci  result.  Of  course,  only  single  (S)  and  double  (D)  excitations  contribute 
to  the  correlation  effects  in  H~ .  It  is  seen  in  Table  I  that  single  excitations  are 
crucial.  The  indirect  contribution  of  single  excitations  (obtained  from  the  difference 
of  fci  and  cid)  is  higher  than  the  direct  contribution  of  double  excitations  (obtained 
from  the  difference  of  cid  and  SCF).  This  is  perhaps  the  first  example  where  the 
contribution  of  single  excitations  to  the  dipole  polarizability  is  higher  than  the 
double  excitations  contribution.  As  a  consequence,  the  partial  inclusion  of  single 
excitations  contribution  gives  only  poor  results  compared  to  the  fci  limit. 

The  situation  is  more  dramatic  for  the  dipole  hyperpolarizability.  Previous  cal¬ 
culations  for  the  dipole  hyperpolarizability  of  H~  include  an  early  uncoupled 
Hartree-Fock  value  of  8.0  X  10 6  a.u  [28]  which  is  nearly  twice  our  SCF  (with 
coupled  Hartree-Fock  quality)  value  of  4.37  X  10 6  a.u.  Previous  correlated  cal¬ 
culations  were  performed  by  using  variation-perturbation  theory  with  Hylleraas- 
type  functions,  but  were  considered  unreliable  by  the  authors  [20].  Indeed  their 
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Table  I.  The  calculated  dipole  polarizability  and  hyperpolarizability  of  H 
with  different  electron  correlation  contributions.  Basis  set  is  12 s  8 p  5  d  2/ 


Method 

Polarizability  (a.u) 

Hyperpolarizability  (106  a.u) 

SCF 

91.39 

4.37 

MBPT  (2) 

124.48 

11.25 

MBPT  (3) 

136.46 

12.04 

D-mbpt  (4) 

143.50 

13.18 

SD-mbpt  (4) 

147.73 

14.84 

CID 

146.20 

12.79 

CCD 

146.20 

12.79 

CCD  +  S  (4) 

150.43 

14.45 

CCD  +  S  (CCD) 

160.21 

16.59 

FCI 

208.30 

54.29 

MCSCF-LR  a 

217.0 

MEMPTb 

201.8 

76.0 

Exact' 

206.39  ±  2.45 

a  Multiconfigurational  self-consistent-field  linear  response  value  of  Fowler, 
Jorgensen,  and  Olsen  [1 1]. 

b  Many-electron  many-photon  theory  results  of  Nicolaides,  Mercouris,  and 
Piangos  [16]. 

c  135-term  Hylleraas-type  result  of  Glover  and  Weinhold  [25]. 


value  of  17.0  X  106  a.u.  is  too  small.  A  recent  calculation  [16]  has  given  the  more 
acceptable  value  of  76.0  X  106  a.u.  Our  best  calculated  value  is  54.3  X  106  a.u.  It 
is  remarkable  to  see  that  the  SCF  model  accounts  for  less  than  10%  of  the  exact 
result.  It  is  also  striking  to  see  that  the  indirect  contribution  of  single  excitations  is 
responsible  for  about  76%  of  the  total  dipole  hyperpolarizability  of/7-.  A  complete 
double-excitation  calculation  obtained  either  from  CCD  or  CID  will  give  only  about 
24%  of  the  exact  value— an  extremely  poor  performance. 

The  situation  for  the  other  stable  atomic  anions  is  also  interesting.  Li~  shows  an 
unusual  near-degeneracy  [12,13]  in  the  ground  state,  which  is  even  more  acute 
then  the  classical  Be  case.  Correlation  effects  are  found  to  be  very  large  for  the 
dipole  polarizability  and  hyperpolarizability  of  Li~  [12-14].  The  other  stable  anions 
of  the  first-row  present  similar  extreme  correlation  contribution.  The  polarizability 
of  F~  was  studied  by  several  investigators  [7-10]  with  results  limited  to  the  mbpt(4) 
level  or  to  very  limited  inclusion  of  triple  excitations.  It  is  clear  that  high  order 
single  and  triple  excitations  are  necessary  for  an  accurate  description  of  the  dipole 
polarizability  of  F" .  As  this  seems  to  have  not  been  considered  before  we  have  first 
calculated  the  dipole  polarizability  of  F~  using  highly  correlated  methods,  including 
qcisd(t  )  [29]  which  is  known  to  be  very  accurate  even  for  difficult  systems.  The 
results  obtained  with  a  large  uncontracted  basis  set  ( \As9p6d2f)  are  displayed  in 
Table  II  and  clearly  show  that  mbpt(4)  is  not  enough  to  give  accurate  estimates 
of  the  dipole  polarizability  and  hyperpolarizability  of  F-.  The  result  of  18.0  a.u 
for  the  dipole  polarizability  of  F-  should  be  the  best  result  now  available.  Table  II 
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Table  II.  The  calculated  dipole  polarizability  and  hyperpolarizability  of  F~ 
with  different  electron  correlation  contributions.  Basis  set  is  \4s9p6d2f 
Correlated  finite  field  energies  are  accurate  within  nine  decimal  figures  to  give 
an  estimated  numerical  error  of  0.005  X  106  a.u  for  the  hyperpolarizabilities. 
scf  energies  are  correct  within  10  decimal  figures. 


Method 

Polarizability  (a.u) 

Hyperpolarizability  (106  a.u) 

SCF 

10.91 

0.009 

MBPT  (2) 

17.19 

0.022 

MBPT  (3) 

12.86 

0.010 

DQ-MBPT  (4) 

14.36 

0.014 

SDQ-MBPT  (4) 

17.47 

0.026 

MBPT  (4) 

21.05 

0.044 

CCD 

14.03' 

0.012 

CCD  +  ST  (4) 

20.72 

0.041 

QC1SD 

18.01 

0.048 

QCISD  (T) 

17.96 

0.058 

also  shows  the  calculated  hyperpolarizability  of  F~ .  No  previous  high-order  cal¬ 
culations  were  found  in  the  literature.  Our  results  indicate  that  the  scf  value  is 
only  15%  of  the  best  calculated  result.  Thus  the  correlation  contribution  to  the 
hyperpolarizability  of  F~  is  about  85%.  The  actual  correlated  result  is  thus  nearly 
seven  times  larger  than  the  Hartree-Fock  model  implies. 

The  extreme  correlation  contribution  to  the  hyperpolarizability  of  atomic  anions 
may  be  exemplified  again  in  C~(4S).  The  dipole  polarizability  of  C~  has  been 
reported  recently  [18],  but  there  is  no  previous  studies  of  its  hyperpolarizability. 
Again  a  large  uncontracted  basis  set,  \As9p6d2f,  was  used  to  compute  its  polar¬ 
izability  and  hyperpolarizability  using  the  high-order  qcisd(t)  correlated  scheme 
[29].  The  results  of  our  calculation  (with  all  electrons  included)  for  the  dipole 
polarizability  and  hyperpolarizability  of  C~  are  given  in  Table  III.  For  the  dipole 
polarizability,  the  SCF  value  is  about  60%  of  the  best  correlated  result  and  the 
inclusion  of  high-order  single  and  triple  excitations  are  very  important.  For  the 
hyperpolarizability,  the  Hartree-Fock  model  accounts  for  only  about  23%  of  the 
qcisd(t  )  value  with  an  extreme  contribution  of  the  electron  correlation  effects  of 
about  77%.  In  other  words,  the  true  correlated  value  for  the  dipole  hyperpolarizability 
of  C~  is  about  four  times  larger  than  that  obtained  at  the  uncorrelated  SCF  level. 
The  inclusion  of  all  electrons  (core  and  valence)  in  the  correlated  calculations  is 
important  to  obtain  absolute  values  for  the  polarizability  and  hyperpolarizability, 
but  the  qualitative  aspects  regarding  the  extreme  correlation  contributions  are  ob¬ 
tained  already  at  the  frozen  core  level. 

Summarizing,  high-order  single  and  triple  excitations  give  important  contributions 
to  the  dipole  polarizability  and  hyperpolarizability  of  the  stable  first-row  atomic 
anions.  For  H~  we  show  that  the  indirect  contribution  of  single  excitations  is  larger 
than  the  direct  contribution  of  double  excitations  to  the  dipole  polarizability.  For 
the  dipole  hyperpolarizability,  electron  correlation  effects  are  extreme  and  account 
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Table  III.  The  calculated  dipole  polarizability  and  hyperpolarizability  of  C 
with  different  electron  correlation  contributions.  Basis  set  is  I4s9p6d2f 


Method 

Polarizability  (a.u) 

Hyperpolarizability  (106  a.u) 

SCF 

50.13 

0.61 

MBPT  (2) 

65.89 

1.28 

MBPT  (3) 

65.04 

1.22 

DQ-MBPT  (4) 

63.31 

1.14 

SDQ-MBPT  (4) 

66.06 

1.30 

MBPT  (4) 

71.02 

1.53 

CCD 

63.26 

1.14 

CCD  +  ST  (4) 

70.98 

1.53 

CCD  +  ST  (CCD) 

69.94 

1.48 

QCISD 

74.92 

2.22 

QCISD  (T) 

81.55 

2.63 

for  more  than  90%  of  the  full  Cl  result,  more  than  75%  coming  from  the  interaction 
of  single  and  double  excitations.  The  full  Cl  value  for  the  dipole  polarizability  is 
208.3  a.u  as  compared  to  the  exact  nonrelativistic  limit  of  206.4  ±  2.5  a.u.  For  the 
dipole  hyperpolarizability  this  limit  is  not  known  and  our  full  Cl  result  is  54.3  X 
106  a.u.  For  C~  and  F~,  the  Hartree-Fock  treatment  gives  only  about  25  and  15%, 
respectively,  of  the  best  correlated  values  for  the  dipole  hyperpolarizabilities,  with 
about  75  and  85%  being  due  to  the  detailed  correlated  motion  of  the  electrons.  Our 
best  value  for  the  dipole  hyperpolarizability  of  C~  is  2.63  X  106  a.u  and  for  F  it 
is  0.06  X  106  a.u  obtained  using  the  highly  correlated  QCISD(T )  theoretical  scheme. 
These  extreme  correlation  effects  should  be  considered  in  any  theoretical  treatment 
of  the  electric  properties  of  atomic  anions.  To  the  application  of  an  electric  field, 
atomic  anions  respond  linearly,  and  more  markedly  nonlinearly,  with  a  highly 
correlated  motion  of  the  electrons. 
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Abstract 

Previous  measurements  of  tunnel  conductance  in  heterostructures  and  experiments  with  Josephson 
junctions  suggest  quantum  tunneling  has  a  definite  duration.  We  use  semiclassical  methods  to  determine 
the  effects  of  this  delay  on  the  tunneling  current  in  a  laser-assisted  stm.  A  planar-planar  STM  model  is 
used  with  the  exact  multiple  image  potential,  and  the  energy  distribution  for  a  free-electron  metal.  It  is 
necessary  to  average  over  the  phase  at  barrier  entry,  and  iteration  with  back  propagated  solutions  is 
required  to  obtain  the  transmission  coefficients  for  evenly  spaced  phases  and  specified  energies  at  barrier 
entry.  The  simulations  suggest  that  the  dependence  of  the  tunneling  current  on  the  wavelength  of  illu¬ 
mination  can  serve  as  a  basis  for  determining  the  duration  of  barrier  traversal.  A  power  flux  density  of 
10 11  W/m2  would  be  required  at  several  wavelengths  from  1  to  10  pm.  It  is  possible  that  thermal  effects 
could  be  separated  from  the  modeled  phenomena  by  determining  the  time  dependence  of  the  tunneling 
current  with  a  pulsed  laser.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

The  duration  of  quantum  tunneling  is  a  topic  that  has  generated  much  interest 
and  controversy  (for  reviews,  see  [1]).  Experiments  with  heterostructures  [2]  and 
Josephson  junctions  [3]  suggest  that  a  specific  time  is  associated  with  barrier  trav¬ 
ersal.  A  variety  of  methods  have  been  considered  for  calculating  the  duration  of 
quantum  tunneling,  but  their  results  do  not  agree  [1] . 

The  duration  of  quantum  tunneling  has  practical  significance.  For  example, 
measurements  of  tunnel  conductance  in  heterostructures  differ  from  theory  by  as 
much  as  2  orders  of  magnitude  unless  the  image  corrections  are  adjusted  to  allow 
for  the  traversal  time  [2].  Tunneling  also  occurs  in  the  ohmic  contacts  of  most 
solid  state  devices  [4],  but  the  traversal  time  is  probably  on  the  order  of  10  fs  so 
that  it  may  generally  be  ignored.  However,  we  anticipate  that  delays  from  quantum 
tunneling  will  have  to  be  considered  as  progress  continues  in  picosecond  switching, 
terahertz  technology,  and  femtosecond  optical  pulses  [  5  ] .  At  high  energies  it  is 
necessary  to  include  tunneling  through  the  reaction  barrier  in  transition  state  theory 
in  order  to  obtain  accurate  quantal  microcanonical  rate  constants  [6].  Thus,  it 
appears  that  the  accuracy  of  models  of  chemical  reactivity  would  be  increased  by 
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correcting  for  delays  from  barrier  penetration  in  addition  to  the  well-known  delays 
associated  with  passage  over  the  barrier  [  7  ] . 

Laser-assisted  scanning  tunneling  microscopy,  utilizing  the  effects  of  laser  radia¬ 
tion  impinging  on  the  gap  between  the  tip  and  the  sample  in  a  scanning  tunneling 
microscope  (stm),  has  shown  promise  for  obtaining  improved  resolution,  and  has 
been  considered  for  use  in  imaging  insulators  [  8  ] .  The  stm  is  extremely  sensitive 
to  radiation,  and  has  been  used  as  a  detector  to  record  absorption  or  fluorescence 
in  laser  spectroscopy  [9]. 

An  experimental  value  of  operational  tunneling  time  has  been  reported  that  is 
based  on  measurements  of  current  rectification  in  a  laser-illuminated  stm  [10]. 
However,  the  observed  decrease  in  current  with  increasing  barrier  length  is  similar 
to  the  dependence  of  transmission  on  the  barrier  length,  so  that  it  appears  that  the 
data  may  be  explained  without  reference  to  tunneling  times.  It  is  the  objective  of 
the  present  paper  to  examine  the  effects  of  the  finite  duration  of  quantum  tunneling 
in  laser-assisted  scanning  tunneling  microscopy  in  order  to  determine  if  there  are 
phenomena  that  could  be  used  as  a  basis  for  experimentally  determining  the  duration 
of  quantum  tunneling. 


Method  of  Analysis 

We  model  tunneling  [  1 1  ]  as  a  “climb  over  the  barrier”  during  energy  fluctuations 
consistent  with  the  uncertainty  principle.  Cohen  postulated  [12]  that  the  probability 
of  such  fluctuations  decreases  exponentially  with  the  product  of  their  magnitude 
and  duration,  which  product  we  refer  to  as  the  action  of  a  fluctuation,  and  showed 
that  this  assumption  leads  to  the  wkb  solution  for  an  opaque  barrier.  Cohen  did 
not  consider  the  duration  of  tunneling,  but  we  have  shown  [11]  that  his  postulate 
results  in  the  semiclassical  traversal  time  which  is  given  by 

r“  =  Jo  \/ 2[K(z)  —  jfJ  ^  (1) 

where  m  and  E  are  the  mass  and  energy  of  the  particle  and  the  barrier  has  height 
V(z )  over  the  interval  0  <  z  <  d.  We  follow  the  convention  of  calling  this  time 
“semiclassical”  because  it  is  the  classical  time  for  traversing  the  inverted  bar¬ 
rier  [13]. 

While  the  result  is  not  unanimous,  1 1  other  methods  of  analysis,  in  addition  to 
considerations  of  energy  fluctuations,  also  result  in  the  semiclassical  expression  for 
the  traversal  time.  These  procedures  include  ( 1)  barrier  modulation  [13,14],  (2) 
modulating  the  incident  wave  [  1 5  ] ,  ( 3 )  solutions  for  a  sharp  wave  packet  [  1 6  ] ,  ( 4 ) 
flux-flux  correlation  [17],  ( 5 )  dissipation  by  a  string  [18],  (6)  dynamics  of  image 
potentials  [  1 9  ] ,  ( 7 )  temperature  effects  in  macroscopic  tunneling  [  20  ] ,  ( 8 )  “bounce” 
time  for  macroscopic  tunneling  [21],  (9)  distributions  of  momentum  [22],  (10) 
duality  with  ray  optics  [23  ] ,  and  (11)  calculations  with  path  integrals  [24] .  Analyses 
based  on  measurements  of  the  tunnel  conductance  in  heterostructures  [2]  are  also 
consistent  with  the  semiclassical  expression  for  the  traversal  time. 
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Various  approximations  have  been  used  in  modeling  the  potential  barrier  of  an 
STM.  For  example,  Miskovsky  et  al.  [25,26]  used  numerical  methods  to  solve  the 
one-dimensional  Schrodinger  equation  for  hyperboloidal-planar  and  planar-planar 
models  of  the  metal  electrodes  with  exact  multiple  image  corrections  for  the  tran¬ 
siting  electrons.  They  also  tested  an  approximation  used  earlier  by  Simmons  [  27] 
for  the  image  corrections  with  planar-planar  models.  Garcia  [28]  made  calculations 
for  planar-planar  and  hemispherical-planar  models,  as  well  as  for  the  approximation 
of  a  square  potential  barrier.  To  avoid  being  restricted  to  electrode  shapes  for  which 
analytical  solutions  for  the  potential  are  known,  Bar’yudin  et  al.  [29]  used  numerical 
methods  to  calculate  the  potential  when  the  tip  electrode  was  represented  by  a  cone 
with  a  smoothed  vertex.  However,  it  appears  that  they  did  not  make  image  correc¬ 
tions  for  the  transiting  electrons.  Previous  to  the  work  in  the  present  paper,  as  a 
first  approximation  we  [30]  used  a  square  potential  barrier  and  neglected  image 
corrections  for  the  transiting  electrons. 

Miskovsky  et  al.  [25,26]  found  little  difference  between  the  values  of  the  potential 
with  hyperboloidal-planar  and  planar-planar  models  of  the  metal  electrodes  in  an 
STM  when  using  exact  multiple  image  corrections  for  the  transiting  electrons.  There 
is  a  slight  asymmetry  in  the  hyperboloidal-planar  barrier  which  is  only  significant 
when  calculating  current  rectification,  a  phenomenon  which  would  not  occur  with¬ 
out  asymmetry.  Since  none  of  the  models  correspond  to  the  actual  shapes  of  the 
electrodes  in  an  STM,  for  simplicity  we  have  chosen  to  use  the  planar-planar  model 
in  the  present  work. 

The  exact  multiple  image  correction  to  the  potential  barrier  for  a  planar-planar 
model  [25,27,31]  is  given  by 


Vj(z)  =  - 


(2) 


where  e  is  the  magnitude  of  the  electron  charge,  z  is  the  coordinate  of  the  transiting 
electron,  and  the  two  image  planes  are  located  at  z  =  0  and  z  =  s,  respectively. 
Miskovsky  et  al.  [25]  found  that  the  values  of  potential  for  planar-planar  models 
of  an  STM  are  typically  within  2%  of  those  for  hyperboloidal-planar  models  when 
exact  multiple  image  corrections  are  used.  However,  when  only  the  first  few  terms 
of  the  infinite  series  in  Eq.  (2)  are  used  there  are  errors  of  40-50%,  and  errors  of 
10-15%  occur  using  the  Simmons  approximation  for  the  series.  Other  approxi¬ 
mations  to  the  infinite  series  have  also  been  suggested  [32].  In  computations  we 
use  the  following  expression  in  which  the  part  of  the  summation  for  n  >  N  —  1  is 
approximated  by  an  integral: 


V,(z)  =  - 


*2  r±,  %' ( _ ™ 

2z+  ~  \(/«)2- 


87T£o 


n—  1 


(3) 


Six-place  accuracy  is  obtained  using  Eq.  (3)  with  N  =  7,  whereas  79  terms  are 
required  in  order  to  obtain  the  same  accuracy  with  Eq.  (2). 
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We  use  a  free  electron  model  for  the  tip  and  sample  electrodes,  and  define  the 
potential  as  zero  for  z  <  zT  which  corresponds  to  the  tip.  For  z  >  zs,  which  cor¬ 
responds  to  the  sample  electrode,  the  potential  is  given  by  eFT  —  eFS  —  eV0  where 
eFT  and  eFS  are  the  Fermi  energy  of  the  tip  and  the  sample,  respectively,  and  V0  is 
the  applied  potential  difference.  In  the  gap  between  the  two  electrodes,  for  which 
zT<  z  <  zs,  the  potential  is  given  by 

V(z)  =  cFT+  cf)T-\-  (z/s)[eFS  +  (f>s~  eFT-  <f>T~  eV0]  +  V7(z),  (4) 

where  (j>T  and  <f>s  are  the  work  functions  of  the  tip  and  the  sample,  respectively,  and 
Vi(z)  is  the  image  correction  specified  in  Eq.  (2).  The  trapezoidal  portion  of  the 
barrier,  which  is  added  to  the  image  potential  in  Eq.  (4),  is  probably  appropriate 
for  distances  greater  than  0.3  nm  from  flat  metal  surfaces  [33] .  The  detailed  shape 
of  the  barrier  at  smaller  distances,  which  depends  on  the  particular  metal  surface, 
is  not  considered  in  the  present  analysis.  The  values  of  zT  and  zs  are  determined 
algebraically  by  setting  V(z)  in  Eq.  (4)  equal  to  the  specified  values  of  the  potential 
within  the  two  electrodes.  The  value  of  zT  is  slightly  greater  than  zero,  and  the  value 
of  zs  is  slightly  less  than  s,  so  that  the  two  image  planes  are  just  below  the  surfaces 
of  the  metal  electrodes. 

The  transmission  coefficient  is  calculated  by  approximating  the  potential  from 
Eq.  (4)  with  a  sequence  of  segments  that  are  flat  barriers  and/or  wells  [34].  We 
begin  by  assuming  that  there  is  a  transmitted  wave  with  unit  amplitude,  but  no 
incident  wave,  at  z  -  zs.  Then  the  wave  is  backpropagated,  by  sequentially  deter¬ 
mining  the  incident  and  reflected  waves  at  the  near  side  of  each  barrier  segment, 
until  the  amplitudes  of  the  incident  and  reflected  waves  are  found  at  z  =  zT.  Since 
the  transmitted  wave  is  required  to  have  unit  amplitude,  the  transmission  coefficient 
is  the  reciprocal  of  the  square  of  the  magnitude  of  the  incident  wave  at  z  =  zT. 
After  thus  calculating  the  forward  current,  this  procedure  is  repeated  to  determine 
the  transmission  coefficient  for  the  return  current  due  to  electrons  flowing  from 
the  sample  to  the  tip. 

In  modeling  the  effects  of  a  laser,  we  assume  that  each  barrier  segment  is  traversed 
in  the  semiclassical  time  (or  the  classical  time  where  the  energy  exceeds  the  poten¬ 
tial).  The  energy  of  the  electron  is  updated  at  each  step  of  back  propagation  in 
calculating  the  transmission  coefficient  in  order  to  allow  for  interaction  with  the 
electric  field  of  light.  Both  the  change  in  energy  during  transit  of  the  barrier  and 
the  traversal  time  depend  on  the  phase  on  entering  the  barrier.  It  is  necessary  to 
average  the  values  of  the  transmission  coefficient  for  a  large  number  of  evenly 
spaced  phases  at  barrier  entry  for  each  specified  value  of  energy  in  order  to  determine 
the  mean  values  of  the  forward  and  reverse  current.  Iteration  with  the  back  prop¬ 
agated  solutions  is  required  in  order  to  obtain  each  of  the  averaged  values  of  the 
transmission  coefficient  since  each  must  correspond  to  a  specified  value  of  particle 
energy  and  phase  on  entering  the  barrier. 

The  magnitude  of  the  forward  or  return  current  density  is  given  by  the  product 
of  the  electronic  charge  and  the  number  of  electrons  per  unit  area  per  unit  time, 
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J(F,  T)  =  e  f"  Z>(F,  E)N(E,  T)  dE , 

Jo 


(5) 


where  D(F,  E)  is  the  transmission  coefficient  and  N(E ,  7")  dE  is  the  number  of 
electrons  with  normal  energy  between  E  and  E  +  dE  impinging  on  the  surface 
from  within  the  metal.  Following  the  free-electron  theory  of  metals  [33],  and  as¬ 
suming  that  N(E,  T)  is  given  by  the  equilibrium  value,  we  set 


N(E,  T)  = 


mkBT 

2^ln 


1+eXP("^).’ 


(6) 


where  wi  is  the  electron  mass,  kB  is  Boltzmann’s  constant,  T  is  the  temperature, 
and  h  is  Planck’s  constant.  The  Fermi  energy  eF  is  set  equal  to  eT  for  the  forward 
current  and  es  when  determining  the  return  current. 

We  use  Eq.  (5)  to  determine  the  forward  and  return  current  density  at  specified 
values  of  the  applied  potential,  temperature,  and  the  wavelength  and  power  flux 
density  of  the  laser.  The  integral  in  Eq.  (5)  is  approximated  by  a  sum  over  a  series 
of  evenly  spaced  energies  that  are  chosen  so  that  the  values  of  current  density  at 
the  ends  of  the  interval  are  less  than  0.1%  of  the  maximum.  The  increment  of 
energy  is  typically  0.05  eV.  A  total  of  1500  barrier  intervals  is  typically  required  in 
the  calculations.  Time-averaged  values  of  the  current  density  are  determined  using 
the  results  for  a  minimum  of  100  evenly  spaced  phases  at  barrier  entry.  The  results 
of  simulations  are  presented  in  the  following  section  of  this  paper. 


Results  of  Analysis 

Figures  1  and  2  illustrate  some  of  the  phenomena  which  occur  in  an  STM  without 
illumination  by  a  laser.  These  two  figures  show  the  ratio  of  the  return  current  to 
the  forward  current,  and  the  net  tunneling  current,  as  a  function  of  the  applied  bias 
potential  V0.  Since  a  planar-planar  model  was  used,  and  both  electrodes  were 
assumed  to  have  the  same  characteristics  (tungsten  with  <f>  =  4.5  eV  at  20°C),  there 
is  no  asymmetry  so  that  the  return  current  equals  the  forward  current  for  a  net 
tunneling  current  of  zero  when  there  is  no  applied  bias. 

The  length  of  the  gap  between  the  tip  and  the  sample  is  10  A  in  Figure  1  and  15 
A  in  Figure  2.  -In  Figure  1  it  may  be  seen  that  the  tunneling  current  increases 
linearly  with  the  applied  bias  when  the  gap  length  is  10  A.  The  tunneling  junction 
has  an  electrical  resistance  of  approximately  3  MQ  with  this  relatively  short  gap 
length.  In  Figure  2  it  may  be  seen  that,  except  for  very  low  values  of  the  applied 
bias,  the  tunneling  current  increases  exponentially  with  the  bias  when  the  gap  length 
has  the  relatively  large  value  of  15  A.  These  observations  may  be  understood  by 
considering  the  behavior  of  the  wkb  approximation  for  the  transmission  coefficients. 
The  net  tunneling  current,  which  is  the  difference  between  the  forward  and  reverse 
currents,  may  be  approximated  by  an  integral  of  the  difference  between  two  ex¬ 
ponentials.  For  small  values  of  the  gap  length,  or  for  sufficiently  small  values  of 
the  applied  bias  with  a  gap  having  arbitrary  length,  the  difference  between  the  two 
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Figure  1 .  Ratio  of  the  return  current  to  the  forward  current,  and  the  net  tunneling  current, 
as  a  function  of  the  applied  bias  for  an  stm  with  a  tungsten  tip  and  sample  at  20°C  and 

a  gap  length  of  10  A. 


Figure  2.  Ratio  of  the  return  current  to  the  forward  current,  and  the  net  tunneling  current, 
as  a  function  of  the  applied  bias  for  an  stm  with  a  tungsten  tip  and  sample  at  20 °C  and 

a  gap  length  of  1 5  A. 
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exponential  terms  is  small.  Thus,  by  expanding  the  difference  between  the  two 
terms,  it  may  be  seen  that  the  tunneling  current  is  approximately  proportional  to 
the  applied  bias.  However,  for  relatively  large  values  of  the  gap  length,  the  expo¬ 
nential  term  representing  the  return  current  may  be  neglected,  so  that  the  tunneling 
current  varies  exponentially  with  the  applied  bias.  There  is  a  significant  difference 
in  the  energy  distributions  of  the  tunneling  electrons  for  the  two  gap  lengths  used 
in  Figures  1  and  2.  Lower  energy  electrons  may  cross  the  barrier  in  the  forward 
and  return  currents  with  10  A  than  with  the  longer  gap  length  of  15  A. 

Figure  3  shows  the  tunneling  current  as  a  function  of  the  applied  bias  and  the 
length  of  the  gap  in  a  planar-planar  model  of  an  STM.  As  in  the  two  previous 
figures,  the  calculations  were  made  for  a  tungsten  tip  and  sample  at  20  °C,  assuming 
there  is  no  illumination  by  a  laser.  It  may  be  seen  in  Figure  3  that,  for  small  values 
of  the  applied  bias,  the  tunneling  current  decreases  by  approximately  a  factor  of 
10  for  each  increase  in  the  gap  length  by  1  A,  which  agrees  with  experiments  [35] . 
However*  the  slope  is  decreased  for  larger  values  of  the  applied  bias.  This  change 
in  slope  is  consistent  with  the  observation  that  the  tunneling  current  varies  more 
rapidly  with  the  applied  bias  as  the  gap  length  is  increased,  noted  previously  when 
comparing  Figures  1  and  2. 

The  simulations  described  in  the  remainder  of  this  section  include  illumination 
by  a  laser.  As  the  electrons  interact  with  the  electric  field  of  light,  the  periodic  nature 
of  that  field  serves  as  a  clock  for  the  motion  of  the  particles.  Thus,  unlike  the 
simulations  described  earlier,  specific  effects  will  be  seen  that  are  caused  by  the 
finite  duration  of  quantum  tunneling. 


Figure  3.  Tunneling  current  as  a  function  of  the  applied  bias  and  the  length  of  the  gap 
for  an  STM  with  a  tungsten  tip  and  sample  at  20°C. 
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Figure  4  shows  the  effect  of  the  applied  bias  and  the  length  of  the  gap  on  the 
ratio  of  the  tunneling  current  with  illumination  by  a  laser  to  the  current  without 
the  laser.  The  calculations  were  made  for  a  tungsten  tip  and  sample  at  20 °C,  as¬ 
suming  a  power  flux  density  of  1011  W/m2  at  a  wavelength  of  10  n m.  Analyses 
suggest  [36]  that  illumination  with  such  a  large  value  of  power  flux  density  would 
cause  significant  heating  of  the  tip  in  an  STM,  thus  increasing  emission  from  the 
tip.  This  conclusion  is  supported  by  analyses  and  experiments  [37,38]  made  pre¬ 
viously  for  laser-illuminated  field  emission.  Nevertheless,  it  was  necessary  to  use  a 
power  flux  density  of  10 11  W/m2  in  the  simulations  in  order  to  obtain  significant 
effects.  For  a  power  flux  density  of  10 11  W/m2  the  electric  field  intensity  has  a 
magnitude  of  8.68  MV  /m  rms,  so  that  the  change  in  energy  of  the  transiting  electrons 
over  the  short  distance  between  the  tip  and  the  sample  is  on  the  order  of  milli- 
electron  volts. 

It  may  be  seen  in  Figure  4  that,  for  constant  applied  bias,  the  effect  of  the  laser 
on  the  tunneling  current  increases  with  the  length  of  the  gap.  This  may  be  understood 
because  increasing  the  gap  length  increases  the  total  distance  for  interaction  of  the 
electrons  with  the  electric  field  of  light.  For  a  constant  gap  length  and  small  values 
of  the  applied  bias,  the  increase  in  the  tunneling  current  due  to  the  laser  increases 
with  the  bias.  This  may  be  understood  because  increasing  the  bias  decreases  the 
contribution  to  the  current  by  electrons  with  energies  well  below  the  Fermi  level, 
and  we  have  found  that  the  effect  of  the  laser  is  less  pronounced  at  the  lower 
energies.  However,  the  increase  in  tunneling  current  is  less  pronounced  for  large 
values  of  the  bias,  which  we  attribute  to  increased  dominance  by  the  static  field 


Figure  4.  Increase  in  tunneling  current  due  to  illumination  as  a  function  of  the  applied 
bias  and  the  length  of  the  gap  for  an  STM  with  a  tungsten  tip  and  sample  at  20°C  with  a 
power  flux  density  of  10 11  W/m2  at  a  wavelength  of  10  jum. 
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rather  than  the  electric  field  of  light.  Thus,  for  a  constant  gap  length  there  is  a  value 
of  the  applied  bias  for  which  the  effect  of  the  laser  is  maximum. 

Figure  5  shows  the  effect  of  the  wavelength  of  illumination  on  the  ratio  of  the 
tunneling  current  with  the  laser  to  the  current  without  illumination.  The  calculations 
were  made  for  an  STM  with  a  tungsten  tip  and  sample  at  20  °C,  with  a  gap  length 
of  25  A,  an  applied  bias  of5  V,  andapower  flux  density  of  1011  W/m2.The  values 
of  gap  length  and  bias  were  chosen  to  correspond  to  the  values  for  which  the  effect 
of  illumination  appears  to  be  maximum  in  Figure  4. 

It  may  be  seen  in  Figure  5  that  the  increase  in  tunneling  current  due  to  illumi¬ 
nation  vanishes  for  short  wavelengths,  and  approaches  a  constant  in  the  long  wave¬ 
length  limit.  These  two  characteristics  are  in  agreement  with  the  results  of  earlier 
calculations  in  which,  as  a  first  approximation,  we  [30]  used  a  square  potential 
barrier  and  neglected  image  corrections  for  the  transiting  electrons.  These  two  effects 
may  be  understood  as  follows:  For  modulation  at  long  wavelengths  for  which  the 
period  is  much  greater  than  the  traversal  time,  the  tunneling  electron  sees  a  static 
electric  field  added  to  the  barrier.  Thus,  the  height  of  the  barrier  is  either  increased 
or  decreased  depending  on  the  phase  at  the  time  at  which  the  particle  enters  the 
barrier.  Due  to  the  nonlinear  dependence  of  current  on  barrier  height  these  changes 
do  not  cancel,  but  rather  the  mean  value  of  the  tunneling  current  is  increased.  For 
example,  by  averaging  values  calculated  using  the  transmission  coefficient  for  a 
rectangular  barrier  it  may  be  shown  that  when  2.5  eV  electrons  are  incident  on  a 
barrier  with  a  height  of  3.0  V  and  a  length  of  2.0  nm  the  current  is  increased  by 
4.2%  by  adding  1%  modulation  to  the  height  of  the  barrier.  On  the  other  hand,  for 


Figure  5.  Increase  in  tunneling  current  due  to  illumination  as  a  function  of  the  wavelength 
for  an  STM  with  a  tungsten  tip  and  sample  at  20°C  with  a  gap  length  of  25  A,  an  applied 
bias  of  5  V,  and  a  power  flux  density  of  10 11  W/m2. 
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illumination  at  short  wavelengths  for  which  the  period  is  less  than  the  traversal 
time,  the  energy  of  the  tunneling  electron  is  alternately  increased  and  decreased 
due  to  interaction  with  the  electric  field  of  light,  and  the  effects  due  to  these  changes 
tend  to  cancel. 


Discussion  and  Conclusions 

Pimpale,  Holloway,  and  Smith  [14]  described  numerical  solutions  for  a  Gaussian 
wavepacket  incident  on  a  Gaussian  barrier  with  oscillating  height.  Their  method 
differs  from  that  used  in  this  paper,  but  they  also  observed  that  ( 1 )  transmission  is 
maximum  in  the  low  frequency  limit,  (2)  transmission  decreases  to  that  for  the 
unmodulated  barrier  in  the  high  frequency  limit,  and  (3)  averaging  is  required 
because  the  transmission  depends  on  the  phase  at  the  time  at  which  the  particle 
enters  the  barrier.  The  numerical  procedure  used  in  the  present  paper  has  the  ad¬ 
vantage  that  the  tunneling  current  may  be  determined  for  a  specific  value  of  energy, 
or  for  the  distribution  of  energies  in  a  free-electron  metal,  rather  than  for  the  dis¬ 
tribution  in  a  Gaussian  wavepacket. 

The  results  of  the  analysis  presented  in  this  paper  suggest  that  it  may  be  possible 
to  determine  the  duration  of  barrier  traversal  from  measurements  of  the  tunneling 
current  in  a  laser-assisted  STM.  However,  there  are  many  phenomena  [39]  such  as 
current  rectification,  photoassisted  tunneling,  thermal-assisted  tunneling,  and  ther¬ 
mal  expansion,  which  have  not  been  included  in  the  present  simulations.  For  ex¬ 
ample,  the  calculations  for  Figures  4  and  5  were  made  assuming  illumination  with 
a  power  flux  density  of  10 11  W/m2,  and  analyses  and  experiments  [36-38]  have 
shown  that  a  substantial  increase  in  the  tunneling  current  may  be  caused  by  heating 
of  the  tip  with  the  focused  beam  of  a  laser.  It  is  possible  that  thermal  effects  could 
be  separated  from  the  nonthermal  phenomena  which  are  modeled  in  this  paper  by 
measuring  the  time  dependence  of  the  tunneling  current.  This  is  because  the  thermal 
relaxation  time  is  on  the  order  of  to  ms,  as  compared  with  times  of  a  few  pi¬ 
coseconds  for  the  nonthermal  effects  of  laser  illumination  [38]. 

Since  a  planar-planar  model  was  used  for  the  barrier  in  an  STM,  we  have  not 
included  the  asymmetry  which  others  [25,26]  have  previously  considered  as  the 
cause  for  optical  rectification.  An  experimental  Value  of  operational  tunneling 
time  has  already  been  determined  from  measurements  of  rectification  in  a  laser- 
illuminated  STM  [10].  However,  it  appears  that  the  experimental  phenomena  that 
have  been  referred  to  as  “current  rectification”  may  be  attributed  to  other  effects 
instead  of  the  interaction  of  the  electric  field  of  light  with  tunneling  electrons  [36,40] . 
Thus,  it  is  possible  that  the  experimental  phenomena  are  not  directly  related  to  the 
duration  of  quantum  tunneling. 

Even  with  a  power  flux  density  of  10 1 1  W/m2,  for  which  the  electric  field  is  8.68 
MV/m  rms,  the  effects  of  a  laser  on  the  tunneling  current  in  an  stm  are  quite 
small.  This  is  because  the  change  in  energy  of  the  transiting  electrons  over  the  short 
distance  between  the  tip  and  the  sample  is  only  on  the  order  of  milli-electron  volts. 
In  Figure  4  it  is  seen  that  the  effect  of  a  laser  on  the  tunneling  current  increases 


LASER-ASSISTED  SCANNING  TUNNELING  MICROSCOPY 


281 


with  the  length  of  the  gap,  which  may  be  understood  because  this  increases  the 
total  distance  for  interaction  of  the  electrons  with  the  electric  field  of  light.  Thus, 
it  may  be  more  appropriate  to  use  a  field  emission  tube  than  an  STM  for  the  deter¬ 
mination  of  tunneling  times  because  of  the  increased  length  of  the  barrier  [41]. 
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Abstract 

We  present  early  results  from  the  CHANNEL  code,  which  simulates  the  passage  of  ionized  projectiles 
through  bulk  solids.  CHANNEL  solves  the  classical  equations  of  motion  for  the  projectile  using  a  force 
obtained  from  the  gradient  of  the  quantum  mechanically  derived  coulombic  potential  of  the  solid  [de¬ 
termined  via  a  full  potential  augmented  plane  wave  (flapw)  calculation  on  the  bulk]  and  a  quantum 
mechanical  energy  dissipation  term,  the  stopping  power,  as  determined  from  the  method  of  Echenique, 
Nieminen,  and  Ritchie.  The  code  then  generates  the  trajectory  of  the  ionic  projectile  for  a  given  incident 
position  on  the  unit  cell  face  and  an  initial  velocity.  We  use  CHANNEL  to  generate  an  ion  (proton) 
implantation  profile  for  the  test  case  of  simple  cubic  hydrogen  with  the  projectile’s  initial  velocity  parallel 
to  the  (100)  channel.  Further  preliminary  results  for  ion  implantation  profiles  of  protons  in  diamond 
structured  Si,  with  initial  velocity  along  the  (100)  and  (110)  channels,  are  given.  ©  1994  John  Wiley  & 
Sons,  Inc. 


Introduction 

The  CHANNEL  code  (a  detailed  treatment  will  be  presented  in  Ref.  [1])  calculates 
trajectories  for  low  energy  ions,  E  «  1-25  keV,  impinging  on  bulk  solids.  The 
method  uses  realistic,  quantum  mechanically  derived  charge  densities,  potentials, 
and  forces  in  describing  the  interaction  of  the  projectile  (thus  far  restricted  to  protons) 
with  the  bulk  solid  target.  This  type  of  simulation  enables  prediction  of  the  pene¬ 
tration  depths  (ion  implantation  profiles)  as  well  as  observation  of  the  channeling 
behavior  of  the  incident  ions. 

Successful  modeling  of  ion  trajectories  in  solids  is  obviously  desirable.  With  a 
physically  realistic  simulation,  a  theoretical  study  of  ion  implantation  profiles  can 
be  made.  In  turn,  this  would  allow  predictive  studies  of  defect  growth  in  micro¬ 
electronic  devices  which  are  subjected  to  various  environments  in  which  radiation 
effects  are  important  (solar  flux,  reactor  sensors,  radiation  therapy).  In  addition, 
predictive  studies  on  various  potential  radiation  hard  compounds  and  devices,  as 
well  as  radiation  detectors,  can  be  made.  Lastly,  the  usefulness  of  such  a  simulation 
in  the  doping  of  semiconductor  chips  can  not  be  overlooked. 

This  method  is  intended  to  yield  better  (in  the  sense  of  including  more  basic 
physics)  results  than  previous  codes  such  as  the  Monte  Carlo  codes,  MARLOWE 
[2-5],  TRIM  [6-8],  BCCRY  [9],  PEPPER  [10],  BABOUM  [11],  and  LAROSE 
[  1 2].  In  particular  these  tend  to  treat  the  scattering  of  the  incident  ions  by  the  crystal 
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lattice  as  binary  collisions  using  screened  atomic-like  potentials.  The  dissipation 
force  used  includes,  in  an  approximate  way,  both  electronic  and  nuclear  energy 
dissipation  terms.  Some  programs,  like  TRIM  and  BABOUM,  treat  the  target  as 
an  amorphous  or  random  solid.  However,  TRIM  can  do  calculations  of  various 
compounds  which  have  up  to  three  layers  of  differing  materials.  CHANNEL,  on 
the  other  hand,  appears  to  be  the  first  code  which  uses  a  potential  and  charge 
density  generated  from  a  quantum  mechanical  crystalline  calculation.  In  calculating 
ion  implantation  profiles,  the  ion’s  incident  energy  must,  by  necessity,  be  low  (of 
the  order  keV’s)  in  order  for  the  ion  to  be  trapped  in  the  solid  in  a  computationally 
realistic  time.  Thus,  by  the  nature  of  the  problem,  the  ion’s  trajectory  will  be  influ¬ 
enced  strongly  by  the  potential  with  which  it  interacts.  This  means  that  a  method 
for  calculating  accurate  ion  implantation  profiles  must  have  a  realistic  representation 
of  the  crystalline  potential  as  well  as  of  the  dissipative  force.  The  method  of  Eche- 
nique  et  al.  [13a,  13b,  14],  which  we  use,  is  particularly  suited  for  low  energy  ions. 
It  expresses  the  energy  dissipation  term  as  a  function  of  the  local  electronic  density. 
This  dependence  allows  an  accurate  determination  of  the  dissipative  force  from 
the  solid’s  electronic  charge  density.  As  we  use  a  density  functional  theory  based 
calculation  of  the  bulk  structure,  and  use  the  density  to  determine  the  energy  dis¬ 
sipation  (stopping)  this  method  is  conceptually  consistent,  at  least  in  the  sense  that 
all  essential  quantities  are  obtained  from  the  electron  density. 

We  present  our  first  calculations  simulating  the  interaction  of  E  <  1  keV  protons 
with  extended  systems.  The  first  solid  target  studied  was  simple  cubic  hydrogen 
(scH).  The  reasons  for  selecting  scH  as  our  test  case  are:  (1)  It  has  a  very  simple 
structure  which  allows  us  straightforwardly  to  analyze  the  results,  and  (2)  the  one- 
electron  nature  of  H  makes  the  calculation  of  the  potential  and  charge  density 
computationally  easy.  After  analyzing  the  scH  results  and  determining  the  success 
of  the  method,  we  present  preliminary  results  for  Si  with  the  diamond  structure, 
space  group  Fd3m-Oj  (diaSi). 


Method 


In  approaching  the  simulation  of  ion  trajectories  in  solids,  we  make  the  following 
argument.  We  want:  (1)  a  realistic  representation  of  the  crystalline  coulombic  po¬ 
tential  which  the  ion  will  see,  (2)  a  realistic  energy  dissipation  term  to  represent 
the  stopping  power  of  the  solid,  and  (3)  a  reasonably  quick  computation.  With 
these  goals  in  mind,  we  chose  to  use  a  classical  equation  of  motion  for  which  the 
input  potentials  are  quantum  mechanical  in  origin.  Newton’s  equation  of  motion 
for  the  force  F  on  a  particle  at  position  r  is 

F(r)  =  mr,  (1) 

where 

r  =  r(t)  (2) 

is  the  particle’s  (in  this  case  a  proton)  position  as  a  function  of  time  t.  The 
force,  F(r), 
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F(r)  =  -VC7(r)  -  —  (r)  (3) 

consists  of  two  parts:  the  coulombic  potential  U( r)  and  —{dE/dx){ r),  the  energy  loss 
(stopping  power)  along  the  trajectory.  Both  of  these  quantities  are  determined  via 
a  full  potential  augmented  plane  wave  (flapw)  calculation  [15].  The  stopping  power 
can  be  written 


-  (r)  =  Q(rs(r))\,  (4) 

where 

rl  r)  =  [3/47rp(r)]1/3.  (5) 

Here  Q(rs)  is  the  energy  dissipation  term  due  to  interaction  of  the  projectile  with 
the  solid  as  it  passes  through  the  system,  evaluated  here  using  the  method  of  Eche- 
nique  [13,14],  which  is  consistent  with  the  local  density  approximation  (lda)  used 
in  the  flapw  calculation  [  1 5].  Q  is  dependent  on  the  local  electron  density,  described 
by  the  local  electron  sphere  radius  rs.  The  value  for  the  dissipative  force  <2(r)v  can 
be  written  analytically  [13,14]  as 

Q(r)v  =  v%p(r)(7tr(U/r),  (6) 

where  vF  is  the  Fermi  velocity  of  the  solid  and  v  is  the  ion’s  velocity.  The  transport 
cross  section  atr  is  given  by 

<r„  =  ^  £  (X  +  1)  sin2[5A(£»  -  W£f)],  (7) 

VF  A=0 

where  5X  are  the  scattering  phase  shifts  at  the  Fermi  energy  £>.  Throughout  the 
calculation,  Hartree  atomic  units  are  used. 

Channels 

Before  proceeding  to  the  results,  it  is  useful  to  consider  what  is  meant  by  the 
terminology  “channel”  in  a  solid.  Qualitatively  a  channel  is  an  open  region  char¬ 
acterized  by  a  longitudinal  axis  aligned  parallel  with  a  crystallographic  direction. 
The  qualitative  aspect  is  that  the  channel  cross  section  is  determined  by  the  extent 
to  which  the  atoms  adjacent  to  the  channel  protrude  into  it.  As  the  concept  of  an 
atomic  radius  is  chemically  and  physically  appealing,  but  ill-defined,  so  also  are 
the  lateral  boundaries  of  the  channel  ill-defined.  Thus,  when  one  looks  along  a 
channel  direction  or  axis  in  a  hard  sphere  model,  gaps  or  holes  in  the  solid  which 
run  the  entire  extent  of  the  solid  (see  Figs.  1  and  2)  are  visible.  The  center  of  a 
channel  is  defined  as  the  center  of  the  geometric  figure  defined  by  the  projection 
of  the  space  group  sites  onto  the  path  perpendicular  to  the  channel  axis.  Again  see 
Figures  1  and  2.  Note:  The  channel  center  need  not  have  the  same  point  group 
symmetry  as  that  used  to  find  the  channel  (see  Fig.  2,  Si  (100)  and  (110)). 
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Figure  1 .  Simple  cubic  lattice:  (a)  simple  cubic  unit  cell  showing  the  atomic  positions  of 
H;  (b)  view  along  the  axis  of  the  (100)  channel;  (c)  a  view  looking  down  the  (110)  channel; 
(d)  a  view  looking  down  the  (111)  channel.  The  X’s  mark  the  center  of  the  channels  in 
(b),  (c),  and  (d).  The  various  sizes  of  spheres  indicates  that  the  atoms  are  at  different 
distances  along  the  channel  direction. 


Classically,  channels  represent  directions  of  lowered  resistance  to  the  passage  of 
ions.  They  are  regions  of  relatively  low  electronic  charge  density  as  they  are  interstitial 
regions  by  nature,  and  as  such  are  regions  of  low  stopping  power  (regions  of  low 
Q  and  high  rs).  Thus  a  projectile  will  have  a  tendency  to  be  deflected  from  an 
arbitrary  path  so  that  it  travels  along  a  channel  regardless  of  its  incident  velocity, 
provided  that  its  velocity  is  not  so  great  that  it  does  not  scatter  to  any  appreciable 
extent  as  it  traverses  the  solid.  What  this  implies  is  that  an  ion  will  channel  axially 
provided  the  ion’s  velocity  [v  =  (Uj.  t?u),  where  v±  is  the  component  of  the  velocity 
transverse  to  the  channel  direction  and  V\\  the  component  parallel  to  the  channel 
direction]  does  not  have  too  large  a  perpendicular  component.  This  says  that  v± 
has  a  maximum  value  beyond  which  axial  channeling  does  not  occur  and  that 
vTx  is  sample-dependent.  Thus  there  is  a  maximum  impact  angle  that  an  ion  can 
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a)  <  1  0  0  > 
ChWQAAl 


Figure  2.  Diamond  structure  lattice:  (a)  view  looking  down  the  (100)  channel;  (b)  view 
looking  down  the  (110)  channel.  The  various  shapes  and  sizes  of  spheres  indicate  that 
the  atoms  are  actually  located  at  different  distances  along  the  channel  direction.  The  dashed 
lines  are  used  to  indicate  the  channel  boundaries. 


have  in  order  to  undergo  channeling  immediately  upon  entering  the  solid.  For  high 
energy  projectiles  ( E  >  1  MeV)  this  impact  angle,  given  by  arctan(u1/U||),  has  a 
upper  limit  which  can  be  approximated  by 

arctan(t>±/t;||)  «  v±ax/v,  (8) 

where  uTax  for  scH  was  found  to  be  less  than  0. 1  keV.  For  scH  this  analysis  would 
lead  to  a  maximum  incident  angle  for  channeling  of  approximately  0.5°.  For  high 
energy  ions  this  angle  is  very  important  since  vL  may  not  be  dissipated  quickly 
enough  to  allow  the  ion  to  demonstrate  any  axial  channeling  before  leaving  the 
sample.  Low  energy  projectiles  (E  in  keV  range)  tend  to  follow  the  potential  well 
minima  within  a  relatively  short  distance  after  entering  the  solid  so  that  this  max¬ 
imum  impact  angle  is  not  very  critical,  as  our  trajectories  indicate. 


(a-n.)  y  (a.u.) 
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Figure  3.  Case  (1)  of  sc  f 
plane;  (b)  the  vx  compc 
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Vx  VS  X 


Figure  3.  Case  (1)  of  scH:  (c)  enlargement  of  the  end  of  the  proton’s  trajectory.  The 
orbiting  behavior  indicates  that  the  proton  is  trapped  by  the  crystalline  potential. 


Results 


scH 

The  first  test  case  was  simple  cubic  atomic  hydrogen  (scH).  We  performed  an 
flapw  calculation  using  a  lattice  constant  ae  -  2.54  a.u.  This  lattice  constant  was 
chosen  because  it  is  slightly  compressed  relative  to  equilibrium  [16],  and  conse¬ 
quently  gives  electronic  densities  in  the  interstitial  region  which  ensure  rs  <  5  a.u. 
(upper  limit  of  our  analytic  fit  to  Q(rs);  see  Ref.  [1]).  The  parameters  of  interest  for 
the  flapw  calculation  are  the  plane  wave  expansion  cutoff  Rmt  *  Kmax  =  8  ( Rmt  is 
the  radius  of  the  atomic  sphere  and  Kmax  the  largest  plane  wave  used),  the  maximum 
/value  used  in  the  spherical  harmonic  expansion  of  the  potential  and  charge  density 
is  /max  -  6,  and  the  radius  of  the  spherical  regions  is  Rmt  =  1.2699  a.u.  (chosen  to 
require  touching,  nonoverlapping  spheres).  We  chose  the  (100)  channel  and  tried 
various  initial  conditions.  The  first  case  (1)  was  with  the  proton’s  initial  velocity 
parallel  to  [100]  direction  and  its  initial  position  the  center  of  the  channel.  The 
second  case  (2)  had  the  same  initial  position  as  (1)  but  the  proton’s  initial  velocity 
was  tilted  relative  to  the  channel  direction  so  that  v±  >  v±ax.  The  third  case  (3)  had 
the  same  initial  velocity  as  ( 1)  but  with  the  initial  position  displaced  from  the  center 
of  the  channel.  These  three  cases  demonstrate  that  CHANNEL  has  the  proper 
behavior  for  an  accurate  simulation  of  proton  trajectories  in  solids. 
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Case  (1)  results  are  shown  in  Figure  3.  It  shows  that  a  proton  traveling  down  the 
highly  symmetric  channel  center  does  not  deviate  from  its  initial  path  until  it  is 
trapped,  and  even  then  the  deviation  is  small  (see  Fig.  3).  The  straight  line  trajectory 
is  what  one  would  expect  from  symmetry.  The  absence  of  apparent  deviation  from 
a  straight  line  path  demonstrates  that  numerical  errors  are  not  accumulating.  If 
they  were,  unrealistic  trajectories  would  occur,  which  is  not  the  case.  In  examining 
the  numerical  values  of  the  ion’s  position  the  deviations  from  a  linear  trajectory 
are  of  the  order  10“ 14  a.u.  perpendicular  to  the  channel  direction  until  the  proton 
is  trapped.  Once  the  proton  is  trapped  within  a  unit  cell,  any  deviation  due  to 
numerical  round  off  has  no  effect  since  the  proton  remains  within  the  same  unit 
cell  [see  Fig.  3(c)], 

Case  (2)  results  are  shown  in  Figure  4.  We  see  that  the  proton  travels  several  unit 
cells  perpendicular  to  the  channel  axis  before  v±  is  sufficiently  dissipated  that  the 
proton’s  average  trajectory  is  along  the  channel  axis.  Once  the  proton  starts  to 
channel,  the  trajectory  clearly  demonstrates  that  the  proton  scatters  from  the  solid’s 
potential  energy  surface,  and  an  oscillatory  path  ensues. 

Case  (3)  results  are  shown  in  Figure  5.  Figure  5(a)  shows  that,  though  the  proton 
initially  has  a  velocity  with  v±  =  0,  it  is  subjected  immediately  to  a  force  perpen¬ 
dicular  to  (100)  and  v±  very  quickly  becomes  nonzero.  Figure  5(b)  displays  the 
trajectory  of  the  proton  as  it  becomes  trapped  by  the  solid.  The  end  of  the  trajectory 


0  100  200  300  400  500  600 

x  (a.u.) 

Figure  4.  Case  (2)  of  scH.  The  dotted  lines  indicate  the  projection  of  the  atomic  positions 
onto  the  x~y  plane.  The  x-axis  is  parallel  to  the  (100)  channel. 
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shows  that  the  proton  “stops”  in  the  crystal  with  nonzero  velocity.  It  is  trapped  in 
a  potential  well,  then  comes  to  rest  slowly,  much  latter.  Figure  5(b)  clearly  shows 
the  proton  scatters  from  the  potential  maxima  which  are  located  at  the  atomic  sites 
(the  projections  of  the  atomic  sites  are  represented  by  the  diamonds).  Thus  the 
proton’s  trajectory  becomes  quite  erratic  once  its  velocity  has  been  dissipated  suf¬ 
ficiently  to  the  point  where  the  velocity  is  of  the  order  of  the  trapping  velocity. 

Satisfied  that  CHANNEL  behaves  as  expected,  we  calculated  the  ion  implantation 
profile  of  scH  for  the  (100)  channel  for  0.25  keV  protons  (u  =  0.1  a.u.).  This  study 
required  an  entire  series  of  calculations  with  the  same  initial  velocity  but  different 
initial  positions  throughout  a  segment  of  the  channel  cross-sectional  area  which  is 
representative,  through  symmetry  operations,  of  the  entire  cross  section.  The  rep¬ 
resentative  area  is  shown  in  Figure  6,  and  the  ion  implantation  profile  in  Figure  7. 
There  were  a  total  of  45  points  in  the  representative  area  (which  corresponds  to 
only  1/8  of  the  full  channel  area).  This  selection  represents  a  total  of  289  points 
over  the  entire  cross-sectional  area.  From  the  ion  implantation  profile  we  see  that 
an  average  penetration  depth  of  625  a.u.  can  be  inferred.  In  addition  to  the  average 
penetration  depth  (usually  called  range),  there  are  three  other  distances  of  interest: 
the  path-length  straggling,  lateral  straggling,  and  penetration  depth  straggling  or 
range  straggling.  Path-length  straggling  is  defined  to  be  the  “variation  of  total  path 
length  for  monoenergetic  identical  ions”  [  1 7],  while  range  straggling  is  the  variation 
in  total  penetration  depth  and  lateral  straggling  is  the  variation  in  total  displacement 
perpendicular  to  the  channel  direction.  From  Figure  7  we  cannot  determine  the 
path  length  or  lateral  straggling  but  range  straggling  can  be  determined.  The  pen¬ 
etration  depth  straggling  for  scH  is  approximately  550  a.u. 

diaSi 

In  order  to  examine  systems  of  real  scientific  interest,  we  have  started  calculations 
of  the  ion  implantation  profiles  of  0.25  keV  protons  in  diamond  structured  Si 


Figure  6.  Diagram  of  the  representative  area  of  the  (100)  channel  cross  section  for  scH. 
The  solid  spheres  are  representative  of  the  computational  grid  points. 
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Figure  7.  Ion  Implantation  profile  for  0.25  keV  protons  incident  along  the  { 1 00)  channel 
of  scH.  The  depth  is  the  distance  along  the  [100]  direction  where  the  proton  is  trapped. 


(a-Si)  along  the  <100)  and  <110)  channels.  The  flapw  parameters  of  interest  are 
ae  -  10.26254  a.u.  (chosen  because  it  is  approximately  the  experimental  lattice 
constant  for  Si  [18]),  Rmt'  Kmax  =  8,  /max  =  6,  and  an  atomic  sphere  radius  Rmt  = 
2.2216  a.u.  Figure  8  shows  the  channel  cross  sections  for  the  <100)  and  <110) 
channels.  The  preliminary  results  are  shown  in  Figures  9(a)  and  9(b).  Figure  9 
demonstrates  that  a  greater  number  of  data  points  (trajectories)  is  needed  in  order 
for  the  implantation  profiles  for  diaSi  to  be  of  the  same  quality  as  that  presented 
for  scH  <100).  The  grid  used  for  the  diaSi  <100)  channel  has  42  points  covering 
the  full  channel  cross-sectional  area,  while  the  grid  for  diaSi  <110)  has  only  39 
points  in  the  full  channel  area.  With  so  few  points,  the  ion  implantation  results  are 
incomplete.  However,  if  we  ignore  backscattered  ions  (the  points  with  zero  pene¬ 
tration  depth),  we  can  see  that  the  profile  is  taking  shape.  In  particular,  Figure  9(a) 
would  tend  to  indicate  the  average  penetration  depth  to  be  around  250  a.u.  for 
<100)  with  a  penetration  depth  straggling  of  570  a.u.  Figure  9(b)  seems  to  indicate 
that  the  average  penetration  depth  could  be  approximately  850  a.u.  with  a  straggling 
of  1250  a.u.  Hopefully,  we  will  not  have  to  calculate  289  points,  as  in  scH,  in  order 
to  get  a  representative  profile.  What  is  definitely  observable  from  these  figures  is 
that  the  proton’s  penetration  depth  along  the  <110)  channel  is  approximately  three 
times  its  penetration  depth  along  the  <100)  channel.  This  is  expected  since  from 
Figures  8(a)  and  8(b)  one  can  see  that  the  <110)  channel  is  much  more  open  than 
the  <100)  channel.  Also  notice  that  the  center  of  the  <100)  channel  has  only  the 
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Figure  8.  Si  diamond  structure  channel  cross  sections,  (a)  The  (100)  channel  cross  section 
used  in  the  calculation  of  the  implantation  profile.  The  X’s  are  representative  of  the  grid 
points  used  in  the  calculation,  (b)  The  (110)  channel  cross  section  used  in  the  calculation 
of  the  ion  implantation  profile.  The  squares  in  the  left  half  of  the  figure  represent  the  grid 
of  point  used  in  the  calculation.  The  squares  are  only  in  the  representative  area  of  the 

cross  section. 


identity  as  a  symmetry  operation.  This  explains  why  the  trajectory  of  a  proton 
starting  out  on  the  channel  center  initially  with  v±  -  0  quickly  has  imparted  to  it 
a  vL  #  0.  On  observing  the  (110)  channel,  one  can  see  that  the  channel  center  has 
a  mirror  plane  passing  through  it.  As  a  result  of  this  symmetry,  a  proton  starting 
out  on  the  (110)  channel  center,  with  v  purely  along  the  [110]  direction,  has  very 
little 


v±  ~  (u[ll0]>  %01]) 

imparted  to  it  in  the  [llO]  direction  (the  perpendicular  to  this  mirror  plane).  The 
value  of  U[iio]  is  not  precisely  zero,  as  it  should  be,  due  to  numerical  roundoff  which 
exists  in  specifying  the  center  of  the  (110)  channel.  The  only  sizable  component 
to  v±  is  in  the  [001]  direction  (the  mirror  plane  contains  this  direction). 


counts  counts 


12 


counts  vs  depth 


Figure  9.  diaSi  implantation  profiles  for  0.25  keV  protons,  (a)  Protons  incident  along 
the  (100)  channel  of  Si:  The  depth  is  the  distance  along  the  [100]  direction  where  the 
proton  is  trapped,  (b)  Protons  incident  along  the  (110)  channel  of  Si:  The  depth  is  the 
distance  along  the  [110]  channel  where  the  proton  is  trapped. 
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Conclusion 

We  have  demonstrated,  using  scH,  that  CHANNEL  can  be  used  to  simulate 
realistically  the  trajectories  of  charged  particles  in  solids.  The  choice  of  scH  was 
one  of  computational  expedience  since  it  is  a  one-electron  atom  and  the  solid  has 
one  atom  per  unit  cell.  Even  so  we  were  able  to  generate  hypothetical  ion  implan¬ 
tation  profiles  for  protons  traveling  parallel  to  the  (100)  channel.  We  are  unable 
to  compare  our  results  with  experiment,  but  that  was  not  the  purpose  of  the  cal¬ 
culation  on  scH. 

Furthermore,  we  have  presented  preliminary  results  for  implantation  profiles  for 
diaSi  along  the  (100)  and  (110)  channels.  There  is  an  insufficient  number  of 
trajectories  to  determine  a  well-defined  average  penetration  depth.  The  curves  appear 
to  indicate  an  more  trajectories  will  produce  a  proper  profile.  The  curves  also  indicate 
that  the  depth  of  the  (1 10)  channel  should  be  3  times  that  of  the  (100).  Because 
of  the  reduced  symmetry  of  the  (100)  and  (110)  channels  compared  to  the  scH 
(100)  channel,  the  diaSi  system  poses  a  computational  challenge.  The  diaSi  (100) 
channel  has  no  symmetry  operation  except  for  the  identity  operation  which  causes 
the  representative  area  of  the  channel  cross-section  to  be  the  entire  channel  area. 
diaSi  ( 1 10)  is  only  slightly  better  because  it  has  a  reflection  symmetry  which  makes 
the  representative  cross-sectional  area  1/2  the  full  cross  section.  However,  this  re¬ 
duction  in  computational  work  is  more  than  compensated  by  the  greater  penetration 
depth  of  the  proton  along  this  direction.  We  shall  present  these  results  in  a  future 
paper,  in  which  we  also  will  discuss  our  calculational  method  in  great  detail. 

From  our  results  we  can  conclude  that  the  method  has  promise  for  describing 
in  a  quasiclassical  manner  the  channeling  of  low  energy  ions,  E  in  keV  range,  in 
extended  systems.  There  have  been  numerous  studies  on  single  crystal  Si  [4,9,19- 
29];  however,  so  far  we  have  not  found  any  low  energy  channeling  results  for  protons 
(although  low  energy  B  implantation  has  been  studied  [9]).  This  is  may  be  a  technical 
problem  with  the  experiments  which  we  hope  can  be  solved  in  the  future.  We  have 
been  in  contact  with  an  experimental  group  that  believes  energies  of  1  keV  are  too 
low  for  their  present  detectors  [30]. 
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Abstract 

We  present  the  effects  of  structure,  chemical  binding,  and  number  of  layers  on  stopping  for  clean, 
unreconstructed  diamond  (100)  one-  to  four-layer  films  with  lattice  constants  fixed  at  crystalline  values, 
and  for  graphite  mono-  and  dilayers  at  their  optimized  bond  lengths.  First  Born  approximation  stopping 
cross  sections  for  protons  incident  normally  on  the  films  were  obtained  from  orbital  versions  of  the 
kinetic  theory  of  stopping  and  of  the  local  plasma  approximation.  The  required  electron  momentum 
density  was  generated  from  the  real  space  Kohn— Sham  orbitals.  We  find  a  strong  static  quantum  size 
effect  in  the  stopping  of  diamond  ultra-thin  films  as  opposed  to  graphite,  for  which  the  stopping  is  only 
very  weakly  dependent  on  the  layer  number.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

For  many  years,  there  has  been  great  interest,  both  theoretically  and  technolog¬ 
ically,  in  the  allotropic  forms  of  carbon,  especially  graphite  and  diamond.  Because 
graphite  and  diamond  are  the  simplest  realistic  materials  with  both  a  typical 
covalently  bonded  structure  and  with  four  valence  electrons,  these  forms  are 
ideal  targets  in  which  to  study  structural  effects  upon  stopping.  Graphite  has  a  six- 
membered  benzenelike  structure  with  delocalized  double  bonds.  Its  interplanar 
binding  is  very  weak.  Diamond  has  localized  single  bonds  and  is  isostructural  with 
several  important  electronic  materials  such  as  Ge  and  Si.  A  number  of  diamond- 
based  electronic  devices  have  been  proposed  [1].  Advances  in  material  science  have 
made  possible  the  deposition  of  diamond  thin  films  on  many  other  surfaces  [2]. 
Experimentally  [3],  for  example,  a  1X1  low  energy  electron  diffraction  pattern 
appears  when  a  polished  C(  100)  surface  is  heated  from  500  to  700  K.  Thus  we  were 
motivated  to  study  the  C(100)  diamond  film. 

The  stopping  of  graphite  and  diamond  was  studied  experimentally  by  Softky  [4] 
in  1961  and  later  on  by  Grygoriev  et  al.  [5]  and  by  Kulikaukas  et  al.  [6].  Several 
papers  concerning  stopping  in  bulk  carbon  allotropes  have  been  published  [7-15] 
as  well.  Experimental  determination  of  stopping  properties  in  these  cases  generally 
is  by  extrapolation  to  zero  sample  thickness  in  order  to  eliminate  multiple  scattering 
effects.  So  far  as  we  are  aware,  stopping  in  ultrathin  films  (e.g.,  a  few  layers)  of 
allotropic  carbon  has  never  been  published.  Experimentally  20-100  A  (100)  carbon 
foils  can  be  produced,  but  stopping  in  them  is  yet  to  be  measured  [16]. 
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The  lattice  structure  and  electronic  structure  for  energy  optimized  graphite  1-2 
layers  and  for  clean,  unreconstructed  diamond  (100)  1-4  layers  (fixed  at  the  bulk 
crystalline  bond  length)  have  been  calculated  previously  by  Trickey  et  al.  [17]  and 
Wu  et  al.  [18],  respectively.  Here  we  present  calculations  of  the  stopping  cross 
section  for  protons  (at  normal  incidence)  on  these  ultrathin  layered  structures  and 
on  the  isolated  carbon  atom.  The  next  section  describes  the  method  that  we  used. 
The  results  and  discussion  are  in  the  third  section,  where  the  effects  on  stopping  of 
structure,  chemical  binding,  and  layer  number  (i.e.,  the  quantum  size  effect)  are 
discussed. 


Methods 


The  linear  energy  loss  of  a  charged  particle  in  matter  is  given  as 


1  dE  0/_^  4ttZ?Z2 

~nTx  =  S(v)  =  ^^L(vX 


(1) 


where  N  is  the  number  density  of  scatterers  (here  the  carbon  atom),  S  and  L  are 
the  stopping  cross  section  and  stopping  number  per  scatterer,  v  the  velocity  of  the 
incident  particle  (we  restrict  the  discussion  to  protons  in  this  work),  E  its  energy, 
x  its  path  length,  Z\  the  projectile  charge,  and  Z2  the  number  of  electrons  per  target 
atom.  We  use  Hartree  atomic  units  in  Eq.  (1)  and  throughout  this  paper  unless 
otherwise  noted.  The  effects  of  channeling  and  of  projectile  dynamic  charge  state 
are  neglected.  Following  Sabin  and  Oddershede  [19],  first  Born  approximation 
stopping  cross  sections  for  protons  incident  normally  on  the  films  were  calculated 
with  an  orbital  version  of  the  kinetic  theory  of  stopping  [20].  Details  are  found  in 
Ref.  [21].  Briefly, 

m  =  z  mi),  (2) 

/ 


L,(7)=\  diixy  f  dv2z  pi(v2)L°(  |  v  -  V2 1 )  ^  J-  - v.  (3) 
J —co  J— oo  |  V  Vj  | 

In  Eq.  (3),  p/(v2)  is  the  momentum  density  of  the  /th  energy  sub-band,  L?  is  of 
Bethe  form  with  a  low  energy  cutoff 


where  n  is  the  number  of  atoms  per  unit  cell  and  ??/  is  the  occupation  of  the  /th 
energy  subband  (see  below). 

The  orbital  mean  excitation  energy  7/  is  determined  through  the  orbital  local 
plasma  approximation  (olpa)  [22] 


da  p,(7)  ln[(4xp(7))l/2]. 


(5) 
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The  required  momentum  density  is  constructed  from  the  Fourier  transform  of 
the  real  space  Kohn-Sham  (ks)  eigenfunctions 


*a(p)  = 


[2ttAU^]1/2  J- 


I  dz  J 

J- 00  Jn 


da 


pj(p)=  2 

IcGBZ 

The  KS  functions  were  obtained  in  the  local  density  approximation  to  density  func¬ 
tional  theory  (lda/dft)  utilizing  the  Hedin-Lundqvist  (hl)  exchange-correlation 
functional.  This  scheme  is  internally  consistent,  as  p  plays  the  pivotal  role  in  both 
dft  and  the  OLPA.  The  atomic  calculations  used  the  same  OLPA  and  lda  but  were 
spin  polarized  in  the  central  field  approximation.  A  is  the  area  per  unit  cell,  A^cen  is 
the  total  number  of  periodic  unit  cells,  and  p  =  t  in  atomic  units. 

The  electronic  structure  calculations  were  carried  out  using  the  films  [23-25] 
code  which  solves  the  KS  equations  in  a  Gaussian  basis  representation.  The  basis 
sets  are  given  in  Refs.  [  1 7]  and  [  1 8].  The  atomic  calculation  used  the  9s4p  Hartree- 
Fock  basis  of  Van  Duijneveldt  [26]. 

The  graphite  dilayer  (monolayer)  has  a  simple  hexagonal  Bravais  lattice  with 
four  (two)  carbon  atoms  per  unit  cell  [see  Fig.  1(a)].  Each  lattice  plane  perpendicular 
to  the  c-axis  has  a  honeycomb  arrangement.  For  graphite,  the  irreducible  triangle 
of  the  Brillouin  zone  (bz)  is  1/12  of  the  zone,  with  37  Tc  points  in  that  triangle.  The 
diamond  films  have  square  lattice  symmetry  and  n  atoms  in  one  unit  cell  of  an 


<  110  > 


<  no  > 


Figure  1 .  Top  views  of  the  lattice  structures  of  allotropic  carbon  ultra-thin  films.  The 
atoms  denoted  by  “1”  are  in  layer  1,  and  those  by  “2”  are  in  the  second  layer,  etc.  “1,2” 
means  atoms  in  layer  1  and  2  are  in  overhead  positions,  etc.:  (a)  lattice  structure  of  the 
graphite  2L;  (b)  lattice  structure  of  the  diamond  4L. 
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«-layer  [see  Fig.  1(b)].  The  irreducible  regions  for  the  diamond  1  and  3  layers  are 
1/8  of  the  BZ  with  21  Tc  points  in  the  region,  and,  for  the  diamond  2  and  4  layers, 
are  1/4  of  the  BZ  with  36  k  points  [18]  in  the  region. 

The  energy  sub-bands  in  equation  (3)  were  chosen  as  follows.  The  atomic  stopping 
cross  section  was  used  for  the  Is  bands  (in  tests  with  graphite,  the  difference  in 
stopping  between  the  atom  and  the  monolayer  for  this  band  is  in  the  second  decimal 
place  at  the  peak  value  of  the  stopping  cross-section).  For  the  diamond  monolayer, 
we  used  one  window  for  the  2s-like  band  and  one  window  for  the  2p-like  band, 
since  both  the  2s-like  band  and  2p-like  band  are  narrow  and  do  not  overlap.  Except 
in  the  diamond  monolayer,  the  valence  bands  of  2s-like  and  2p-like  character  all 
overlap,  so  we  divided  the  energy  range  of  those  valence  bands  into  four  windows 
for  graphite  and  three  windows  for  diamond  2-4  layers  according  to 

Ei  =  E2*t  +  l[(E%£  -  ESSfo/ml  l  =  0,  1,  2,  . . . ,  m.  (7) 

Here  m  is  the  number  of  windows  for  2s-like  and  2p-like  orbitals.  We  tested  the 
dependence  of  the  calculated  stopping  cross  section  on  the  number  of  windows  for 
the  graphite  1L  at  one  velocity  (v  =  1.75  a.u.)  near  the  maximum  in  the  stopping 
cross  section  and  find  the  discrepancy  between  the  two  calculations  (m  =  3  and 
m  =  4)  to  be  in  the  second  decimal  place  (see  Table  I).  As  the  velocity  of  the  proton 
is  along  the  z  direction  (i.e.,  normal  to  the  film),  we  have  tested  whether  the  pxy- 
and  pz-like  states  have  the  same  contributions  to  the  stopping  cross  section.  The 
stopping  cross  section  obtained  using  two  windows  by  treating  2pz  and  (2s,  2pxy) 
separately  (i.e.,  the  whole  2pz  as  one  window,  the  whole  2s  and  2pxy  as  another 
window)  is  12.45 1  X  10~15  eV  cm2/atom.  If  we  treat  the  2s  and  2p  together  (combine 
2pz  and  2s,  2pxv  together  in  one  window),  with  m  =  2,  the  stopping  cross  section 
is  12.515  X  10  15  eV  cm2/atom.  The  discrepancy  in  stopping  cross  section  between 
these  two  cases  is  also  in  the  second  decimal  place  (see  Table  I).  For  the  diamond 
2-4L  system,  the  2s,  2pxy  orbitals  and  2pz  orbital  are  hybridized  and,  hence,  cannot 
be  separated. 


Table  I.  The  total  stopping  cross  section  S  at  v  =  1.75  a.u.  as  a  function  of  the  total  number  of  divisions 
of  the  energy  subband  for  graphite-  1L  in  the  energetically  optimized  state  with  a  =  4.635  a.u. 


Orbitals 

No.  of 
windows 

/(eV) 

S 

(10-15  eV 
cm2/atom) 

Treat  2s  and  2p  bands  together 

1 

67.033 

12.181 

2 

67.033 

12.515 

3 

67.033 

12.620 

4 

67.033 

12.614 

Separate  2pz  from  2s,  2pxy  bands 


2s,  2pxy 

2pz 


1 

1 


67.033 


12.451 
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Results 

Calculated  stopping  cross  sections  for  the  isolated  carbon  atom  and  for  graphite 
1-  and  2-layer  system  are  shown  in  Figure  2  and  Table  II.  The  stopping  for  protons 
of  bulk  graphite  at  v  =  6.63  a.u.  is  also  indicated  for  comparison  with  the  experi¬ 
mental  result  (see  below).  We  find  the  stopping  to  be  nearly  additive  per  layer,  that 
is,  the  difference  in  stopping  per  atom  between  the  graphite  monolayer  and  dilayer 
is  rather  small,  no  doubt  due  to  the  weak  interplanar  binding  [17]  (0.03  eV/atom). 
The  presence  of  the  graphite  second  layer  has  no  significant  influence  on  the  electron 
density  of  the  first  layer  and  thus  neither  on  the  stopping.  The  present  result  for 
the  stopping  cross  section  for  the  graphite-2L  at  a  proton  velocity  of  6.63  a.u.  (or 
proton  energy  Ep  =  1.1  MeV)  is  Sg.2L  =  4.30  X  10~15  eV  cm2/atom,  which  is  in 
reasonable  agreement  with  the  measurement  of  Sg.buik  =  4.0  X  10“ 15  eV  cm2/atom 
for  carbon  obtained  by  Fuchs  et  al.  [27]  and  with  Softky’s  estimate  [4]  for  bulk 
graphite  of  Sg.b{l]k  =  3.98  ±  0.2  X  1(T15  eV  cm2/atom  at  Ep  =  1.1  MeV.  {The 
uncertainties  in  the  Softky  data  arise  from  uncertainties  in  the  scattering  cross 
section  (a)  extrapolation  and  in  the  value  of  the  incident  energy  used,  which  could 
be  in  error  by  as  much  as  5%  according  to  Ref.  [4]}. 

Calculated  stopping  cross  sections  for  the  isolated  carbon  atom  and  for  diamond 
^-layers  (n  =  1-4)  films  are  shown  in  Figure  3  and  Table  III.  We  find  a  strong  static 
quantum  size  effect  in  the  stopping  for  the  ultra-thin  diamond  films:  The  stopping 
cross  section  decreases  significantly  as  the  number  of  layers  increases.  This  effect 
arises  from  the  interaction  between  layers  which  is  much  stronger  in  diamond  (e.g., 
the  interplanar  binding  energy  for  diamond-2L  [  1 8]  is  3.93  eV/atom)  than  in  graph¬ 
ite.  The  total  energy  per  atom  decreases  and  the  mean  excitation  energy  increases 


Figure  2.  The  stopping  cross  section  of  isolated  carbon  atom,  graphite- 1L  and  graphite- 
2L  as  functions  of  proton  velocity. 
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Table  II.  The  stopping  properties  at  vmax  as  a  function  of  the  total  number  of  layers  for 

the  graphite-«La 


System 

Atom 

Monolayer 

Dilayer 

Lattice  constants  (a.u.) 

a  =  4.635 

a  =  4.636, 

c  -  12.988 

Umax  (a.U.) 

1.52 

1.75 

1.89 

Is  orbital 

/.s(eV) 

240.24 

242.81 

242.82 

S,sb 

0.34 

0.39 

0.41 

2s  and  2p  orbitals 

/2s'»(eV) 

24.96(2s) 

36.31 

36.28 

9.87(2s) 

3.07 

2.95 

4P1  (eV) 

21.17(2p) 

38.42 

38.44 

C  (2) 

*J2s 

8.29(2p) 

2.62 

2.57 

4™  (eV) 

36.28 

36.46 

S*<3> 

4.17 

4.27 

4<4>  (eV) 

28.81 

29.74 

SW4> 

2.38 

2.47 

/tot(eV) 

50.26 

67.03 

67.27 

6max 

18.50 

12.63 

12.67 

S  at  v  -  6.63  a.u. 

4.68 

4.31 

4.30 

a  The  superscripts  on  I  and  S  are  the  indices  of  the  windows.  The  intraplanar  unit  cell 
parameter  (hexagonal  bond  length)  is  denoted  as  a.  The  interplanar  separations  are  des¬ 
ignated  as  c.  The  stopping  cross  section  S  is  in  10~15  eV  cm2/atom. 
b  From  the  spin  polarized  atomic  calculation. 


as  the  number  of  layers  increases.  Experimental  [13]  and  semi-empirical  [9,14,15] 
results  have  been  given  for  stopping  in  bulk  diamond.  In  Figure  3,  we  present  the 
semi-empirical  values  of  Sabin  and  Oddershede  (so)  [9]  and  of  Elkomoss  and  Pape 
(ep)  [14],  and  compare  them  to  stopping  for  dilute  gas  and  thin  films  targets  as 
calculated  here.  The  EP  values  were  generated  using  the  Brandt-Reinheimer  for¬ 
malism  [14],  which  is  a  generalization  of  the  Lindhard  and  Winther  [28]  dielectric 
scheme.  The  energy  gap  is  obtained  by  requiring  that  k(0,  0,  Eg )  is  equal  to  the 
static  dielectric  constant  of  diamond.  Sabin  and  Oddershede  [9]  use  momentum 
densities  obtained  from  the  valence  electron  Compton  profiles  and  a  mean  excitation 
energy  (/  =  96  eV)  derived  from  experimental  stopping  cross  section.  This  value  is 
slightly  higher  than  that  reported  by  Tung  and  Watt  (tw)  [  1 5]  (86  eV)  derived  from 
a  combined  method  that  treats  the  core  electrons  in  the  local  plasma  approximation 
and  uses  the  dielectric  response  formalism  for  valence  electrons.  Lowering  of  the 
SO  mean  excitation  energy  to  that  of  TW  would  raise  the  so  values  of  the  stopping 
cross  section  and  put  them  in  much  better  agreement  with  the  results  of  EP.  In  any 
case,  it  is  clear  that  the  stopping  cross  section  becomes  lower  and  broadens  with 
increasing  layer  number,  as  seen  previously  [29],  and  that  the  approach  to  bulk 
behavior  is  slow;  that  is,  there  is  a  strong  quantum  size  effect. 

The  calculated  lattice  constants  for  graphite  films  [17]  are  shown  in  Table  II. 
The  experimental  value  of  the  bond  length  of  bulk  diamond  is  2.919  a.u.  [30], 
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Figure  3.  The  stopping  cross  section  of  an  isolated  carbon  atom,  of  diamond  1-4  layer 
films,  and  of  bulk  diamond  as  functions  of  proton  velocity. 


which  leads  to  an  intraplanar  lattice  constant  of  4.766  a.u.  and  an  interplanar 
distance  of  1.685  a.u.  at  which  the  lattice  constants  of  the  diamond  films  were 
fixed.  The  graphite  monolayer  has  an  areal  density  of  0.648  electrons/a.u.  [2],  which 


Table  III.  The  stopping  properties  at  vmax  as  the  function  of  the  total  number  of  layers  for  the 

diamond-«L.a 


System 

Monolayer 

Dilayer 

Trilayer 

Fourlayer 

Umax  (a.U.) 

1.48 

1.73 

1.88 

1.91 

1  s  orbital 

/is(eV) 

240.09 

241.34 

241.70 

241.94 

S,sb 

0.33 

0.38 

0.41 

0.42 

2s  and  2p  orbitals 

W’(eV) 

25.76  (2s) 

33.22 

34.69 

34.92 

*S,2s&2p(1) 

9.77 

4.01 

2.64 

3.01 

W2)(eV) 

22.90  (2p) 

31.93 

34.31 

35.05 

S2s&2Pi2) 

8.36 

2.72 

4.57 

4.11 

/2s&2p(3)  (eV) 

27.97 

29.84 

31.64 

C  (3) 

*J2s&2p 

7.88 

6.23 

5.88 

(eV) 

52.117 

60.10 

62.98 

64.54 

•Slot  at  umax 

‘S^graphiie/'S'1  diamond  at  V 

=  6.63  a.u. 

18.46 

0.932 

14.99 

0.969 

13.86 

0.982 

13.42 

0.987 

a  The  superscripts  on  I  and  S  are  the  indices  of  the  windows.  The  stopping  cross  section  *S  is  in  10  15 
eV  cm2/atom. 

b  From  a  spin  polarized  atomic  calculation. 
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Figure  4.  The  ratio  of  the  stopping  cross  section  for  1.1  MeV  protons  in  the  graphite 
dilayer  to  that  in  the  diamond-nL  as  a  function  of  the  number  of  layers  (denoted  by  solid 
line).  The  error  bars  represent  estimated  uncertainties  in  our  calculations.  The  dotted  line 
represents  the  ratio  of  the  stopping  cross  section  in  bulk  graphite  to  that  in  bulk  diamond. 

The  shadowed  area  represents  the  reported  standard  deviation  in  the  bulk  value. 


is  higher  than  that  for  the  diamond  monolayer  (0.264  electrons/a.u.  [2]).  Typically 
a  higher  areal  electron  density  causes  a  lower  stopping  cross  section  and  a  higher 
value  of  the  velocity  at  which  the  stopping  cross  section  achieves  its  maximum 
value,  since  the  system  with  higher  areal  density  has  lower  total  energy  and  higher 
mean  excitation  energy.  This  binding  dependence  for  graphite  and  diamond  is 
consistent  with  that  general  tendency. 

Since  the  stopping  cross  sections  of  ls-like  core  bands  are  less  than  5%  of  the 
total  stopping  cross  section  for  both  graphite  and  diamond,  the  difference  between 
graphite  and  diamond  stopping  is  clearly  due  to  the  valence  band  electron  distri¬ 
bution,  just  as  one  would  expect  on  chemical  grounds.  Graphite  films  and  diamond 
films  have  quite  different  electronic  structures,  and  correspondingly  different  valence 
density  distributions.  Bulk  graphite  has  a  small  overlap  of  the  valence  and  conduction 
bands,  whereas  bulk  diamond  is  an  insulator  with  a  large  band  gap.  Therefore,  the 
contribution  to  the  mean  excitation  energy,  7,  from  the  interband  transition  is 
smaller  for  graphite  than  for  diamond  leading  to  a  larger  stopping  cross  section  for 
bulk  graphite  than  for  bulk  diamond. 

In  early  1961,  Softky  [4]  experimentally  found  the  stopping  power  of  bulk  graphite 
to  be  1.0604  ±  0.0090  times  that  of  bulk  diamond  for  1.1  MeV  protons.  (The 
quoted  error  refers  to  the  standard  deviation  in  the  measurements.)  More  recently, 
Kulikauskas  et  al.  [6]  gave  experimental  results  at  lower  proton  energies  which 
when  extrapolated  give  good  agreement  with  the  value  reported  by  Softky.  Gry- 
goriev’s  calculation  [5],  based  on  the  dielectric  description  and  average  concentra¬ 
tions  of  electrons  in  graphite  and  diamond,  gave  a  similar  ratio.  The  ratio  of  the 
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stopping  cross  section  for  the  graphite-2L  to  that  of  the  diamond  riL  as  a  function 
of  layer  number  ( n )  is  shown  in  Table  III  and  plotted  in  Figure  4.  (Because  of  the 
weak  binding  in  graphite,  it  is  reasonable  to  use  the  2L  result  for  1  <  n  <  4.)  The 
numerical  uncertainty  of  our  calculation  is  estimated  to  be  2%.  Thus  the  present 
results  are  consistent  with  previous  findings  in  the  sense  that  we  find  thin  film 
stopping  is  distinct  from  that  of  bulk  [21,29].  However,  the  extrapolated  value  of 
the  ratio  is  0.99  at  Ep=  1.1  MeV.  The  remaining  discrepancy  in  the  extrapolated 
value  from  Softky’s  value  may  be  attributed  to  the  limitations  in  the  OLPA  model 
itself  and  the  unreconstructed  (hence  metallic  [18])  geometry  of  the  calculated 
diamond  films  or  systematic  error  in  the  measurement. 

To  summarize,  the  stopping  of  ultrathin  layers  of  carbon  allotropes  shows  a 
strong  static  quantum  size  effect  in  the  stopping  of  diamond  ^-layers.  This  is  in 
contrast  to  graphite,  where  the  layers  are  only  weakly  interacting,  compared  to 
diamond,  and  the  stopping  is  nearly  additive  per  layer. 
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Abstract 

A  comparison  is  made  between  the  ionization  potentials  (ips)  calculated  by  the  extended  Koopmans’ 
theorem  (ekt)  and  by  taking  energy  differences  (Aci)  from  configuration  interaction  calculations  in  the 
same  basis.  Several  ionization  potentials  were  calculated  for  LiH,  He2,  and  Li2.  The  best  Aip,  the  difference 
between  the  ekt  ip  and  the  corresponding  Aci  value,  was  0.05  meV  for  the  2 a  orbital  for  LiH  and  83.5 
meV  for  the  3a  orbital.  The  Aips  for  He2  were  0.7  meV  for  the  lcru  orbital,  6  eV  for  the  2<ru  orbital,  5 
meV  for  the  2ag  orbital,  and  3  eV  for  the  3 ag  orbital.  The  Aips  for  Li2  are  0. 1  meV  for  2<xg,  53  meV  for 
3<7g,  0.6  meV  for  2<ru,  and  1.73  eV  for  3cru.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

The  extended  Koopman’s  theorem  (ekt)  [1-3]  provides  a  method  of  calculating 
ionization  potentials  (ips)  from  certain  classes  of  reference  wave  functions  which 
go  beyond  the  independent  particle  model.  Such  wave  functions  include  multi- 
configurational-self-coriSistent  field  (mcscf)  wave  functions,  full  configuration  in¬ 
teraction  wave  functions,  and  complete-active-space-self-consistent  field  wave 
functions.  The  ekt  can  be  considered  to  be  a  special  case  of  the  electron  propagator 
and  Green’s  function  methods  [4-8]. 

There  has  been  considerable  interest  [9-16]  in  the  question  of  the  exactness  of 
the  ekt  for  the  lowest  ionization  potential.  It  has  been  demonstrated  numerically 
that  the  lowest  ekt  ip  is  exact  [11,13].  There  is  also  numerical  evidence  the  ekt  is 
exact  for  ips  besides  the  lowest  one  [15,16]. 

The  EKT  IPs  are  found  by  solving  the  matrix  equations 

(V  -  *y)c  =  0  ,  (1) 

where 

Vij  =  —(tyN\a][H,  dj]  |  .  (2) 

We  can  consider  dt  to  annihilate  the  rth  natural  spin  orbital  (nso)  of  1 4^)  to  form 
an  (N  —  l)-electron  wave  function  | 
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i^r1)  =  £!**>. 

The  |  form  the  basis  for  the  (N  -  l)-electron  wave  function,  although  the  (N 
—  l)-electron  wave  function  is  not  calculated  explicitly.  The  matrix  7  is  the  overlap 
matrix  of  the  |$f_1)  and  in  the  NSO-derived  basis  is  a  diagonal  matrix  of  NSO 
occupation  numbers 

m  =  yii  =  (*N\alai\*N).  (3) 

It  is  well  known  that  for  the  ground  states  of  most  atoms  and  molecules  N  of  the 
rij  are  near  1  and  the  rest  fall  ofF  very  rapidly  toward  zero.  It  is  not  known  whether 
any  are  actually  zero  for  ground  states  of  many-electron  atoms  and  molecules.  It 
is  believed  that  none  of  the  nt  are  zero  for  ground  states  of  2-electron  systems  [12]. 
Although  it  has  been  assumed  that  some  nt  might  be  zero  for  ground  states  of  many- 
electron  systems  [17],  it  is  unlikely  that  any  are  identically  zero  [12].  Related  ques¬ 
tions  are  whether  or  not  the  generalized  overlap  amplitudes,  also  called  Feynman 
Dyson  amplitudes,  are  linearly  independent  [18],  whether  or  not  the  |  $f_1)  form 
a  complete  set,  and  whether  or  not  the  set  of  eigenfunctions  having  nonzero  eigen¬ 
values  of  the  (N  -  l)-particle  reduced  density  matrix  form  a  complete  set  [17]. 
Although  we  do  not  expect  the  ekt  calculations  to  answer  these  questions,  whether 
or  not  all  the  ekt  ips  are  exact  depends  on  their  answers. 

The  fact  that  the  smallest  n,  rapidly  decreases  as  the  size  of  the  basis  set  is  increased 
causes  Eq.  (1)  to  become  ill-conditioned.  Thus  EKT  calculations  using  large  basis 
sets  which  result  in  a  number  of  very  small  become  difficult  and  seemingly 
impractical.  We  have  used  canonical  orthonormalization  to  eliminate  1 cor¬ 
responding  to  rit  below  a  certain  threshold.  However,  this  discards  information 
which  might  be  critical  for  obtaining  accurate  ionization  potentials. 

In  this  article  we  examine  several  IPs  for  He2,  LiH,  and  Li2  by  carrying  out  a 
series  of  calculations  in  which  basis  set  sizes  are  varied.  Basis  sets  are  systematically 
changed  from  one  calculation  to  the  next  by  including  additional  scf  virtual  orbitals 
in  the  set. 


Calculations  and  Results 

All  calculations  were  performed  using  the  GAMESS  [19]  program.  The  basis  sets 
for  He  and  Li  were  taken  from  Rizzo,  Clementi,  and  Sekiya  [20].  The  basis  set  for 
H  is  the  same  as  that  used  in  a  previous  calculation  on  H5  [11],  Self-consistent 
field  calculations  were  performed  on  each  of  the  three  molecules  He2,  LiH,  and 
Li2.  The  SCF  orbitals  were  systematically  included  in  full  Cl  calculations  on  the 
ground  state  of  the  neutral  molecules  He2,  LiH,  and  Li2,  and  on  several  states  of 
the  positive  ions  He2 ,  LiH+,  and  Li2 . 

The  number  of  orbitals  included  in  the  calculations  ranged  from  3-18  for  He2, 
3-23  for  LiH,  and  4-16  for  Li2.  In  order  to  avoid  numerical  difficulties  the  (N  - 
l)-electron  basis  functions,  were  excluded  from  the  basis  set  for  most  of 

the  calculations  if  the  corresponding  NSO  occupation  was  smaller  than  10-8.  This 
excludes  many  of  the  (N  -  l)-electron  basis  functions  for  the  larger  orbital  basis 


EXTENDED  KOOPMANS’  THEOREM 


311 


sets.  For  comparison  several  calculations  were  performed  using  smaller  nt  tolerances 
for  excluding  the  | 

The  ionization  energies  were  calculated  using  both  the  ekt  method  and  by  taking 
the  energy  difference  ip(Aci)  of  full  Cl  calculations  on  the  neutral  molecule  and  on 
the  ion  in  the  same  basis  set. 

In  the  work  of  Sundholm  and  Olsen  [13]  the  best  comparison  between  ekt  IPs 
and  Aci  IPs  was  obtained  using  only  s  orbitals.  If  the  higher  ekt  ips  involving 
ionization  from  a  orbitals  LiH,  He2,  and  Li2  are  exact  it  should  be  apparent  even 
when  only  s  orbitals  are  in  the  basis  set.  In  order  to  include  as  many  s  orbitals  as 
possible  within  the  constraints  of  our  computer  system  most  of  the  calculations 
were  performed  using  only  s  orbitals.  For  comparison  one  calculation  which  includes 
p  orbitals  was  performed  on  LiH. 

The  results  of  calculations  for  LiH,  He2,  and  Li2  are  given  in  Tables  I,  II,  and 
III,  respectively.  The  first  column  lists  the  number  of  SCF  orbitals  included  in  the 
basis  set  for  the  full  Cl  calculation.  The  number  of  natural  orbitals  kept  in  the  ekt 
calculation  is  given  in  parentheses.  The  IPs  are  labeled  by  the  ekt  orbital.  For 
example  2 a  for  LiH  corresponds  to  ionization  into  the  2S+(la22<r)  state  of  LiH+, 
3 a  corresponds  to  ionization  into  the  22+(l<r23 a)  state  of  LiH+,  l<ru  for  He2  cor¬ 
responds  to  ionization  into  the  2Sg  ( 1  o\  1  au)  state  of  He2  ,  and  2o-g  for  Li2  corresponds 
to  ionization  into  the  22g(l<rgl<T22o-g)  state  of  Li2,  etc. 


Table  I.  A  comparison  of  the  ekt  and  Aci  ionization  potentials  for  LiH  for  increasingly  larger  basis 
sets.  The  natural  orbital  occupation  tolerance  for  excluding  natural  orbitals  is  1(T8.  The  intemuclear 

distance  is  1.5955  A. 


Orbitals 

SCF  (nos) 

Energy 

2c 

3c 

Aci 

EKT 

AlP 

Aci 

EKT 

AlP 

5e  (3a) 

-7.97210 

7.9783 

7.9789 

0.0006 

21.9230 

21.9765 

0.0535 

10c  (6c) 

-7.97814 

7.5534 

7.5535 

0.00006 

19.1115 

19.2642 

0.1528 

12 C  (8cr) 

-7.98916 

7.3641 

7.3641 

0.00004 

19.1409 

19.1901 

0.0492 

15c  (lie) 

-8.00022 

7.3748 

7.3748 

0.00003 

19.2091 

19.2908 

0.0816 

19c  (15c) 

-8.00734 

7.3808 

7.3809 

0.00004 

19.2138 

19.2959 

0.0822 

23c  (17c) 

-8.00788 

7.3811 

7.3812 

0.00005 

19.2138 

19.2972 

0.0835 

15c67r  (8c27t) 

-7.99137 

7.6553 

7.6554 

0.00008 

19.8188 

20.1706 

0.3519 

Orbitals  SCF 

4c 

5c 

(nos) 

Energy 

Aci 

EKT 

AlP 

Aci 

EKT 

AlP 

5c  (3c) 

-7.97210 

26.6495 

67.2539 

40.6043 

_ 

_ 

_ 

10c  (6c) 

-7.97814 

22.2983 

23.0676 

0.7694 

24.7488 

27.9142 

3.1654 

12c  (8c) 

-7.98916 

22.0029 

22.1765 

0.1736 

24.5706 

30.0822 

5.5116 

15c  (lie) 

-8.00022 

22.0496 

22.3935 

0.3439 

24.6318 

30.0280 

5.3962 

19c  (15c) 

-8.00734 

22.0565 

22.3864 

0.3299 

24.6385 

30.1374 

5.4988 

23c  (17c) 

-8.00788 

22.0568 

22.4621 

0.4053 

24.6388 

30.1537 

5.5149 

15c67r  (8c27t) 

-7.99137 

22.7876 

23.5631 

0.7755 

24.4740 

27.0050 

2.5310 
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Table  II.  A  comparison  of  ekt  and  Aci  ionization  potentials  for  He2 .  The  natural  orbital  threshold 
for  excluding  natural  orbitals  is  10~8,  except  where  noted.  The  intemuclear  distance  is  1.08  A. 


Orbitals 

SCF  (NO) 

Energy 

lffu 

2ou 

Aci 

EKT 

Aip 

Aci 

EKT 

Aip 

2<rg2<ru  (lergleru) 

-5.60963 

19.8741 

19.8750 

0.0008 

_ 

— 

3crg3cru  (2ffg2cru) 

-5.60964 

19.8692 

19.8696 

0.0003 

53.2231 

69.4167 

16.1937 

4ffg4tru  (2<7g2c7u) 

-5.60964 

19.8518 

19.8525 

0.0007 

51.9285 

66.1611 

14.2327 

5ffg5o-u  (2<7g2o-u) 

-5.60967 

19.8026 

19.8047 

0.0021 

50.6575 

64.5225 

13.8651 

6<7g6<xu  (2trg2 <ru) 

-5.60986 

19.6890 

19.6947 

0.0057 

49.4427 

63.6271 

14.1843 

7crg7<ru  (3ag2au) 

-5.61078 

19.4876 

19.4892 

0.0017 

48.3200 

51.9340 

3.6140 

8o-g8(Tu  (3crg3cru) 

-5.61391 

19.2335 

19.2343 

0.0008 

44.9955 

51.8642 

6.8687 

9<rg8 <ru  (3crg4<ru) 

-5.61713 

19.1230 

19.1235 

0.0005 

44.8019 

50.1532 

5.3513 

9<rg8<r„  (5<rg6an)a 

-5.61713 

19.1230 

19.1235 

0.0005 

44.8019 

50.1327 

5.3308 

9ffg9au  (4<rg4o-u) 

-5.62075 

19.0037 

19.0038 

0.0001 

44.4116 

51.5217 

7.1101 

9crg9ffu  (7<Tg6o-u)b 

-5.62075 

19.0037 

19.0044 

0.0007 

44.4116 

50.0672 

5.6556 

Orbitals 

2°g 

3<xg 

SCF  (NO) 

Energy 

Aci 

EKT 

AIP 

ACI 

EKT 

Aip 

2o-g2o\,  (  1  (Tg  1  (Tu) 

-5.60963 

30.0663 

30.0671 

0.0008 

— 

— 

— 

3ag3o-u  (2ag2  <ru) 

-5.60964 

30.0612 

30.0614 

0.0002 

51.9024 

73.4326 

21.5302 

4o-g4(Tu  (2(Tg2 (Tu) 

-5.60964 

30.0420 

30.0428 

0.0007 

50.2310 

71.3962 

21.1651 

5(jg5au  (2ffg2(Tu) 

-5.60967 

29.9858 

29.9882 

0.0024 

48.6722 

69.9379 

21.2656 

6ag6au  (2<Tg2cru) 

-5.60986 

29.8520 

29.8587 

0.0067 

47.2402 

69.0186 

21.7784 

7 0g7 <y u  (3(7g2(Tu) 

-5.61078 

29.6050 

29.6062 

0.0012 

45.9840 

56.9979 

11.0139 

8<rg8<7u  (3o-g3o-u) 

-5.61391 

29.2706 

29.2734 

0.0029 

47.3676 

58.3577 

10.9901 

9ag8<ru  (3o-g4cru) 

-5.61713 

29.1061 

29.1109 

0.0048 

47.0126 

60.1247 

13.1121 

9<Tg8(Tu  (5 c^u) 

-5.61713 

29.1061 

29.1086 

0.0025 

47.0126 

57.1854 

10.1728 

9crg9(7u  (4<rg4(Tu) 

-5.62075 

28.9410 

28.9488 

0.0077 

46.7567 

57.5458 

10.7891 

9<7g9tru  (7<jg6ou)b 

-5.62075 

28.9410 

28.9461 

0.0051 

46.7567 

50.0604 

3.3037 

3  Natural  orbital  occupation  threshold  is  10  l2. 
b  Natural  orbital  occupation  threshold  is  10~13. 


LiH 

For  the  case  using  23  a  SCF  orbitals  and  17  a  no’s  the  Aip  for  the  lowest  IP,  2a, 
is  0.05  meV  whereas  the  Aip  for  the  next  ip,  3a,  is  83.5  meV.  The  lowest  ip  is  1670 
times  more  accurate  than  the  next  one.  The  4a  ekt  ip  is  about  five  times  less 
accurate  than  the  3  a  ekt  ip  and  the  5tr  ip  is  an  order  of  magnitude  less  accurate 
than  the  4 a  ekt  ip.  Adding  p  orbitals  to  the  basis  set  does  not  significantly  improve 
the  accuracy  of  the  EKT  IPs.  Lowering  the  nf  threshold  to  10“ 10  for  exclusion  of 
1 4>f-1)  gives  negative  Alps  in  some  cases  (not  shown  in  the  table).  This  illustrates 
the  problem  of  including  |  $f_1)  with  small  in  the  EKT  calculation. 

He2 

The  results  for  He2  are  given  in  Table  II.  The  best  Aip  values  for  the  lowest  ip, 
1  <ru,  are  0. 1  meV  to  0.7  meV  whereas  the  best  Aip  for  the  2au  is  5.6  eV,  about  8000 
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Table  III.  A  comparison  of  ekt  and  Aci  ionization  potentials  for  Li2.  The  natural  orbital  occupation 
tolerances  for  excluding  natural  orbitals  are  10-8.  The  internuclear  distance  is  2.6731  A. 


Orbital  space 

SCF  (NO) 

Total 

energy 

2<xg 

3o-g 

Aci 

EKT 

Aip 

Aci 

EKT 

Aip 

4<xg4eru  (3<rg2<ru) 

-14.86434 

5.1457 

5.1463 

0.0006 

13.3540 

15.8394 

2.4854 

5<7g5cru  (3crg3o\,) 

-14.86657 

5.1897 

5.1901 

0.0005 

12.5370 

13.0719 

0.5349 

6  O g6  (X u  (4(7g3(Tu) 

-14.87012 

5.2358 

5.2366 

0.0008 

11.7940 

12.1503 

0.3563 

7cre7<ru  (5<rg4<ru) 

-14.87419 

5.2582 

5.2584 

0.0002 

11.2485 

11.3375 

0.0890 

8<rg8o\,  (ScrgScru) 

-14.87658 

5.2778 

5.2779 

0.0001 

11.1113 

11.1643 

0.0531 

2  <TU 

3<ru 

SCF  (NO) 

Energy 

Aci 

EKT 

Aip 

ACI 

EKT 

Aip 

4rjg4cru  (3 <Tg2cru) 

-14.86434 

9.3520 

9.3716 

0.0196 

13.9851 

67.3533 

53.3682 

5ffg5(Tu  (3ag3<7u) 

-14.86657 

8.8066 

8.8352 

0.0286 

13.3470 

67.4025 

54.0554 

6cre6<ru  (4ag3<ru) 

-14.87012 

8.3671 

8.3807 

0.0136 

12.7659 

28.2091 

15.4431 

7(7g7o-u  (5<Tg4(Tu) 

-14.87419 

8.1080 

8.1082 

0.0002 

12.3718 

13.3395 

0.9677 

8<rg8o-u  (5<rg5<7u) 

-14.87658 

8.0359 

8.0365 

0.0006 

12.2437 

13.9737 

1.7300 

times  greater.  The  Aip  for  the  2<rg  IP  is  about  5  meV*  about  10  times  greater  than 
for  the  lffu  IP.  Both  the  l<ru  and  the  2<rg  IPs  can  be  calculated  very  accurately. 

Li  2 

The  2crg  Aip  of  0. 1  meV  is  530  times  less  than  the  53  meV  Aip  for  the  3<rg  ip. 
The  2(7g  ip  is  about  six  times  more  accurate  than  the  2 <ru  IP.  The  3tru  ekt  ip  is 
considerably  worse.  As  is  the  case  with  He2,  the  lowest  ekt  ip  of  each  symmetry 
type,  CTg  and  au,  can  be  calculated  very  accurately,  with  the  lowest  ip  of  the  two 
being  an  order  of  magnitude  more  accurate. 

Conclusions 

In  He2  and  Li2  the  next  to  lowest  IP,  corresponding  to  a  different  symmetry  state 
of  the  ion  than  the  ground  state  symmetry,  can  be  calculated  very  accurately,  several 
orders  of  magnitude  more  accurately  than  the  next  ip.  But  it  is  not  quite  as  accurate 
as  the  lowest  ip  of  the  molecule.  The  question  arises  whether  it  is  not  as  accurate 
because  it  is  not  exact,  or  whether  the  method  of  calculation  has  systematically 
excluded  the  basis  functions  needed  to  obtain  more  accurate  calculations.  In  a 
previous  calculation  on  Be  [16]  the  basis  set  was  partially  optimized  to  obtain  the 
best  EKT  ip  into  the  second  2P  state  of  Be+.  This  produced  an  accurate  ekt  ip  into 
the  lowest  2P  state  of  Be+  and  resulted  in  the  conclusion  that  this  EKT  IP  was  exact. 
A  systematic  calculation  of  the  same  system  resulted  in  the  conclusion  that  it  was 
not  exact  [15].  Whether  or  not  these  “other  symmetry”  lowest  ekt  IPs  are  exact, 
it  appears  that  in  some  cases  they  can  be  obtained  very  accurately. 
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It  is  not  clear  whether  the  apparent  inaccuracy  of  the  ekt  IPs  is  due  to  a  fun¬ 
damental  theoretical  limit  or  whether  it  is  because  in  practical  calculations  the  ill- 
conditioned  nature  of  Eq.  (1)  precludes  including  natural  orbitals  with  very  small 
occupation  numbers.  It  is  likely  that  none  of  the  natural  orbital  occupation  numbers 
are  identically  zero  for  ground  states  of  atoms  and  molecules  [12].  If  this  is  the 
case,  one  would  expect  that  all  of  the  ekt  IPs  would  be  exact  and  that  the  inaccuracies 
of  the  higher  ekt  ips  is  due  to  omitting  orbitals  which  are  needed  to  obtain  accurate 
ionization  energies. 
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Abstract 

The  ab  initio  theories  of  electronic  and  vibrational  circular  dichroism  are  presented  in  brief.  For 
electronic  circular  dichroism,  emphasis  is  placed  on  the  derivation  of  optical  rotatory  strengths  by  the 
perturbative  configuration  interaction  approach  as  implemented  in  the  program  PCI.  An  application  to 
the  chiroptical  properties  of  the  disulfide  chromophore  is  described.  In  the  infrared  region,  the  ab  initio 
vibronic  coupling  theory  (vct)  of  vibrational  circular  dichroism  (VCD)  as  implemented  in  the  program 
vct90,  is  presented.  The  relationship  to  the  ab  initio  magnetic  field  perturbation  (mfp)  formalism  and 
an  approximate  locally  distributed  origin-gauge  (ldo)  model  is  described.  The  vct  and  mfp  formulations 
are  compared  in  large  basis  set  and  electron  correlated  studies  of  the  model  system,  NHDT,  and  the 
experimentally  characterized  molecule,  2,3-dideuteriooxirane.  The  LDO  model  of  VCD  is  applied,  together 
with  nmr  and  molecular  mechanics  techniques,  to  the  investigation  of  the  conformations  of  the  anticancer 
drug,  taxol.  Coupled  oscillator  models  are  introduced.  Applications  both  in  the  area  of  electronic  and 
vibrational  circular  dichroism,  especially  to  the  determination  of  secondary  structures  of  proteins  and 
nucleic  acids  are  remarked  upon.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Chemical  transformations  take  place  in  three-dimensional  space  and  often  require 
that  the  reactants  be  able  to  adopt  very  specific  geometric  configurations  and  to 
orient  themselves  in  a  very  specific  manner  relative  to  each  other.  For  complex 
molecules,  the  spatial  requirements  are  especially  severe,  but  tremendous  advantages 
from  the  point  of  view  of  reactivity  and  selectivity  can  be  gained  by  shape-selective 
and  shape  sensitive  catalysis.  A  billion  years  of  evolution  has  resulted  in  very  so¬ 
phisticated  (bio)chemical  systems,  which  chemists  are  vigorously  trying  to  under¬ 
stand  and  emulate  in  vitro.  A  proper  study  of  the  three-dimensional  structures  of 
molecules  logically  requires  techniques  that  are  sensitive  to  the  three-dimensional 
shapes  of  molecules,  especially  the  property  of  handedness.  Such  techniques  are 
collectively  called  chiroptical  techniques.  We  will  focus  here  on  two  of  these,  elec¬ 
tronic  circular  dichroism  (ecd),  and  its  counterpart  in  the  infrared  region,  vibrational 
circular  dichroism  (VCD).  A  CD  spectrometer  measures  the  difference,  A A  =  AL- 
Ar  or  Ae  =  eL-  eR,  between  absorbance  of  left  and  right  circularly  polarized  light 
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as  a  function  of  wave  length,  X.  The  area  under  the  CD  absorption  band  of  a  given 
transition,  K,  is  [i?]*, 

f 00 

[R]k  =  0.229  X  10-38  Jo  A cK(\)—  (1) 

in  units  of  esu2  cm2,  where  Ac  is  in  units  of  molar  extinction  coefficient.  An  analogous 
measure  of  the  area  under  the  normal  absorption  band  determines  the  oscillator 
strength  of  the  transition.  Observation  of  a  nonzero  CD  spectrum  requires  that  the 
substance  being  measured  be  chiral  (not  superimposible  on  its  own  mirror  image), 
and  separated  (at  least  in  part)  from  its  antipode.  All  biological  molecules  of  any 
complexity  satisfy  these  requirements.  A  substance  that  consists  of  a  single  antipode 
is  said  to  be  optically  pure.  The  absolute  configuration  of  a  chiral  substance  distin¬ 
guishes  between  the  two  antipodes.  It  is  a  major  thrust  of  synthetic  organic  chemistry 
to  synthesize  substances  in  optically  pure  form  and  of  known  absolute  configuration. 

The  antipodes  (enantiomers)  of  a  substance  have  rotatory  strengths  of  equal 
magnitude  and  opposite  in  sign  for  every  transition.  The  CD  spectra  are  mirror 
images  of  each  other.  It  is  sufficient  to  predict  correctly  the  sign  of  the  rotatory 
strength  of  any  single  transition  in  order  to  assign  the  absolute  configuration  of  a 
substance,  provided  the  relative  spatial  dispositions  of  the  nuclei,  i.e.,  the  confor¬ 
mation,  is  known.  Different  conformations  of  the  same  optically  pure  substance 
may  have  very  different  CD  spectra,  with  rotatory  strengths  differing  in  magnitude 
and  possibly  in  sign.  The  principal  application  of  chiroptical  techniques  to  bio¬ 
molecules  is  not  to  determine  absolute  configurations,  since  for  the  most  part,  these 
are  now  known,  but  rather  to  determine  conformations  on  which  the  biological 
activity  usually  depends. 

The  theoretical  foundation  for  all  chiroptical  techniques  was  elucidated  by  Ro- 
senfeld  [1],  who  showed  that  the  rotatory  strength  of  a  transition,  assumed  to  be 
between  states  0  and  n,  is  simply  the  imaginary  part  of  the  scalar  product  of  the 
electric  dipole  and  magnetic  dipole  transition  moments  for  the  transition. 

[R]k  =  [*o  „]  =  Im«*o  I S I  *«>  •  <*»  I  *o»  (2) 

where  the  electric  and  magnetic  dipole  moment  operators,  ~jl  and  m,  respectively, 
are  the  sums  of  electron  and  nuclear  operators, 

?  =  Jie  +  pn  =  -2  +  2  ZieRi  (3) 

i  I 


and 


m  =  me  +  mn  =  -2  X  A  +  2  ttt-  &  X  ?7  (4) 

/  2 me  j  2MiC 

The  quantities,  e ,  m,  ?„  and  pu  are  the  charge,  mass,  position,  and  momentum 
of  the  zth  electron,  Z7e,  Mh  Rh  and  PIy  are  the  charge,  mass,  position,  and  mo¬ 
mentum  of  the  /th  nucleus,  and  c  is  the  speed  of  light.  The  deceptively  simple 
Rosenfeld  relationship  (2)  has  been  difficult  to  put  into  practice,  and  despite  a  20 
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year  head  start  for  CD  spectroscopists,  the  first  reliable  determination  of  the  absolute 
configuration  of  any  substance  [2]  was  accomplished  by  x-ray  diffraction  in  1951, 
100  years  after  Pasteur  showed  that  optical  activity  has  a  molecular  origin.  The 
transition  may  be  between  electronic  states,  typically  the  ground  state  and  an  elec¬ 
tronically  excited  state,  or  it  may  be  between  different  vibrational  levels  of  the 
ground  state.  The  theoretical  description  of  ECD  and  VCD  [3,4]  therefore  reduces 
to  the  task  of  evaluating  these  transition  moments  in  a  computationally  viable 
manner.  Because  of  the  inherent  complexities  involved,  reasonably  reliable  eval¬ 
uation  of  transition  moments  has  only  been  available  relatively  recently  for  small 
molecules.  For  the  foreseeable  future,  direct  application  of  ab  initio  methods  to 
even  the  smallest  biopolymers  must  remain  out  of  reach.  However,  it  was  realized 
very  early  that  coupled  oscillating  electric  dipoles  separated  in  space  and  chirally 
disposed  will  have  an  associated  magnetic  dipole  moment.  Coupling  of  degenerate 
electric  dipole  allowed  transitions  (exciton  coupling)  of  two  chromophores,  suitably 
oriented  in  space,  will  therefore  give  rise  to  a  couplet  of  Cotton  Effects  (CEs)  whose 
order,  positive-negative  or  negative-positive,  is  intrinsically  related  to  the  hand¬ 
edness  of  the  spatial  arrangement  of  the  chromophoric  groups.  The  exciton  coupling 
technique,  as  elucidated  by  Harada  and  Nakanishi  [5,6],  can  be  reliably  used  to 
determine  the  absolute  configurations  of  molecules  that  have  degenerate  chromo¬ 
phores,  and  the  principles  of  the  approach  have  been  extended  to  large  arrays  of 
quasidegenerate  chromophores,  as  may  occur  in  a  polymeric  material  such  as  a 
peptide  or  a  nucleic  acid. 

We  describe  here  successful  ab  initio  approaches  to  both  ecd  and  VCD  which 
have  been  implemented  as  computer  programs  and  which  permit  the  prediction 
of  optical  rotatory  strengths  by  Eq.  (2),  citing  several  studies  on  small  molecules 
that  have  been  carried  out  in  our  laboratories.  The  results  of  the  ab  initio  studies 
can  be  applied  to  the  much  larger  molecules  of  interest  to  biochemists,  as  illustrated 
by  the  study  of  the  conformations  of  taxol  described  below.  We  conclude  by  de¬ 
scribing  several  theoretical  and  experimental  approaches  to  the  determination  of 
information  about  the  secondary  and  tertiary  structures  of  polypeptides  and  nu¬ 
cleic  acids. 


Theory  of  Electronic  Circular  Dichroism 

The  usual  practice  in  electronic  structure  calculations  is  to  invoke  the  Born- 
Oppenheimer  approximation  of  fixed  nuclei.  Electronic  transitions  are  accurately 
described  in  the  same  approximation,  i.e.,  as  vertical  transitions.  This  has  the  con¬ 
sequence  that  the  nuclear  part  of  the  electric  and  magnetic  dipole  moment  operators 
do  not  contribute  to  the  transition  moments,  and  one  may  deal  solely  with  the 
electronic  part  of  the  molecular  wave  function.  The  method  used  to  calculate  electric 
dipole  and  magnetic  dipole  transition  moments  is  based  on  a  configuration  inter¬ 
action  (Cl)  expansion  of  the  ground  and  excited  states.  In  order  to  make  this  approach 
viable,  the  expansions  must  be  truncated  severely.  Each  wave  function  is  considered 
to  consist  of  a  small  set  of  strongly  interacting  configurations  (the  “zero-order”  set) 
and  a  much  larger  set  of  weakly  interacting  configurations  (the  “first-order”  set). 
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The  effects  of  the  zero-order  set  are  accommodated  by  diagonalization  of  the  Ham¬ 
iltonian  matrix  of  these  configurations.  The  effects  of  the  first-order  set  are  included 
by  perturbation  theory.  The  ground  and  excited  states  are  assumed  not  to  mix.  In 
assembling  the  dipole  matrix  elements,  all  terms  up  to  first  order  are  kept.  The 
method,  implemented  in  a  computer  code  we  call  pci,  has  seen  extensive  use  [7- 
10],  and  has  been  described  in  detail  elsewhere  [1 1].  A  brief  summary  here  serves 
to  illustrate  the  main  features.  Partially  correlated  wave  functions  for  the  ground 
and  excited  states  are  determined  to  first  order  in  Rayleigh-Schrodinger  perturbation 
theory, 


*„  =  *°n-  ^jClnff  (5) 

where 

0*  =  -  (*°n\H\*°n))  (6) 

H  is  the  exact  Hamiltonian,  <3?/  is  a  singlet  singly  or  more  highly  excited  config¬ 
uration  derived  from  the  Hartree-Fock  determinant,  $HF)  and  'P °n  is  a  linear  com¬ 
bination  of  strongly  interacting  <£?  (/  ¥=  j)  selected  from  at  most  singly  excited  con¬ 
figurations.  Thus,  for  the  ground  state,  -  $HF.  All  configurations  for  which  the 
interaction  coefficient  anj  [Eq.  (6)]  is  greater  than  0.03  are  normally  included  in  the 
zero-order  part  of  the  Cl  wavefunction.  Electric  dipole  transition  moments  in  the 
length  (r)on  and  velocity  (v)on  formalism  and  magnetic  dipole  transition  moments 
(m)on  are  explicitly  evaluated  from 

<r>o„  =  <*oiri*n>  (7) 

0>o„  =  <*o  I V*  |  *„>/(£„  -  Eo)  (8) 

(m)„ o  =  -i(Vn  I  me\ ¥0>  (9) 

where  the  dipole,  gradient,  and  magnetic  moment  operators,  ~jle,  V*  and  me,  have 
their  usual  definitions  [see,  for  example,  Eqs.  (3)  and  (4)],  and 

E„  =  (10) 

Oscillator  strengths  Rai,  are  calculated  by  the  “mixed”  formalism 

f0n  =  l  <^o|V|^„)-<r)^  (11) 

Optical  rotatory  strengths  are  evaluated  as 

[Ro„Y  =  (r)on-(m)„  o  (12) 

and  in  the  origin  independent  form 

[^0„r  =  <^>0«-<^)„0  (13) 

The  extent  of  the  deviation  of  the  vectors  (r)  and  (y)  from  collinearity  and  in 
magnitude  is  a  measure  of  the  quality  of  the  wave  function  for  the  particular  states 
and  of  the  origin  dependence  of  [R]r. 
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In  practice,  configurations  are  generated  from  only  10  to  20  of  the  highest  occupied 
MOs  and  the  lowest  50  to  75  unoccupied  MOs.  Program  PCI  is  implemented  on 
IBM  RS/6000  work  stations  and  acquires  molecular  orbital  coefficients  and  integrals 
from  the  GAUSSIAN  92  [12]  read-write  file. 

There  is  only  one  other  ab  initio  implementation  of  the  theory  of  optical  activity 
to  calculate  optical  rotatory  strengths,  that  due  to  Hansen  and  Bouman  [13],  based 
on  the  random-phase  approximation  (rpa)  [14]  and  implemented  in  the  program 
package,  rpac  [15]. 

We  describe  below  an  application,  of  ECD  as  implemented  in  the  program  pci  to 
the  study  of  the  chiroptical  properties  of  the  disulfide  group  [16]. 

The  Chiroptical  Properties  of  the  Disulfide  Group 

The  disulfide  chromophore,  which  occurs  in  proteins  as  disulfide  bridges  formed 
from  the  oxidative  coupling  of  L-cysteine  moieties,  may  exhibit  a  non-negligible 
CD  in  the  region  230-330  nm.  The  equilibrium  geometry  about  a  disulfide  bond 
corresponds  to  a  chiral  arrangement  of  the  bonds  to  sulfur  with  a  dihedral  angle 
in  the  vicinity  of  90°.  The  essential  features  of  the  electronic  states  may  be  under¬ 
stood  from  an  analysis  of  the  state  correlation  diagram  for  the  simplest  model, 


Figure  1 .  The  state  correlation  diagram  for  H2S2  of  P  chirality.  The  numbers  at  the  points 
are  the  values  of  the  rotatory  strengths,  [jR]r  X  10"40  cgs,  of  the  transition  to  that  state  at 
the  geometry  indicated  by  the  points  (see  Ref.  16). 
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Figure  2.  Calculated  ecd  spectra  of  P-di  methyl  disulfide  at  three  values  of  the  dihedral 
angle.  Gaussian  line  shapes  whose  areas  correspond  to  the  calculated  rotatory  strengths 
(width  at  half  height  —  0.67  eV)  were  centered  on  the  transition  energies.  The  vertical  bars 
are  proportional  to  [Pf  (see  Ref.  16). 


H2S2,  shown  in  Figure  1  [16].  The  first  two  transitions  are  to  valence  states,  and 
are  designated  n  tr*.  Over  the  range  of  dihedral  angle  which  corresponds  to  P 
chirality,  the  lowest  excited  state  has  B  symmetry  in  the  range  0  <  90°  and  A 
symmetry  in  the  range  0  <  90°.  The  lowest  B  state  increases  in  energy  with  increasing 
dihedral  angle  until  it  has  an  avoided  crossing  with  a  descending  state  of  the  same 
symmetry  near  0  =  1 20°.  The  predicted  rotatory  strength  of  the  1 1 A  -►  1  lB  transition 
increases  monotonically  with  dihedral  angle,  reaching  a  maximum  positive  value 
near  100°  and  then  decreases  rapidly  to  zero.  The  second  excited  state,  2ja,  increases 
with  dihedral  angle  until  it  suffers  an  avoided  crossing  near  <f>  -  60°  and  then 
rapidly  descends,  crossing  the  rising  1  *B  state  to  become  the  lowest  excited  state  at 
0  >  90°.  The  rotatory  strength  of  the  1!a  -►  2*a  transition  is  positive  before  the 
avoided  crossing,  but  changes  sign  and  reaches  its  largest  negative  value  shortly 
after.  The  rotatory  strength  then  decreases  monotonically  in  magnitude  with  in¬ 
creasing  dihedral  angle.  At  the  crossing  of  the  two  states,  a  net  positive  rotatory 
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strength  is  predicted.  Dimethyl  disulfide  is  a  more  realistic  model  for  cystine.  The 
state  correlation  diagram  is  qualitatively  similar.  Simulated  CD  spectra  of  dimethyl 
disulfide  of  P  chirality  at  three  dihedral  angles  are  shown  in  Figure  2.  At  dihedral 
angles  less  than  or  equal  to  90°,  a  positive  first  Cotton  effect  (CE)  is  predicted, 
whereas  for  greater  than  100°,  a  first  negative  CE  is  expected,  followed  by  a  positive 
CE  [16]. 


Theory  of  Vibrational  Circular  Dichroism 

For  vibrational  transitions  it  is  not  appropriate  to  use  the  Born-Oppenheimer 
(bo)  approximation  because  the  ground  and  excited  states  in  the  context  of  vibra¬ 
tional  transitions  have  the  same  electronic  wave  function  and  differ  only  in  the 
nuclear  wavefunctions,  a  consequence  of  which  is  that  the  electronic  contribution 
to  the  magnetic  dipole  transition  matrix  element  vanishes  in  the  BO  approximation. 
One  must  therefore  use  non-BO  wavefunctions  [  1 7-20].  The  molecular  wavefunction 
of  an  arbitrary  electronic-vibrational  state,  {e\  j/)>  can  be  expanded  in  terms  of 
adiabatic  bo  wavefunctions  as 


*(r,  Q)e'y  «  W,  GV$(G)< 


2  a(QU',eMr,  Q)enQ)es  (14) 

e&e*) 


where  the  coefficient,  a(Q)e'v'^  is  approximated  by  first  order  Rayleigh-Schrodinger 
perturbation  theory,  using  that  part  of  the  nuclear  kinetic  energy  operator  that  acts 
on  the  electronic  part  of  the  product  wavefunction  as  the  perturbation.  Thus, 

(nr,  Q)em)eJf\nr,  ev$(6W)  (15) 

E(Q)ev  _  E(Q)e,y 


where 


A  missing  first  order  term  which  describes  the  nuclear  (represented  by  G)  de- 
pendence  of  the  electronic  part  of  the  wavefunction  is  introduced  by  means  of  a 
Taylor  expansion  about  the  equilibrium  geometry  of  the  ground  electronic  state 

nr,  Q),  ~  CM.  +  (^f)  Q  (17> 

Only  first  order  terms  are  kept  in  the  wavefunction  and  in  the  derived  matrix 
elements.  Nuclear  wavefunctions  are  assumed  to  be  harmonic  oscillator  wave- 
functions.  The  harmonic  force  field  and  the  nuclear  displacement  dependence  of 
the  electronic  part  of  the  wave  function  are  readily  derived  by  standard  quantum 
mechanical  methods  as  implemented  in  computer  codes  such  as  GAUSSIAN 
92  [12]. 

Implementations  of  two  a  priori  theoretical  models,  vibromc  coupling  theory 
(VCT)  [18]  and  the  magnetic  field  perturbation  (MFP)  theory  [19,20],  and  two  ap- 
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proximate  but  nonempirical  models,  the  localized  molecular  orbital  (LMO)  theory 
[21]  model  and  the  locally  distributed  origins  gauge  (ldo)  model  [22],  have  been 
developed,  and  except  for  the  last  and  most  recent,  have  seen  broad  application  in 
recent  years. 

In  the  vct  method  [18],  the  individual  electric  and  magnetic  dipole  transition 
moments  are  evaluated  by  Eqs.  (18)  and  (19),  respectively, 
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where  Z7  is  the  bare  nuclear  charge  of  atom  /  with  positional  vector,  Rh  and  dis¬ 
placement  vector  Sj  =  (dRj/dQj)  during  a  normal  vibrational  mode  Qj9  r(  and  /?, 
are  the  position  and  momentum  of  the  rth  electron,  is  the  frequency  of  the  jth 
normal  mode,  and  Ee  —  E0  is  the  vertical  electronic  excitation  energy.  The  electronic 
wavefunctions  are  denoted  as  \R0  and  for  ground  and  excited  states,  respectively. 
The  superscript  vc  indicates  that  these  expressions  describe  a  vibronic  coupling 
mechanism  for  ir  and  VCD  intensities.  The  subscripts  (00,01)  and  (01,00)  signify 
that  the  transition  is  between  the  0  and  1  vibrational  levels  of  the  ground  electronic 
state  of  the  molecule. 


Since  the  usual  description  of  the  ground  electronic  wavefunction  is  the  Hartree- 
Fock  determinant  (the  SCF  level),  the  set  of  excited  states  may  be  restricted  to  the 
set  of  singlet  spin  adapted  singly  excited  configurations  as  was  the  practice  in  the 
case  of  ECD.  The  vct  rotatory  strength  is  then  the  imaginary  part  of  the  scalar 
product  of  ?(oo,oi)j  and  m\ of.ooy-  The  atomic  polar  tensor  (apt),  P^,  and  atomic 
axial  tensor  (aat),  M^,  can  be  extracted  from  Eqs.  (18)  and  (19)  respectively,  as: 
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The  definition  for  the  nuclear  and  electronic  parts  of  differ  by  a  factor  of 
i/2h  from  the  one  introduced  by  Stephens  [23].  The  aats  are  in  general  origin 
dependent,  and  may  be  calculated  either  in  a  common  origin  (co)  gauge  or  in  a 
distributed  origin  (DO)  gauge  [23,24].  With  rare  exceptions,  the  DO  results  are  con¬ 
sistently  better  than  the  CO  results  for  each  basis  set  and  at  both  the  SCF  and  mp2 
levels.  In  the  latter,  the  electronic  ground  state,  'Rq,  is  approximated  by  the  nor¬ 
malized  mp2  wavefunction,  i.e., 
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/  occspin  Vfrspin  \ 

*0  =  -^(^HF  +  ^MF2)  =  -M^HF  “I"  2  2  Cab*Pabj  (22) 

where  N  is  the  normalization  factor,  $Hf  is  the  hf  determinantal  wavefunction, 
$2,  is  the  doubly  excited  configuration  generated  by  replacement  of  the  occupied 
orbitals  a  and  b  by  the  virtual  orbitals  r  and  s,  and  Crasb  is  the  mp2  coefficient.  The 
apts  at  the  scf  level  or  mp2  level  may  be  adopted  directly  from  the  GAUSSIAN 
codes  [25].  aats  at  both  levels  are  obtained  using  the  locally  developed  program 
vct90  [24,26,27]. 

The  locally  distributed  origin-gauge  (ldo)  model  is  derived  from  the  exact  a 
priori  VCT  formalism  by  ignoring  terms  involving  local  atomic  magnetic  dipole 
moment  operators  and  converting  the  remaining  term  involving  velocity-form 
atomic  apt  contributions  to  position-form  in  a  well-defined  manner.  The  LDO 
model  rotational  strength  expression 

(i?“)LDO  «  J-  (  2  2  [PUBSa„,)\  •  f  2  2  e,yMPUB'SA'^\  (23) 

employs  a  position-form  apt  broken  down  into  atomic  contributions,  with  the 
moment  arm  in  the  magnetic  dipole  transition  moment  term  directed  to  the  con¬ 
tributing  atomic  center.  This  origin  choice,  which  differs  from  the  DO  gauge  (wherein 
the  moment  arm  is  to  the  moving  atom)  provides  a  more  local  description  of  the 
contributions  to  vcd  intensity.  This  method  requires  only  unperturbed,  nonlocal- 
ized,  ground-state  electronic  wavefunctions,  and  is  thus  suitable  for  calculations  on 
larger  molecules  than  those  to  which  the  current  a  priori  methods  are  limited.  The 
LDO  model  has  been  implemented  as  an  extension  of  the  vct90  software.  Good 
agreement  in  sign  and  relative  magnitude  for  calculations  with  the  LDO  model  and 
the  a  priori  methods  has  been  found  for  small  molecules  such  as  (5,5')-oxirane-2,3- 
d2,  (S, ^-cyclopropane- 1 ,2-d2,  and  for  the  OH-  and  NH-stretches  in  R-methyl  lactate 
and  (S)-l-amino-2-propanol  [22]. 

The  apts  and  aats  have  been  reformulated  [28]  in  terms  on  nuclear  shielding 
tensors  and  implemented  using  the  rpa  [14]  approximation  to  evaluate  the  sum 
over  states.  The  ab  initio  implementation  has  been  incorporated  into  the  RPAC 
program  [15]. 

In  vct,  the  first-order-non-BO  description  of  the  electronic  part  of  the  magnetic 
dipole  transition  moment,  at  the  level  of  first  order  Rayleigh-Schrodinger  pertur¬ 
bation  theory,  is  the  same  as  a  magnetic  field  perturbation  of  the  electronic  part  of 
the  wavefunction, 

/d^o\  =  _  y  M  y  (24) 

Uko  io  E9-Eo  € 

The  magnetic  field  dependence  of  the  electronic  part  of  the  wavefunction  [the  left- 
hand  side  of  Eq.  (24)]  at  the  scf  level  may  be  evaluated  directly  by  means  of  a 
coupled  perturbed  Hartree-Fock  calculation.  This  is  the  basis  of  the  magnetic  field 
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perturbation  (mfp)  formulation  of  VCD  intensities  [19,20].  The  mfp  approach  has 
been  implemented  first  by  Stephens  and  co-workers  [29]  and  others  [30,31],  and 
has  been  incorporated  into  the  ab  initio  program  system,  cadpac  [32].  Very  recently, 
the  mfp  method  has  been  extended  to  include  aats  evaluated  over  mcscf  wave- 
functions  expanded  in  terms  of  gauge  invariant  (London)  atomic  orbitals  [33]. 

Table  I  compares  the  results  of  different  VCD  models  with  Hartree-Fock-limit 
basis  sets  for  nhdt,  the  largest  molecule  for  which  such  a  limiting  comparison  may 
be  carried  out.  The  last  column  and  column  9  [vcT/MP2/vd3p(u)]  differ  by  an 
average  of  8%  and  represent  the  most  accurate  values  realistically  achievable  at  the 
post-HF  level.  A  direct  comparison  with  actual  experimental  results  for  the  2,3- 
dideuteriooxirane  molecule  [34]  is  shown  in  Table  II.  A  measure  of  the  average  % 
deviation  from  experimental  results  is  provided  across  the  bottom  of  the  table.  The 
mfp  method,  using  mp2  apts  and  force  field,  but  SCF  aats  [35],  yields  exceedingly 
good  results  in  all  regions  of  the  spectrum  except  the  C — H  stretching  region.  The 
VCT/MP2  is  a  little  less  successful  but  performs  equally  well  throughout  the  spectrum 
[27].  Accumulated  experience  with  vcd  calculations  suggests  that  the  most  critical 
factor  is  the  quality  of  the  molecular  force  field,  the  mp2  ab  initio  force  field  being 
significantly  superior  to  an  SCF  force  field.  Thus  reasonably  accurate  results  are 
achievable  with  a  much  smaller  basis  set  provided  an  mp2  force  field  is  used.  The 
vct/mp2/6-31G*  results  listed  in  Table  II  (column  5)  are  an  indication  of  this.  It 
is  particularly  noteworthy  that  comparable  results  may  be  obtained  at  the  fully  scf 
level  with  the  6-31G*(0  3)  (superstar)  basis  set  (Table  II,  column  9),  which  was  orig¬ 
inally  designed  to  mimic  a  much  larger  basis  set  for  vcd  applications  [36],  but 
which  fortuitously  yields  MP2-like  force  fields  and  geometries.  Experience  gained 
in  applications  to  a  series  of  three-ring  heterocycles  [24,37-40]  for  which  experi¬ 
mental  results  are  also  available  suggests  that  vct/scf/6-31G*(0  3)  calculations  may 
be  applied  with  confidence  to  larger  systems  which  begin  to  have  some  biological 
interest. 

We  describe  below  an  example  of  the  application  of  an  ab  initio  vcd  theoretical 
model,  the  ldo  model,  to  the  resolution  of  a  conformational  problem  in  an  inter¬ 
mediate  sized  molecule,  taxol. 


Conformational  Analysis  of  Taxol  Side-Chains 

A  combination  of  molecular  modeling  and  ab  initio  quantum  mechanical  cal¬ 
culations  and  VCD  and  IR  measurements  have  been  used  to  define  the  conformations 
of  the  anticancer  chemotherapeutic  agent  taxol  and  its  side-chain  methyl  ester  [41], 
This  work  was  a  collaborative  project  of  C.  S.  Swindell  and  co-workers  L.  E.  Chirlian, 
M.  M.  Francl,  J.  M.  Heerding,  and  N.  E.  Krauss,  of  Bryn  Mawr  College,  who 
provided  samples  [taxol  (1),  O-Ac-Baccatin  III  (2),  the  taxol  side-chain  methyl  ester 
(3)  and  taxols  with  modified  side-chains  4  and  5]  and  carried  out  molecular  modeling 
calculations,  and  T.  B.  Freedman  and  co-workers  D.  M.  Gigante  and  X.  Qu,  of 
Syracuse  University,  who  measured  the  spectra  and  performed  the  ab  initio  cal¬ 
culations  of  geometries,  frequencies,  and  vcd  intensities  of  taxol  fragments. 
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Table  II.  Rotatory  strengths3  of  (5',5')-2,3-ciideuteriooxirane  in  do  gauge. 
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Ph 


1  Taxol 


0 

Ph'^NH 

Ph'4'f>C02Me 

OH 

3  Taxol  side-chain 
methyl  ester 


2  O-Ac-Baccatin  HI 


0 

Ph^NH 


3k^>C02Me  ph/5xpCOjMe 


OH 


OH 


4, 5  Modified  taxol  side-chain  esters 


Conformational  data  on  the  free  taxol  ligand  in  aqueous  and  nonaqueous  solution 
is  a  preliminary  step  to  the  determination  of  the  bound  conformation  of  taxol  to 
cellular  microtubules.  Due  to  the  rigidity  of  the  baccatin  III  nucleus,  these  studies 
have  focused  on  the  side-chain  conformation.  In  aqueous  solution,  H-NMR  studies 
reveal  a  conformation  with  anti  2'-CH  and  3-CH  side-chain  bonds  [42],  whereas 
only  gauche  bonds  are  detected  in  chloroform  solution  [43].  In  the  crystal  structure 
of  the  side-chain  methyl  ester  3,  these  bonds  are  also  gauche  [44].  No  crystal  structure 
of  taxol  itself  is  available,  vcd  is  a  useful  additional  tool  for  determining  the  taxol 
side-chain  conformation,  since  VCD  intensity  arising  from  local  OH-  and  NH- 
oscillators  provides  information  on  the  local  chiral  environments  of  the  oscillators 
and  is  particularly  sensitive  to  stabilizing  intramolecular  hydrogen  bonding. 

Spectra  of  1,  2,  and  3  were  recorded  in  the  OH-  and  NH-stretching  regions 
(3800-3300  cm"1)  on  samples  0.0025  to  0.005  M  in  chloroform.  In  Figure  3,  the 
vcd  spectrum  of  the  taxol  side-chain  ester  (— )  is  compared  with  vcd  spectra  of 
taxol  from  which  the  vcd  features  of  the  baccatin  III  nucleus  have  been  subtracted 

( _ )?  or  from  which  the  features  of  both  baccatin  III  and  the  side-chain  methyl  ester 

have  been  subtracted.  For  the  side-chain  of  1,  a  single  intense  positive  OH- 
stretching  vcd  band  is  observed  at  3525  cm-1,  ascribed  to  a  conformer  with  strong 
OH  •  •  •  0=C  hydrogen  bonding  by  comparison  to  the  vcd  spectra  of  R-methyl 
lactate  and  R-methyl  mandelate  [45],  which  exhibit  similar  positive  OH-stretching 
vcd  features.  This  type  of  conformation  is  found  in  the  side-chain  crystal  structure, 
shown  in  Figure  4(A).  No  vcd  is  observed  for  the  peptide  NH-stretch  of  the  side- 
chain  corresponding  to  the  intense  sharp  absorption  feature  at  3443  cm"1,  assigned 
to  free  NH-stretch.  The  vcd  spectrum  of  the  A-ring  side-chain  in  taxol,  obtained 
by  subtracting  the  vcd  features  arising  from  the  OH-stretches  on  the  baccatin  III 
nucleus  from  the  vcd  spectrum  of  taxol,  is  dominated  by  an  intense  positive  feature 
that  coincides  with  that  of  the  side-chain  OH-stretch,  but  in  addition  two  new 
positive  bands  are  observed.  The  band  at  3592  cm  1  is  ascribed  to  a  more  weakly 
bound  OH-stretch,  and  the  fairly  sharp  positive  feature  at  3430  cm"1,  which  is 
shifted  to  lower  frequency  by  1 3  cm" 1  compared  to  free  NH-stretching  absorption, 
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WAVENUMBERS 


Figure  3.  Side-chain  OH-  and  NH-stretching  VCD  features:  (— )  side-chain  methyl  ester 
[1];  ( — )  [(taxol)-(  1 3-0-Ac-baccatin  III)]  vcd  difference  spectrum  [taxol-2];  (•  •  •  «)  [(taxol)- 
( 1 3-0-Ac-baccatin  III)-(side  chain  ester)]  vcd  difference  spectrum  [taxol-2-3]. 


is  assigned  to  a  hydrogen-bonded  NH-stretch.  By  contrast,  the  taxol  with  modified 
side-chain  4  or  5  exhibits  only  a  single  positive  OH-stretching  vcd  feature  that  is 
identical  in  frequency  and  intensity  with  the  spectra  of  the  corresponding  free  side- 
chain  methyl  ester.  These  data  are  consistent  with  a  dominant  side-chain  confor¬ 
mation  of  the  native  taxol  that  corresponds  to  (A),  and  a  minor  conformation  such 


Figure  4.  Taxol  side-chain  methyl  ester  conformations:  (A)  crystal  structure  geometry; 
(B)  structure  obtained  from  (A)  by  rotation  of  OH  and  ester  groups  for  OH  •  •  •  OCH3 
hydrogen  bonding  with  dihedral  angles  as  calculated  for  corresponding  methyl  lactate 

conformer. 
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as  (B),  in  which  a  ~180°  rotation  of  the  ester  group  introduces  a  weaker 
OH  •  •  •  OCH3  hydrogen  bond  and  a  reorientation  of  the  OH  that  now  allows 
NH  *  •  ■  OH  hydrogen  bonding  (Fig.  4).  We  have  previously  observed  that  peptide 
or  amino  NH  bonds  adjacent  to  a  chiral  center  and  involved  in  intramolecular 
hydrogen  bonding  exhibit  intense  monosignate  NH-stretching  vcd,  but  similar  free 
NH-stretches  do  not  [46].  A  similar  minor  conformer  does  not  occur  for  the  modified 
taxols  with  side-chains  4  or  5. 

Conformational  searches  of  the  entire  taxol  structure  using  internal  coordinate 
Monte  Carlo  conformational  searches  with  a  molecular  mechanics  mm2  force  field 
and  a  chloroform  continuum  solvation,  model  were  carried  out  at  Bryn  Mawr  College 
[41].  This  study  identified  four  low  energy  structural  types,  shown  in  Figure  5  (C)- 
(F),  where  the  A-ring  side-chain  and  any  associated  atoms  of  the  baccatin  III  nucleus 
are  highlighted.  To  further  define  solution  structures  consistent  with  the  VCD  data, 
a  taxol  side  chain  fragment  with  the  phenyl  groups  replaced  by  hydrogen  was  em¬ 
ployed  for  ab  initio  geometry,  frequency  and  vcd  intensity  calculations.  Geometries 
and  frequencies  for  six  optimized  side-chain  backbone  conformations  related  to 
the  mm2  structures  or  the  aqueous  solution  conformation  were  calculated  [Fig. 
6(G)-(H),  with  positions  of  the  phenyl  groups  shown  in  black].  Calculations  were 
performed  with  Gaussian  90  [25]  with  a  6-3 1G*  basis  set. 

The  results  of  the  ldo  model  [22]  vcd  calculations  of  the  fragment  OH-  and 
NH-stretches  in  the  six  taxol  fragment  conformers  are  shown  in  Figure  6.  The 
lowest  energy  conformer  (G)  corresponds  to  the  crystal  structure  of  the  side-chain 
(A)  and  to  mm2  structure  (C).  The  calculated  ldo  model  vcd  intensity  for  the 
OH-stretch  is  large  and  positive,  with  only  a  very  small  calculated  NH-stretching 
vcd  intensity,  in  agreement  with  experimental  measurements  for  the  side-chain 
ester  and  the  major  taxol  conformer.  Conformation  (H)  corresponds  to  the  aqueous 
solution  conformation  with  trans  methine  bonds.  Positive  OH-stretching  VCD, 
smaller  than  that  for  (G)  is  calculated;  this  structure  is  ruled  out  by  nmr  for  the 
chloroform  solution  conformation  of  native  taxol,  but  it  may  be  important  for  the 
modified  taxols,  which  exhibit  smaller  OH-stretching  anisotropy  ratios.  The  cal¬ 
culated  NH-  and  OH-stretching  VCD  intensities  for  conformation  (I)  are  both  pos¬ 
itive,  but  small.  Although  the  frequency  shift  for  (I)  compared  to  (G)  for  the  OH- 
stretch  agrees  with  that  for  the  minor  taxol  conformer,  the  NH-stretching  frequency 
does  not  decrease,  and  the  transannular  OH  -  •  -0=C  hydrogen  bonding  in  the 
corresponding  mm2  structure  (D)  would  decrease  the  OH-stretching  frequency  for 
the  side  chain  when  attached  to  baccatin  III.  In  conformation  J,  which  corresponds 
to  mm2  structure  (E),  the  OH-stretching  vcd  intensity  is  calculated  to  be  negative 
and  the  OH-stretching  frequency  to  be  lower  than  that  for  (G).  The  frequency  shifts 
and  NH-stretching  vcd  intensity  for  (K),  the  —'180°  ester  rotamer  of  (I),  do  not 
agree  with  experiment  for  the  minor  conformer.  Conformer  (L),  the  —180°  ester 
rotamer  of  (G),  which  corresponds  to  (B)  and  mm2  structure  (F),  is  calculated  to 
have  large  positive  vcd  intensities  for  both  the  NH-  and  OH-stretches  and  a  cal¬ 
culated  63  cm-1  increase  in  OH-stretching  frequency  and  1 1  cm  1  decrease  in  NH- 
stretching  frequency.  These  values  agree  well  with  the  positive  vcd  intensities  and 
frequency  shifts  (+67  and  -13  cm  ],  respectively)  observed  for  the  minor  taxol 
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E  F 

Figure  5.  Taxol  conformational  types  (C)-(F)  deduced  from  mm2  Monte  Carlo  confor¬ 
mational  searches. 


conformed  The  vcd  studies  and  geometry,  frequency,  and  intensity  calculations 
thus  support  conformational  type  (A/C/G)  for  the  major  conformer  in  taxol  and 
(B/F/L)  for  the  minor  conformer.  Further  examination  of  the  mm2  structures  reveals 
that  although  the  side-chain  OH  •  •  •  O  hydrogen  bonding  for  (F)  is  weaker  than 
that  for  (C),  steric  interaction  between  side-chain  and  baccatin  III  phenyl  groups 
is  relieved  in  (F)  and  a  stabilizing  /ram-annular  hydrogen  bond  can  form  between 
the  side  chain  NH  and  the  4-OAc  group  on  the  oxetane  ring. 

Chiroptical  Methods  for  Large  Molecules 

Larger  systems  require  the  application  of  approximate  theoretical  models.  We 
restrict  our  attention  to  applications  aimed  at  the  determination  of  secondary  and 
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G  0.0  Kcal 

v(OH)  4050  cm'1  R(OH)  =  +60 
v(NH)  3886  cm*1  R(NH)  = -0.3 


v{OH)  4023  cm*1  R(OH)  -  -67 
v(NH)  3888  cm*1  R(NH)  =  -3.9 


H  1.09  Kcal 

v(OH)  4051  cm*1  R(OH)  =  +31 
v{NH)  3882  cm*1  R(NH)  »  -0.3 


v(OH)  4068  cm*1  R(OH)  =  +28 
v(NH)  3910  cm*1  R(NH)  =  -1.0 


I  2.28  Kcal 

v{OH)  4109  cm*1  R(OH)  =  +2.6 
v(NH)  3890  cm*1  R{NH)  -  +4.8 


v(OH)  4113  cm*1  R(OH)  -  +49 
v(NH)  3875  cm*1  R(NH)  - +12 


Figure  6.  Gaussian  90  optimized  geometries  (6-3 1G*  basis)  calculated  for  a  taxol  side 
chain  methyl  ester  fragment  with  the  phenyl  rings  replaced  by  hydrogen  atoms.  Relative 
energies,  calculated  OH-  and  NH-stretching  frequencies  and  ldo  model  rotational  strengths 
(1044  esu2  cm2)  are  indicated  for  each  conformer. 


tertiary  structures  of  polypeptides  and  nucleic  acids.  The  intrinsic  difficulty  with 
chiroptical  methods,  as  with  any  spectroscopic  methods  applied  to  large  complex 
molecules  is  that  the  observed  spectrum  is  the  superposition  of  the  spectra  of  many 
subunits  which  differ  in  their  local  chirality  and  which  may  or  not  be  coupled  in 
some  way.  ECD  and  VCD  have  complementary  strengths  in  that  they  sample  different 
spatial  aspects  of  the  superstructure.  Electronic  transitions  may  have  rather  large 
dipole  moments  and  couple  over  large  distances,  ecd  therefore  may  be  expected 
to  be  diagnostic  of  long  range  order  of  the  superstructure.  On  the  other  hand, 
vibrational  transitions  are  associated  with  relatively  small  dipole  transition  moments, 
which  would  be  sensitive  to  local  geometry,  particularly  since  mechanical  coupling 
of  vibrational  modes  is  also  expected  to  contribute  to  the  chiroptical  properties. 

ECD  has  been  applied  to  the  analysis  of  peptide  secondary  structure  for  many 
decades.  The  basic  premise  is  that  biopolymers  exhibit  a  rather  limited  range  of 
structural  features  (a-helix,  /S-sheet,  “coil,”  etc.,  for  polypeptides,  A-form,  B-form, 
etc.,  for  polynucleotides).  Since  compositions  in  terms  of  these  structural  forms 
may  be  elicited  from  x-ray  and  nmr  data,  as  well  as  the  chiroptical  methods,  one 
approach  that  has  seen  good  success  is  the  deconvolution  of  a  set  of  experimental 
spectra  to  extract  spectra  of  the  “pure”  forms  (Fig.  7)  [47-50],  which  may  then  be 
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Figure  7.  Three  “pure”  ecd  curves  extracted  by  component  analysis  of  the  spectra  of  a 
set  of  proteins  (see  Ref.  58). 


fitted  to  spectra  of  polymers  of  unknown  make-up  to  obtain  the  fractions  or  weights 
of  the  various  forms. 

Little  in  the  way  of  “sector”  rules  has  emerged  to  assist  in  the  interpretation  of 
vcd  spectral  features  [51-54].  However,  early  VCD  measurements  of  polypeptides, 
principally  by  Keiderling’s  group,  have  demonstrated  that  different  secondary 
structural  types  give  rise  to  characteristic  vcd  patterns,  especially  in  the  amide  I 
(mainly  CO  stretch,  —  1 650  cm-1)  region  [55,56].  For  instance,  a-helical  structures 
yield  a  bisignate  amide  I  band,  the  sign  of  which  depends  on  the  handedness  of  the 


Figure  8.  Typical  vcd  spectra  of  polypeptides  in  the  Amide  I  region:  (a)  a-helix  (simulated 
spectrum  of  albumin  [Ref.  56)];  (b)  “random  coil”  (simulated  spectrum  of  poly-L-lysine 

[Ref.  56)]. 
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helix,  positive  for  a  right-handed  helix  [Fig.  8(a)],  and  not  on  the  absolute  config¬ 
uration  of  the  a-carbon.  The  “random  coil”  form  gives  rise  in  many  polypeptides 
and  proteins  to  a  large  negative  couplet,  especially  in  D20,  indicating  residual  order 
in  the  form  of  a  left-handed  helix.  Indeed,  VCD  has  been  shown  to  be  more  robust 
than  ecd  in  its  diagnosis  of  secondary  structure,  the  latter  being  more  sensitive  to 
the  presence  of  chromophores  other  than  the  amide  linkage  itself  [57],  A  component 
deconvolution  analogous  to  that  employed  for  ECD  has  also  met  with  good  success 
in  the  case  of  vcd  [57-59]. 

The  Empirical  Coupled  Oscillator  Models 

The  coupled  oscillator  models  of  chiroptical  properties  [60-63]  are  based  on 
molecular  exciton  coupling  theory  [64-66].  The  electronic  or  vibronic  transitions 
of  two  identical  chromophoric  groups,  with  frequency,  v0,  separated  in  space  by  a 
displacement  vector,  Rab,  will  couple  to  give  two  absorption  bands  at  frequencies, 

v+  =  v0  ±  Vab  (^5) 

with  associated  rotational  strengths, 

R±  =  +  ^  Rab  *  (jla  x  jlb)  (26) 

2c 

The  coupling  energy  is  assumed  to  be  primarily  due  to  dipolar  coupling, 

_  Ma  *  jj>b  _  3(/I a  *  Rgb)(]Zb  *  Rab)  ,yi\ 

\Rab\ 3  l^l5 

where  and  %  are  electric  dipole  transition  moment  vectors  associated  with  the 
two  chromophores.  The  approach  may  be  extended  to  more  than  two  coupled 
oscillators,  and  to  coupling  of  nearly  degenerate  oscillators  [67].  The  transitions 
should  be  strongly  electric  dipole  allowed.  Therefore,  tt-tt*  transitions  are  desirable 
for  ECD,  while  localized  excitations  such  as  carbonyl,  NH,  or  OH  stretching  vibra¬ 
tions  are  desirable  for  VCD.  One  needs  to  know  the  direction  and  magnitude  of  the 
transition  dipole  vectors.  The  magnitude  may  be  measured  from  the  normal  ab¬ 
sorption  spectrum.  The  direction  relative  to  the  chromophore  skeleton  can  be  in¬ 
ferred  from  the  electronic  structure  of  the  chromophore  if  the  nature  of  the  electronic 
transition  can  be  identified,  or  from  the  direction  of  the  bond  moment  in  the  case 
of  VCD.  Positioning  of  the  chromophores  in  space  corresponds  to  assignment  of 
dilferent  configurations  or  conformations  of  the  molecule.  Coupled  oscillator  models 
have  seen  wide  application  to  ecd.  They  form  the  basis  of  the  exciton  chirality 
method  for  the  determination  of  absolute  configurations  of  organic  compounds  as 
developed  by  Harada  and  Nakanishi  [5].  The  recent  report  by  Lightner  and  co¬ 
workers  on  the  conformational  analysis  of  bilirubin  illustrates  the  combined  power 
of  ECD  spectroscopy,  the  exciton  coupling  approach,  and  molecular  modelling  tech¬ 
niques  [68].  Coupled  oscillator  methods  were  first  applied  to  vcd  by  Holzwarth 
and  Chabay  [69],  who  applied  the  method  to  diketopiperazines  (formed  by  cyclic 
dimerization  of  amino  acids). 


334  RAUK  AND  FREEDMAN 

In  polypeptides,  the  obvious  chromophoric  transition  which  should  display  an 
intimate  connection  to  the  secondary  structure  is  the  tt-tt*  transition  of  the  amide 
backbone,  which  is  responsible  for  the  CD  bands  in  the  range  200-220  nm.  However, 
coupling  of  the  transitions  of  other  chromophores,  such  as  7r-7r*  transitions  of 
tyrosines  and  histidines,  or  n-a*  transitions  of  disulfide  links,  which  occur  at  longer 
wavelengths  (240-280  nm)  may  also  be  used  to  advantage.  A  matrix  formulation 
of  the  theory  of  optical  activity  has  been  developed  by  Bayley,  Nielsen,  and  Schell- 
mann  [70],  based  on  a  coupling  of  the  excited  states  of  the  various  chromophores 
in  a  manner  analogous  to  configuration  interaction.  Chromophores  may  be  coupled 
if  their  relative  coordinates  are  known.  The  method  has  been  applied  to  ribonuclease 
S  to  show  that  the  solution  phase  CD  spectral  features  may  be  predicted  from  the 
x-ray  coordinates,  thereby  suggesting  similarity  of  the  solution  conformation  to 
that  in  the  solid  state  [71]. 

The  limits  of  the  degenerate  coupled  oscillator  (dco)  model  (with  dipolar  cou¬ 
pling)  for  VCD  applications  have  been  evaluated  against  the  a  priori  magnetic  field 
perturbation  (mfp)  model  [72].  It  was  found  that  for  two  formaldehyde  molecules 
at  reasonably  large  separations,  the  dco  and  MFP  models  agreed  well,  but  severe 
and  unpredictable  deviations  may  occur  at  short  distances  or  when  the  coupling  is 
between  adjacent  oscillators  in  the  same  molecule.  In  the  latter  case,  the  frequency 
order  and  phasing  of  coupled  oscillators  is  best  obtained  from  an  ab  initio  normal 
coordinate  analysis,  rather  than  dipolar  coupling.  The  dco  model  has  been  applied 
to  predict  the  VCD  spectra  of  a  series  of  dicarbonyl-containing  steroids  with  sub¬ 
stantial  success  [73]. 


In  this  series  of  compounds,  which  satisfy  the  “remoteness”  criterion,  only  the 
steroid,  f$  37,  showed  only  a  monosignate  VCD  curve  rather  than  the  bisignate  signal 
expected  on  the  basis  of  the  coupled  oscillator  model  and  the  model  calculations. 

The  spectral  features  of  several  polypeptides  in  the  amide  I  (and  amide  II)  region 
were  well  reproduced  by  MFP  calculations  on  a  simple  alanine-like  model  dipeptide 
that  was  conformationally  constrained  to  mimic  the  local  geometry  of  not  only  a - 
helix  and  /3-sheet,  but  also  3i0-helix  and  left-handed  poly-L-proline  (as  a  model  for 
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random  coil)  [74].  The  DCO  model  failed  to  reproduce  these  features  of  the  dipeptides 
in  the  amide  I  region  and  fared  little  better  in  the  Amide  II  [74].  However,  the 
group  of  Diem  has  had  some  success  in  the  application  of  an  extended  coupled 
oscillator  model  to  the  conformations  of  a  prototypical  peptide,  poly-L-tyrosine 
[75],  and  have  used  the  method  to  detect  a  vcd  feature  in  the  amide  I  region 
associated  with  a  /3-turn  in  a  model  cyclic  peptide  [76]. 

Extensive  use  has  been  made  of  ECD  to  interpret  nucleic  acid  base  stacking, 
conformational  change,  and  double  helix  formation  [77].  If  the  nucleic  acids  are 
in  a  highly  condensed  form  (due  to  changes  in  the  tertiary  structure  from  extended 
to,  e.g.,  coiled),  additional  sources  of  rotatory  strength  are  observed  and  the  sign 
of  the  CD  may  depend  on  the  degree  of  aggregation.  This  psi-type  CD  [78]  has  been 
explained  by  invoking  intermediate  and  radiation  coupling  effects  as  well  as  dipole- 
dipole  coupling  [79].  Similar  effects  have  been  observed  in  complexes  between 
DNA  and  polypeptides  [80].  We  will  not  discuss  these  mechanisms  further. 

Polyribonucleic  acids  display  well-defined  couplets  in  their  vcd  spectra,  in  the 
1600  cm"1  to  1700  cm"1  region  (Fig.  9)  [81],  indicating  the  possible  applicability 
of  coupled  oscillator  models  [67,81].  The  separation  of  the  nucleic  acid  bases  in 
the  usual  helical  structures  ensures  independence  of  mechanical  coupling  of  chro- 
mophores  situated  on  them.  Three  of  the  nucleic  acid  bases  have  carbonyl  groups, 
but  since  the  molecules  also  have  conjugated  C=C  and  C=N  double  bonds,  it 


Figure  9.  vcd  spectra  of  poly(riboseadenine)  ( - ,  adapted  from  Ref.  81)  and 

poly(riboguanine).poly(ribocytosine)  (- . ,  adapted  from  Ref.  67). 
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is  not  clear  that  the  carbonyls  are  sufficiently  isolated  to  utilize  in  the  same  sense 
as  the  work  described  above  for  the  steroid-based  models.  Adenine  has  no  carbonyl 
groups  at  all.  Diem  and  co-workers  have  applied  the  degenerate  extended  coupled 
oscillator  (deco)  model  to  the  problem  of  the  helicity  of  nucleic  acids  carrying  out 
DECO  calculations  for  up  to  three  separate  modes  using  moment  directions  deduced 
from  semiempirical  force  field  calculations  [67]. 
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Abstract 

The  hydration  energies  of  the  proton,  hydroxyl  ion,  and  several  inorganic  ions  were  calculated  using 
the  multicavity  self-consistent  reaction  field  (MCa  scrf)  method  developed  for  the  quantum-mechanical 
modeling  of  rotationally  or  flexible  systems  in  dielectric  media.  The  ionic  complexes  H30+(H20)4, 
0H"(H20)4,  NH4 (H20)4,  and  Hal'(H20)4,  where  Hal  =  F,  Cl,  or  Br,  have  been  studied.  Each 
complex  was  divided  between  five  spheres,  corresponding  to  the  central  ion  and  four  water  molecules 
in  their  first  coordination  sphere,  respectively.  Each  cavity  was  surrounded  by  a  polarizable  medium 
with  the  dielectric  permittivity  of  water  at  room  temperature  (80).  The  ionic  hydration  energies  of  ions 
were  divided  into  specific  and  nonspecific  parts.  After  accounting  for  the  cavity-formation  energy  using 
scaled  particle  theory,  good  agreement  between  the  total  calculated  and  experimental  hydration  energies 
was  obtained  for  all  ions  studied.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Ad  hoc  calculation  of  ion  solvation  energies  has  been  a  target  for  many  generations 
of  physical  and  theoretical  chemists  [1-7].  The  classical  Born  model  [1]  of  the 
electrostatic  solvation  of  spherical  ions  had  been  thoroughly  tested  in  the  landmark 
work  by  Latimer,  Pitzer,  and  Slansky  [2],  and  by  others  [3-7].  The  more  recent 
developments  of  this  model  include  semiempirical  approaches  for  the  definition  of 
ionic  radii  in  solution  and  its  application  for  the  multiatomic  ions  of  more  complex 
shape  and  charge  distribution  (e.g.,  zwitterions)  [8-14].  Another  classical  approach 
to  ion  solvation  has  been  based  on  various  approximations  of  rigorous  statistical 
mechanical  theory  of  fluids,  which  include  the  mean,  spherical  approximation  (MSA) 
in  hypemetted  chain  (HNC)  or  Percus-Yevick  (py)  expansion  or  extended  reference 
interaction  site  (  rism  )  methods  [15-19].  The  Monte  Carlo  and  molecular  dynamic 
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simulations  of  ion  solvation  and  the  related  free-energy  perturbation  calculations 
[20-30]  can  conventionally  be  classified  also  as  classical  methods,  even  if  quantum- 
mechanical  intermolecular  interaction  potentials  are  applied  in  these  simulations. 

A  common  shortcoming  of  all  the  above-listed  methods  is  the  neglect  of  the 
influence  of  intermolecular  interactions  to  the  electronic  structure,  and  thus  to  the 
charge  distribution  of  the  solute  (ion)  and  solvent  molecules  in  the  vicinity  of  the 
solute.  Therefore,  different  variations  of  the  quantum-mechanical  approach,  based 
on  the  Bom  [1]  and  Kirkwood-Onsager  [31-33]  reaction  field  theories  of  the  di¬ 
electric  continuum  have  been  developed  by  a  number  of  groups  at  different  level 
of  quantum  theory  for  equilibrium  [34-59]  and  nonequilibrium  [60-63]  systems. 
Some  of  these  theories  have  been  subsequently  applied  to  the  calculation  of  the 
solvation  energies  of  molecules  and  ions  [64-70]. 

In  parallel  to  the  development  of  the  quantum-mechanical  continuum  solvation 
models,  the  ab  initio  calculations  of  ion-solvent  clusters  have  added  valuable  in¬ 
formation  about  the  structure  of  the  closest  solvation  shell  [71-82] .  While  in  most 
cases  correctly  calculating  the  ion  solvation  energies  in  different  gaseous  clusters 
[83,84],  these  supermolecule  calculations  neglect,  in  principle,  a  substantial  amount 
of  the  bulk  solvation  energy  due  to  the  solvent  orientational,  electronic,  and  nuclear 
polarization  in  the  field  of  the  solute  (ion).  Consequently,  the  most  feasible  way 
to  approach  these  calculations  would  seem  to  us  to  treat  the  closest  solvation  shell 
using  quantum  mechanics  and  then  include  the  dielectric  polarization  of  the  bulk 
solvent  [64,85-87].  A  recent  development  of  an  ab  initio  pseudopotential  free 
energy  approach,  an  interesting  unification  of  the  quantum  theory  and  statistical 
mechanics,  looks  very  promising,  from  this  viewpoint  [88,89]. 

In  this  work,  we  present  the  application  of  the  combined  supermolecule-contin- 
uum  approach  to  ion  solvation  using  a  recently  developed  multicavity  self-consistent 
reaction  field  (MCa  scrf)  method  [90] .  In  this  method,  the  (super) molecule  under 
the  study  is  divided  into  rotationally  inversionally  flexible  fragments,  each  of  which 
is  embedded  into  a  spherical  cavity,  surrounded  by  the  bulk  dielectric  continuum. 
Each  fragment  generates  its  own  reaction  field,  respectively,  which  are  interacting 
with  each  other  and  with  the  charge  distribution  in  other  fragments.  For  the  present 
application,  a  separate  cavity  has  been  ascribed  to  the  central  ion  and  each  of  the 
water  molecules  in  the  first  coordination  sphere. 

Theory 

The  MCa  scrf  method  has  been  described  in  detail  elsewhere  [  90  ]  and  thus  we 
present  only  the  essentials  here.  We  notice  that  the  interaction  of  charge  and  higher 
electrical  moments  of  a  charge  distribution  in  a  spherical  cavity  with  the  corre¬ 
sponding  reaction  fields  localized  in  the  center  of  cavity,  does  not  depend  on  the 
position  of  charge  or  (point)  multipole  centers  in  this  cavity  [33].  It  is  therefore 
possible  to  divide  a  molecule  between  two  or  more  spherical  cavities  which  border 
each  other  along  the  rotating  bond  or  the  bond  to  be  cleaved  in  the  process  of  a 
chemical  reaction.  Correspondingly  each  molecular  fragment  has  its  own  reaction 
field,  which  interacts  with  the  charge  density,  and  reaction  fields  of  other  fragments. 
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We  proceed  by  the  partitioning  the  solute  molecule  into  M  fragments,  indexed  as 
A,  B,  . . . ,  which  are  embedded  into  M spherical  cavities  with  radii  a0i  (i  =  A,  B, 

. . .).  Assuming  the  classical  Born-Kirkwood-Onsager  expansion  [33]  for  each  of 
these  fragments  one  obtains  an  expansion  for  the  total  energy  of  the  solute  in  a 
dielectric  medium. 

E  =  E0  +  is  (intrafragment)  +  E(  interfragment)  ( 1 ) 

The  terms  in  Eq.  ( 1 )  correspond  to  the  energies  arising  from  the  interaction  of 
the  partial  charge  and  the  moments  of  a  molecular  fragment  with  the  reaction  field 
( drf )  of  its  own  and  other  fragments  as  well  from  the  interaction  between  the 
reaction  fields  of  different  fragments.  Terms  in  these  expansions  proportional  to 
the  reaction  field  tensors  I\ , 


Tio  = 


e  —  1 
'l.Cdoi 


;r, 


(*-0  F 
(2c  +  1  )ali 


(2) 


can  be  referred  to  as  the  first-order  reaction  field  energies,  terms  proportional  to 
the  square  of  T  as  the  second-order  energies.  We  also  notice  that  the  use  of  the 
dielectric  permittivity  e  in  T  assumes  full  orientational  and  electronic  relaxation  of 
the  solvent  in  the  field  of  solute  fragment  and  therefore  corresponds  to  the  full 
coupling  between  rotational-inversional  movements  of  the  solute  fragment  and  the 
solvent  molecules.  In  formula  (2),  aoi  denotes  the  cavity  radius  for  a  given  frag¬ 
ment  i,  and  Ft  the  factor  accounting  for  a  net  charge  on  fragment  i  [90]. 

In  the  framework  of  the  one-electron  approximation,  the  quantum-mechanical 
extension  of  Eq.  ( 1 )  now  has  the  following  form,  up  to  the  dipole  term,  of 

Em  =  =  <0|H„|0>+  2  I\iOImaI0><<MmaI0> 

A 


+  2  M*|PaI*><*|PaI*>  +  22  iX*|maI*X*I/*bI*>/*5. 

A  A*B 

+  22  M*IPAl*><*|PB|*>/ft« 

A*B 

+  22  iX*|#*aI*X*IPbI*>/*5a+  22  M*ImaI*X*|PbI*>//&# 

A+B  A±B 

+  22  I Rab 

A*B 

+  22  r^.rfl<.<^|pA|^><0|PB|0>/^ 

A*B 

+  22  (3) 

A±B 


where  H0  is  the  solvent-unperturbed  Hamiltonian  of  the  solute  molecule,  RAb — 
the  distance  between  the  centers  of  adjacent  cavities,  jiA  and  mb  represent  the  dipole 
moment  operators  of  the  respective  molecular  fragments,  and  Pa  and  Pb  are  the 
projection  operators  of  the  partial  charge  on  fragments  A  and  B  to  the  total  molecular 
wavefunction  0,  respectively.  The  PA  and  PB  operators  can  be  defined  following 
one  of  the  possible  LCAO  MO  charge  partitioning  schemes  in  the  molecule.  Mulliken 
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charges  have  been  used  in  the  present  application.  It  is  possible  now  to  build  a 
variational  functional  for  the  total  energy  E  and  derive  the  corresponding  Hartree- 
Fock-type  equations  which  can  be  again  to  be  solved  iteratively  (scrf)  [90]. 

The  cavity  radius  for  the  water  molecule  was  calculated  on  the  basis  of  the  number 
density  p  at  the  room  temperature  as  follows  [91]: 

a0(H20)  =  (3/0/4TT)173  =  1.922  A.  (4) 

The  same  radius  was  used  for  the  cavities  for  hydroxonium  and  hydroxyl  ions,  by 
the  assumption  that  the  addition  or  the  removal  of  a  proton  from  the  water  molecule 
leads  to  a  small  change  in  its  internal  volume.  For  the  central  ion,  the  cavity  radius 
was  calculated  as  the  sum  of  the  corresponding  crystallographic  radius  of  the  ion 
and  the  diameter  of  the  water  molecule  (3.844  A).  The  following  crystallographic 
radii  were  used  [92]:  r(NHj)  =  1.49  A;  r(F’)  =.  1.33  A;  r(Cl")  =  1.81  A; 
r(Br")  =  1.96  A;  r(I")  =  2.20  A. 

Results  and  Discussion 

The  ionic  hydration  of  the  proton,  hydroxyl  ion,  and  5  simple  inorganic  ions 
(NHJ ,  F-,  CL-,  Br-,  and  I-)  was  studied  by  using  the  semiempirical  amI  theory 
[93,94]  with  the  inclusion  of  the  MCa  SCRF  model.  Thus  we  have  studied  the  ionic 
complexes  H30+(H20)4,  0H-(H20)4,  NHj(H20)4,  and  Hal-(H20)4,  where 
Hal  =  F,  Cl,  or  Br.  With  the  full  protonation  of  one  water  molecule  ( formation  of 
the  H30+  ion),  the  first  hydration  shell  of  the  proton  was  assumed  to  have  the 
following  structure 


In  the  case  of  hydroxyl  ion,  the  complex  initial  structure  was  as  follows 
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whereas  the  following  structure  of  the  first  coordination  shell  of  the  NH4 


and  halide  ions 


was  used. 

In  Table  I,  the  am  1  calculated  heats  of  formation  for  the  isolated  ions  and  water 
molecule,  and  ion- water  complexes  in  the  gas  phase  (e  =  1 )  and  the  high  dielectric 
constant  medium  (e  =  80),  are  given.  The  total  ion  hydration  energies  were  obtained 
according  to  the  following  procedure: 


A Hi  =  A^[I±(H20)4]>v  “ 

+  {AHfll^KjO)^-  4AHflH20]g}  (5) 

where  A Hf[X]w  and  A Hf[X]g  denote  the  calculated  heats  of  formation  for  the 


Table  I.  The  AMl  calculated  heats  of  formation  (kcal/mol)  for  the  isolated  ions 
(AHf  [HJg  and  ion-water  complexes  in  the  gas  phase  (A//;[r(H20)4]g)  and  in  the 
medium  with  the  dielectric  constant  e  =  80  (A//^[I±(H20)4]w). 


Ion 

A//j![I*(H20)4]g 

A//;[r(H20)4]w 

H+ 

362.68 

— 163.5  la 

— 207.70a 

H30+ 

143.48 

-163.51 

-208.81 

OH~ 

-14.12 

-327.43 

-371.58 

NH4 

150.58 

-136.08 

-178.28 

F” 

3.44 

-311.08 

-355.42 

cr 

-37.66 

-321.20 

-364.85 

Br" 

-20.41 

-296.70 

-338.60 

I" 

-2.22 

-274.66 

-316.88 

a  A///  [H+(H20)5]. 
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Table  II.  amI  MCa  scrf  calculated  specific  (A //•),  nonspecific  (A H™)  and  total 
electrostatic  (A Hi)  ion  solvation  energies  in  aqueous  solution  (kcal/mol),  and 
experimental  heats  of  solvation  (A//,(exp)  [98]). 


Ion 

A  Hj 

A  H? 

A  Hi 

A  Hi  (exp) 

H+ 

-229.89 

-45.30 

-275.29 

-261.7 

h3o+ 

-69.93 

-45.30 

-115.33 

— 

OH- 

-76.35 

-44.15 

-120.50 

-126.6 

NH4 

-49.70 

-42.20 

-91.90 

-76.6 

F" 

-77.56 

-44.34 

-121.90 

-124.1 

cr 

-46.58 

-43.65 

-90.23 

-90.0 

Br" 

-39.33 

-41,90 

-81.23 

-82.5 

r 

-35.48 

-42.22 

-77.70 

-71.8 

species  X  (ion-water  complex  I±(H20)4,  ion  I*  and  the  water  molecule)  in  the 
dielectric  medium  or  in  the  gas  phase,  respectively.  The  difference  in  curly  brackets 
corresponds  conventionally  to  the  specific  solvation  energy  of  the  ion  ( A H* )  whereas 
the  difference  between  the  heats  of  formation  of  the  ionic  complex  in  two  phases 
gives  the  nonspecific  (dielectric)  solvation  energy  of  the  ion  ( A Hf).  The  respective 
total  hydration  energies  and  its  components  are  given  in  Table  II.  It  is  most  inter¬ 
esting  to  note  that  the  variance  in  the  total  hydration  energies  of  different  ions  is 
caused  by  the  difference  in  the  specific  ion-water  interactions  in  the  first  coordination 
sphere  whereas  the  nonspecific  long-range  electrostatic  solvation  energies  are  re¬ 
markably  constant  (44  ±  2  kcal/mol).  A  good  correlation  between  the  calculated 


Figure  1 .  The  linear  relationship  between  the  am  1  MCa  scrf  calculated  total  electrostatic 
ion  hydration  energies  A//,  and  experimental  ion  hydration  energies  [98].  Filled  point 
corresponds  to  the  water  molecule  (electroneutral  compound). 
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and  experimental  total  hydration  energies  was  obtained  for  all  ions  studied  (cf.  Fig. 
1 )  with  the  following  regression  coefficients: 

A  Hi  =  (0.8  ±  7.0)  +  (1.03  ±  0.05)  AH,  (exp)  (6) 

(correlation  coefficient  R  =  0.994;  standard  deviation  s  =  8.6  kcal/mol).  The  slope 
of  this  linear  relationship  is  close  to  unity  (as  expected)  and  the  intercept  close  to 
zero.  However,  the  data  in  Table  II  indicate  some  overestimate  of  the  ion  hydration 
energies  by  amI  MCa  SCRF  results.  One  should  keep  in  mind  that  in  this  level  the 
theory  accounts  only  for  the  electrostatic  and  short-term  quantum  effects  of  inter- 
molecular  ion-solvent  interaction.  One  should  also  consider  the  term  for  the  cavity- 
formation  in  the  medium,  which  has  always  a  positive  contribution  to  the  solvation 
energy.  Considering  the  similarity  of  the  volume  of  the  ionic  complexes  studied  in 
this  work,  these  contributions  should  be  close  for  different  ions.  A  quantitative 
estimate  of  the  cavity-formation  energy  may  be  calculated  using  the  scaled  particle 
theory  (spt)  [95-97].  We  have  proceeded  from  the  following  formula  for  the 
reversible  work  W(r ,  p)  required  to  produce  a  cavity  of  radius  r  in  a  fluid  with  the 
number  density  p  [97] : 

W(r,p)  =  kT{-\n(\  -y)  +  [3y/(l  -  y)]R 

+  [3y/(  1  —  y)]R2  +  4.5[y/(  1  —  y)]2R2  +  yPR3fpkT}  (7) 

where  y  =  7rp<r?/6  is  the  reduced  number  density  of  the  solvent,  R  =  c2/ai,  and 
<x2  is  the  diameter  of  the  hard-sphere  solute  molecule  such  that  the  cavity  radius  is 
r  =  (<7i  +  <t2)/2.  The  term  proportional  to  the  internal  pressure  of  solution,  P,  is 
considered  negligible  (cf.  [95,96])  and  thus  neglected  in  our  calculation.  The  cal¬ 
culated  cavity-formation  energy  for  one  water  molecule  is  1.25  kcal/mol  (y  = 
0.2055,  using  additive  bond  refractions  calculated  internal  volume  of  the  water 
molecule  [98]). 


Table  III.  amI  MCa  scrf  (A Hj)  and  spt  (W)  calculated  neutral  electrolyte  solvation  energies,  the 
total  calculated  neutral  electrolyte  solvation  energies  (A Hs),  and  the  respective  experimental  heats  of 
solvation  ( AHs(exp )  [98])  in  aqueous  solution  (kcal/mol). 


Electrolyte 

A  Hi 

W 

A  Hs 

A//,  (exp) 

5a 

HF 

-397.19 

12.12 

-385.07 

-385.8 

0.7 

HC1 

-365.52 

12.43 

-353.09 

-351.7 

-1.4 

HBr 

-356.52 

12.52 

-344.00 

-344.2 

0.2 

HI 

-352.99 

12.67 

-340.32 

-333.5 

-6.8 

NH4OH 

-212.40 

12.20 

-200.20 

-203.2 

3.0 

NH4F 

-213.80 

11.82 

-201.98 

-200.7 

-1.3 

NH4C1 

-182.13 

12.15 

-169.98 

-166.6 

-3.4 

NH4Br 

-173.13 

12.22 

-160.91 

-159.1 

-1.8 

NH4I 

-169.48 

12.37 

-157.11 

-148.4 

-8.7 

a  5  =  AHS  -  AHs(exp). 
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AH(calc) 


-100  -200  -300  -400 


AH(exp) 


Figure  2.  The  linear  relationship  between  the  AMl  MCa  scrf  and  Spt  calculated  total 
hydration  energies  A Hs  and  experimental  hydration  energies  of  neutral  electrolytes. 


The  ionic  solvation  energies  are  obtained  by  the  separation  of  the  total  hydration 
energy  of  a  neutral  electrolyte  in  ionic  parts  using  various  empirical  schemes. 
Therefore  it  is  more  correct  to  compare  the  calculated  total  hydration  energies  of 
neutral  electrolytes  AT/,  (calc),  which  include  both  the  quantum-chemical  and  cavity 
formation  contributions,  with  the  corresponding  experimental  numbers,  ATT,  (exp). 
In  Table  III,  we  have  given  the  respective  data  for  nine  strong  electrolytes,  obtained 
by  the  combination  of  seven  ions  used  in  this  study.  An  excellent  agreement  between 
the  theoretical  and  experimental  results  is  obvious  (cf.  also  Fig.  2).  The  results 
correspond  to  the  following  linear  regression: 

AT/,  (calc)  =  (-4.7  ±  3.8)  +  (0.99  ±  0.01) ATT, (exp)  (7) 

(Correlation  coefficient  R  ~  0.999;  standard  deviation  s  -  3.8  kcal/mol).  The  slope 
of  this  relationship  is  again  unity  and  the  intercept  zero  within  the  limits  of  error. 
The  standard  deviation  of  the  correlation  is  somewhat  larger  than  the  estimate  of 
the  respective  experimental  error  (±2  kcal/mol)  and  thus  the  model  is  not  overfitted. 
Therefore,  by  using  only  the  dielectric  constant  of  the  medium  and  the  crystallo¬ 
graphic  radii  of  ions  as  the  external  information,  we  have  been  able  to  correctly 
describe  most  of  the  ion  solvation  in  water. 
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Abstract 

The  PM 3  quantum-mechanical  method  has  been  used  to  study  large  water  clusters  ranging  from  8  to 
42  water  molecules.  These  large  clusters  are  built  from  smaller  building  blocks.  The  building  blocks 
include  cyclic  tetramers,  pentamers,  octamers,  and  a  pentagonal  dodecahedron  cage.  The  correlations 
between  the  strain  energy  resulting  from  bending  of  the  hydrogen  bonds  formed  by  different  cluster 
motifs  and  the  number  of  waters  involved  in  the  cluster  are  discussed.  The  pm3  results  are  compared 
with  tip4p  potential  and  ab  initio  results.  The  number  of  net  hydrogen  bonds  per  water  increases  with 
the  cluster  size.  This  places  a  limit  on  the  size  of  clusters  that  would  fit  the  Benson  model  of  liquid  water. 
Many  of  the  20-mer  clusters  fit  the  Benson  model  well.  Calculations  of  the  ion  cluster  (H20)4o(H30+)2 
reveal  that  the  mje  ratio  obtainable  by  mass  spectrometry  experiments  can  uniquely  indicate  the  con¬ 
formation  of  the  20  water  pentagonal  dodecahedron  cage  present  in  the  larger  clusters.  ©  1994  John  Wiley 
&  Sons,  Inc. 


Introduction 

In  this  paper  we  discuss  large  water  clusters  modeled  by  the  pm 3  method  and  by 
model  building.  Gas  phase  ionic  water  clusters  are  experimentally  observable  and 
their  structures  can  be  determined  by  experimental  and  theoretical  means  [1-9]. 
The  (H20)2iH+  cluster  ion  is  a  prominent  “magic  number  entity”  as  detected  with 
numerous  experimental  conditions  [1-3].  Definitive  experimental  evidence  of  the 
pentagonal  dodecahedron  structure  was  provided  by  Wei  et  al.,  who  used  trimeth- 
ylamine  (TMA)  molecules  as  tags  to  identify  dangling  hydrogens  on  water  clusters 
[6].  They  made  mixed  neutral  clusters  by  expanding  vapor  containing  water  and 
TMA  through  a  pulsed  nozzle,  ionizing  the  clusters  by  multiphoton  ionization, 
and  separating  the  product  ions  by  time-of-flight  mass  spectrometry.  The  results 
of  this  experiment  revealed  that  the  (H20)21(TMA)mH+  ion  distribution  was  a  max¬ 
imum  when  m  =  10  with  an  abrupt  drop  in  intensity  when  m=  11.  This  experiment 
proved  that  the  (H20)2iH+  ion  has  a  total  number  of  10  hydrogen  bonding  sites 
available  for  TMA  to  bind.  The  deformed  pentagonal  dodecahedron  is  the  only 
possible  structure  for  the  (H20)2iH+  ion  that  provides  a  total  of  10  dangling  hydrogen 
atoms  [2]. 

Aside  from  this  one  peak,  there  are  many  more  peaks  in  the  mass  spectrum 
obtained  by  Castleman’s  group.  Higher-order  clusters  such  as  (H2O)40(H3O+)2  will 
have  unique  mass/charge  (m/e)  ratios  depending  on  the  unique  hydrogen  bonding 
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pattern  of  each  particular  cluster.  The  TMA  tag  experiment,  in  conjunction  with 
theoretical  predictions,  can  be  used  to  determine  the  structures  of  the  individual 
(H20)2o  clathrate  cage  structures  that  compose  the  higher  order  cluster.  There  have 
already  been  reported  in  the  literature  three  isomers  of  the  (H2O)20  clathrate  cage 
structure  [7,8].  In  this  paper  we  present  results  of  pm3  calculations  and  model 
building  studies  that  show  that  the  m/e  ratio  of  higher  order  water  cluster  ions, 
tagged  with  TMA,  should  allow  for  the  identification  of  the  clathrate  cage  isomers 
used  as  building  blocks  in  forming  the  higher  order  clusters. 

Benson  and  Siebert  have  proposed  that  liquid  water  is  composed  of  polycyclic 
octamers  and  tetramers  [10].  This  model  has  been  introduced  in  order  to  explain 
the  anomalous  heat  capacity  of  liquid  water,  which  requires  more  than  two  hydrogen 
bonds  per  water  molecule  to  model  the  heat  capacity  of  liquid  water  from  0  to 
100°C.  Tsai  and  Jordan  have  studied  water  clusters  (H20)„  from  n  =  8  to  n  =  20 
by  ab  initio  and  tip4p  force  field  calculations  [9].  They  show  that  the  stability  of 
water  clusters  is  driven  by  the  number  of  hydrogen  bonds;  the  driving  force  to 
maximize  the  number  of  hydrogen  bonds  more  than  compensates  for  the  ring 
strain  present  in  some  of  their  clusters.  We  have  extended  their  work,  and  present 
the  results  of  PM3  calculations  of  large  water  clusters  and  their  building  blocks.  We 
also  discuss  the  implications  of  the  clusters  described  here  with  the  model  of  Benson 
and  Siebert  for  liquid  water. 

The  semiempirical  quantum-mechanical  methods  developed  by  Dewar  and  co¬ 
workers  [11-14]  have  been  successful  at  reproducing  molecular  energies,  molecular 
structures,  and  modeling  chemical  reactions.  Of  these  methods,  only  the  pm3  method 
[14]  has  successfully  described  intermolecular  hydrogen  bonding  of  neutral  mole¬ 
cules  [14-18].  PM3  correctly  models  the  hydrogen  bonding  of  small  water  clusters 
{n  =  2-5)  [14-16]  and  the  (H20)2jH+  ion  [8],  whereas  the  AMl  method  is  not 
able  to  correctly  model  the  geometries  of  either  the  water  dimer  or  small  water 
clusters  [15-24]. 

The  ability  of  the  pm3  Hamiltonian  to  model  intermolecular  hydrogen  bonding 
between  neutral  molecules  has  been  attributed  to  the  reduction  of  two-center  re¬ 
pulsive  forces  brought  about  by  the  parameterization  of  the  Gaussian  core-core 
interactions  [16].  We  have  used  pm3  to  study  higher  order  clusters  of  the  (H20)2iH+ 
system  as  well  as  larger  neutral  clusters  of  water  ( n  =  20-40)  based  on  octamer  and 
pentamer  building  blocks. 


Methods 

All  water  clusters  were  built  using  Chem  3D  Plus  software  (Cambridge  Scientific 
Computing,  Inc.,  Cambridge,  MA)  on  a  Macintosh  II  computer  (Apple  Computer, 
Inc.,  Cupertino,  CA).  Clusters  were  designed  with  hydrogen  bond  lengths  of  1.7  A 
and  with  the  lone  pairs  of  each  oxygen  oriented  towards  the  hydrogens  of  the 
neighboring  donor  molecules.  The  small  building  blocks  were  fully  geometry- 
optimized  with  the  PM3  method  [10]  using  MOPAC  6.0  software  [25].  The  keyword 
PRECISE  was  used  to  increase  the  criterion  for  terminating  all  optimizations 
100-fold  over  normal  MOPAC  limits  and  the  keyword  XYZ  was  used  to  run  these 
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calculations  in  cartesian  coordinates.  The  larger  clusters  (greater  than  20  waters) 
were  run  with  the  SPARTAN  software  package  (Wavefunction,  Irvine,  CA)  on  a 
Silicon  Graphics  Indy  workstation  (Mountain  View,  CA).  The  converge  option  was 
used  with  the  default  gradient  tolerance  values.  All  pm3  calculated  structures  were 
characterized  as  stationary  points  and  true  minima  by  calculating  the  vibrational 
frequencies. 

In  addition  to  the  optimization  calculations,  single  point  calculations  (SPC)  were 
performed  with  SPARTAN  to  determine  the  contribution  of  hydrogen  bond  energy 
of  the  H30+  ions  in  the  (H20)4o(H30+)2  cluster.  Single  point  calculations  in  SPAR¬ 
TAN  are  equivalent  to  lSCF  calculations  within  MOP  AC.  In  these  calculations, 
the  individual  H30+  and  (H2O)40  subunits  of  the  (H2O)40(H3O+)2  ion  are  “frozen” 
to  determine  the  heats  of  formation  for  these  species  in  the  42  mer.  This  is  done 
by  isolating  both  H30+  and  the  (H2O)40  subunits  in  the  exact  conformation  of  the 
(H2O)40(H3O+)2  ion,  followed  by  a  single  self-consistent-field  (lSCF  in  MOPAC  or 
SPC  in  SPARTAN)  calculation  to  determine  the  heats  of  formation  of  each  subunit. 
A  thermodynamic  cycle  was  used  to  contrast  hydrogen  bonding  with  ring  strain 
for  the  (H2O)40(H3O+)2  ion. 


Results  and  Discussion 

The  PM3  geometrical  results  for  an  (H2O)40(H3O+)2  ion  cluster  are  presented  in 
Figures  1  and  2.  Figure  1  displays  only  the  oxygen  atoms  of  the  geometry-optimized 
structure  in  the  top  of  the  figure,  with  the  hydronium  ions  shaded  and  with  oxygens 
involved  in  hydrogen  bonding  with  the  hydronium  ions  blackened.  In  the  bottom 
of  the  figure  the  two  ionic  subunits  (H2O)20(H3O+)  have  been  separated  at  the 
interface  and  rotated  90°.  The  hydronium  ion  is  bound  to  the  furthermost  pen¬ 
tagonal  face  from  the  interface  in  each  case.  Figure  2  illustrates  the  hydrogen  bonding 
interface  on  the  top  half  of  the  figure,  and  the  hydrogen  bonding  interactions  of 
the  hydronium  ion  on  the  bottom  half.  In  Figure  2  the  perspectives  of  the  top  and 
bottom  views  are  identical  to  those  in  Figure  1,  with  the  exception  of  a  slight 
rotation  of  the  interface.  Only  the  oxygens  and  hydrogens  involved  in  hydrogen 
bonding  of  the  interface  and  hydrogen  bonding  of  the  hydronium  ions  are  displayed. 
Figure  3  contains  the  pm3  energetic  results  for  the  42-mer  cluster  and  the  ther¬ 
modynamic  cycle  used  to  obtain  A Hu  A H2,  and  AH.  In  Table  I  the  structures  of 
the  three  unique  (H2O)40(H3O+)2  ions  composed  from  three  different  (H2O)20  build¬ 
ing  blocks  [7,8]  are  compared  to  predict  the  expected  m/e  ratio  for  these  different 
doubly-charged  ions  in  a  mass  spectrometry  experiment. 

Figure  4  contains  the  two  octamer  building  blocks  used  to  build  many  of  the 
larger  clusters  displayed  in  Figures  5  and  6.  We  have  built  on  the  previous  work  of 
Tsai  and  Jordan  and  have  retained  their  nomenclature  [9].  Figure  5  contains  pm3 
optimized  20-mers,  and  Figure  6  contains  three  pm3  optimized  40-mers.  Only  the 
oxygen  atoms  are  shown  in  Figure  6  for  clarity.  Table  II  contains  the  PM3  heats 
of  formation,  association  energies,  and  hydrogen  bonding  information  for  the  clus¬ 
ters  depicted  in  Figures  4-6,  as  well  as  the  previously  reported  pentagonal  dode¬ 
cahedral  cage  structure  for  (H2O)20  [8]. 
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Left  Hand  Side 


Right  Hand  Side 


Figure  1 .  The  PM 3  optimized  structure  for  the  ion  cluster  (H20)4o(H30+)2  illustrating  the 
oxygen  atoms  and  the  interface  of  the  two  pentagonal  dodecahedrons.  Only  oxygens  are 
displayed,  and  the  blackened  oxygens  represent  oxygens  that  serve  as  hydrogen  bond  ac¬ 
ceptors  for  the  shaded  hydronium  ion.  The  bottom  figure  shows  the  two  pentagonal  faces 
that  bind  to  the  hydronium  ion.  To  obtain  the  bottom  view  the  interface  was  broken  and 
the  two  cages  were  rotated  approximately  90°. 


The  (IT20)4o(H30+)2  Ion  Cluster 

This  structure  was  made  from  two  Cage  structures  (Fig.  6)  joined  together  by 
matching  a  pentagonal  ring  on  one  20-mer  with  its  complement  on  the  second  20- 
mer.  In  this  cluster  there  are  four  hydrogen  bond  donors  on  a  pentagonal  face  on 
the  right-hand  side  matching  the  complement  of  one  hydrogen  bond  donor  on  the 
left-hand  side  (Fig.  2).  Then  two  hydronium  ions  were  placed  in  the  center  of  each 
cage.  The  ion  structure  would  not  optimize  until  both  hydronium  ions  were  displaced 
slightly  from  the  centers  of  the  cages.  Figure  1  reveals  that  the  two  hydronium  ions 
migrated  to  the  pentagonal  faces  furthermost  from  the  interface.  The  detailed  hy¬ 
drogen  bonding  interactions  of  the  hydronium  ions  with  its  cage  is  displayed  at  the 
bottom  of  Figure  2.  These  two  patterns  are  different  from  the  six  seen  previously 
for  the  (H20)2iH+  cluster  [8].  Using  the  nomenclature  of  Jeffrey  and  Saenger  [26], 
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Figure  2.  Top:  the  hydrogen  bond  interface  between  the  two  cage  structures,  showing  all 
oxygens  and  hydrogens  involved  in  hydrogen  bonds  that  make  the  interface.  Bottom:  the 
detailed  hydrogen  bonding  interactions  of  the  hydronium  ions.  The  view  is  the  same  as 

in  Figure  1  (bottom). 


the  hydronium  ion  on  the  left-hand  side  of  the  figure  has  three  three-center  bifurcated 
donor  hydrogen  bonds  and  one  three-centered  bifurcated  acceptor.  The  first  three- 
center  hydrogen  bond,  with  hydrogen  bond  lengths  of  1 .8 1 8  and  1 .8 1 3  A,  is  known 
as  a  bifurcated  donor  since  one  hydrogen  donates  to  two  acceptors.  The  hydronium 
ion  also  forms  two  more  three-center  bifurcated  donor  hydrogen  bonds  (1.783  and 
1.832  A,  1.75 1  and  1,735  A).  The  one  three-center  bifurcated  acceptor  has  hydrogen 
bonds  of  1.813  and  1.783  A.  As  noted  by  Jeffrey  and  Saenger  [26],  three-center 
bifurcated  acceptors  only  occur  in  conjunction  with  three-center  bifurcated  donors. 
The  three-center  bifurcated  acceptors  seen  here,  as  in  the  (H20)2iH+  system  [8], 
appear  to  be  an  accidental  occurrence  necessitated  by  the  hydrogen  rich  environment 
of  the  hydronium  ion.  The  hydronium  hydrogen  bond  distances  and  angles  range 
from  1.735  to  1.832  A  and  98°  to  152°,  consistent  with  the  kinds  of  geometries 
observed  in  the  (H20)2iH+  system  [8].  The  three-center  bifurcated-acceptor  hydro- 
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Single  Point 

AHf<Cage)2  =  -2340.08  "Frozen"  AHf(H30+)  =  160.16, 160.11 
(Cage)2  +  2H30+ 

AHj  =  -20088^/^ 

40H2O  +  2H3O+  — 

£AHf= -1819.05 

AHf  (H20)  =  -53.43 

AHf  (H30+)  =  159.077 

Figure  3.  Thermodynamic  cycle  and  energetic  information  for  the  formation  of  the 

(Cage)2(H30+)2  ion. 

gen  bond  is  rarely  observed  in  the  crystalline  state  and  is  not  found  in  any  of  the 
ice  structures  [26].  When  the  bifurcated-acceptor  hydrogen  bond  is  found,  as  in 
some  nucleoside  crystal  structures,  it  is  always  in  conjunction  with  three-center 
bonds  [26].  The  hydronium  ion  on  the  right-hand  side  of  Figure  2  has  two  three- 
center  bifurcated  donor  and  one  two-center  hydrogen  bond.  Bond  distances  and 
angles  range  from  1.745  to  1.871  A  and  128°  to  157°. 

The  hydrogen  bond  interface  that  joins  two  20-mer  cages  together  to  make  the 
40-mer  is  depicted  in  the  top  half  of  Figure  2.  The  bond  lengths  and  bond  angles 
for  the  interface  range  from  1.766  to  1.806  A  and  146°  to  155°.  An  important 
feature  of  this  interface  is  that  while  the  (H20)2iH+  cage  has  10  dangling  hydrogens 
[8],  the  (H2O)40(H3O+)2  cluster  has  20  minus  5,  or  15  dangling  hydrogen  bonds  as 
five  are  lost  in  the  interface  of  the  two  cages.  The  mass  spectrometric  experiments 
performed  by  Castleman’s  group  showed  that  the  (H20)2JH+  cage  had  10  dangling 
hydrogens  because  of  the  large  peak  observed  at  the  mje  ratio  corresponding  to  1 0 
trimethylamine  (TMA)  molecules  bound  to  the  21-mer  [6].  Since  the  pentagonal 
dodecahedron  cage  is  the  only  possible  structure  for  the  (H20)2iH+  ion  that  provides 
a  total  of  10  dangling  hydrogen  atoms,  the  mass  spectrometric  experiments  identified 


Table  I.  Comparison  of  three  different  (H20)4o(H30+)2  structures,  built  from  the  building  blocks 
described  in  references  seven  and  eight. 


(Isomer  I)2(H30+)2 

(Isomer  II)2(H30+)2 

(Cage)2(H30+)2 

Number  of  hydrogen  bonds 

per  interface 

2 

3 

5 

Dangling  hydrogens 

available  for  TMA  sites 

18 

17 

15 

Mass/charge  ratio 

910.3 

881.8 

822.7 

370  05  ►  (Cage)2(H30*)2 
AHf  =-2188.98 
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Figure  4.  The  D2d  and  S4  octamer  structures.  Coordinates,  axes  of  rotation,  and  mirror 

planes  are  shown. 


a  pentagonal  dodecahedron  composed  of  20  waters  with  a  hydronium  ion  in  the 
middle  as  the  species  formed  in  their  apparatus  [6].  We  note  that  the  (H20)4o(H30+)2 
doubly-charged  ion  with  1 5  dangling  hydrogen  tags  for  TMA  will  have  a  unique 
m/e  ratio,  as  would  the  (H20)6o(H30+)3  triply-charged  ion  with  20  available  tags. 
The  number  of  available  tags  is  dependent  on  the  dodecahedral  clathrate  isomer 
used  as  a  building  block  for  larger  clusters. 

Cioslowski  and  Nanayakkarc  performed  the  first  calculations  on  the  (H2O)20 
system  [7].  They  used  the  HF/6-311G**  level  of  ab  initio  theory  within  the  C5 
symmetry  point  group  to  calculate  two  quite  different  arrangements  of  dangling 
hydrogens  for  the  pentagonal  dodecahedral  cage.  The  two  isomers  are  separated  by 
only  0.4  kcal  mol-1,  and  consist  of  structures  composed  of  two  regular  pentagons 
and  10  pentagons  with  three  short  and  two  long  edges  each  [7].  Our  model  building 
studies  reveal  that  using  isomer  I  or  isomer  II  as  building  blocks  to  form  the 
(H2O)40(H3O+)2  ion  will  have  different  numbers  of  hydrogen  bonds  involved  in  the 
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Figure  5.  pm3  optimized  structures  (MOPAC)  of  neutral  water  clusters  composed  of  20 

water  molecules. 


interface,  thus  changing  the  number  of  sites  available  for  TMA  tags.  Dimers  of 
isomer  I  will  have  an  interface  of  two  hydrogen  bonds  and  dimers  of  isomer  II  will 
have  an  interface  of  three  hydrogen  bonds.  Hence,  as  shown  in  Table  I,  dimers  of 
isomer  I  will  have  18  dangling  hydrogen  atoms  available  for  TMA  labeling,  while 
dimers  of  isomer  II  will  have  17  TMA  tag  sites.  The  m/e  ratio  of  the  three 
(H20)4o(H30+)2(TMA)/2  ions  will  be  unique  (Table  I),  allowing  a  study  of  larger, 
multiply  charged  water  clusters.  Observation  of  any  of  the  three  m/e  ratios  in  the 
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(Ring5b)2 

Figure  6.  pm3  optimized  structures  (SPARTAN)  of  neutral  water  clusters  composed  of 

40  water  molecules. 


mass  spectrum  would  be  evidence  that  doubly  charged  clusters  exist  and  would 
allow  for  the  identification  of  the  (H2O)20  building  blocks  used  to  form  the  larger 
cluster. 

A  thermodynamic  cycle  has  been  used  to  examine  the  hydrogen  bond  energetics 
of  the  (H20)4o(H30+)2  ion.  Figure  3  illustrates  the  cycle  where  the  overall  reaction 
of  40  waters  and  two  hydronium  ions  reacting  to  make  the  42-mer  can  be  calculated 
in  two  steps.  Step  1  is  the  conversion  of  the  40  water  molecules  and  the  hydronium 
ions  into  the  (H2O)40  cluster  and  the  two  H30+  ions,  all  “frozen”  in  the  conformation 
of  the  final  42-mer  structure.  Step  2  is  the  formation  of  the  42-mer  from  these  two 
“frozen”  structures.  The  heats  of  formation  of  the  “frozen”  structures  were  calculated 
by  SPC  on  the  geometries  of  the  separated  species  from  the  geometry  optimized 
(H2O)40(H3O+)2  ion.  The  sum  of  these  three  heats  of  formation  yields  the  single¬ 
point  “frozen”  energies  in  Figure  3.  Figure  3  contains  all  energetic  information 
obtained  from  geometry  optimization  and  from  the  SPC.  Also  shown  are  the  AHs 
for  each  step. 
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Table  II.  Comparison  of  different  water  cluster  geometries  displayed  in  Figures  1,  4-6.  Energies  in 

kcal  mol-1. 


No.  of  H  bonds 

Total  H 

per  water 

Association 

H-bond 

Structure 

PM  3  A  Hf  bonds 

molecule  Dangling  H 

energy 

energy 

(H20)8 

D2d 

-475.96 

12 

3.00 

4 

-48.52 

-4.04 

s4 

-475.75 

12 

3.00 

4 

-48.31 

-4.03 

(H2O)20 

(D2d)4 

-1206.99 

36 

3.60 

4 

-138.39 

-3.84 

(D2dS4)2 

-1206.64 

36 

3.60 

4 

-138.04 

-3.83 

(S4)4 

-1205.96 

36 

3.60 

4 

-137.36 

-3.82 

Ring5b 

-1205.33 

35 

3.50 

5 

-136.79 

-3.91 

Q 

-1206.66 

34 

3.40 

6 

-138.12 

-4.06 

Cage 

-1185.41 

30 

3.00 

10 

-116.99 

-3.90 

(H2O)40 

[Cage]2 

-2399.61 

65 

3.25 

15 

-262.53 

-4.04 

(D2d)9 

-2427.09 

76 

3.80 

4 

-290.01 

-3.82 

(Ring5b)2 

-2426.90 

75 

3.75 

5 

-289.82 

-3.86 

(H20)„  where 
n  >  40 

(Cage)2(H30+)2 

-2188.98 

The  enthalpy  of  reaction  for  step  1  (AHi)  is  the  sum  of  the  spc  “frozen”  heats 
minus  the  sum  of  the  40  waters  and  hydronium  ion  heats,  or  AH j  =  —200.88  kcal 
mol-1.  The  enthalpy  of  reaction  for  step  2  (A H2)  is  found  from  the  SPC  heats  and 
the  (H20)4o(H30+)2  heat  of  formation.  Formation  of  the  (H20)4o(H30+)2  cluster 
from  40  separated  waters  and  two  hydronium  ions  releases  370.05  kcal  mol-1.  The 
values  for  AHi  and  AH2  reveal  that  200.88  kcal  mol-1  are  released  in  making  the 
(Cage)2  structure  and  straining  the  hydronium  ions  and  that  169.17  kcal  mol-1  are 
released  when  the  hydronium  ions  are  bound  to  the  40-mer.  Compared  to  the 
previously  reported  system  [7,8],  there  is  less  strain  in  the  (Cage)2  structure  and 
less  energy  is  released  upon  binding  hydronium  ions. 

Neutral  Water  Clusters 

The  (H20)8  cluster  in  its  most  stable  form  has  a  cubic  structure  which  maximizes 
the  number  of  hydrogen  bonds  [9],  The  two  most  stable  forms  have  D2d  and  S4 
symmetry,  and  they  are  depicted  in  Figure  4.  The  pm  3  optimized  D2d  structure  is 
0.21  kcal  mol-1  more  stable  than  S4  (Table  II),  in  agreement  with  MP2/aug-ccp 
vdz*  ab  initio  calculations  [9]  that  place  the  D2d  structure  0.2  kcal  mol-1  more 
stable  than  S4.  We  have  used  the  D2d  and  S4  octamers  as  building  blocks  for  many 
of  the  larger  neutral  water  clusters  reported  in  this  paper.  The  20-mers  displayed 
in  Figure  5  are  very  similar  in  energy  (Table  II).  Our  results  are  in  general  agreement 
with  the  tip4p  force  field  calculations,  which  predict  stabilities  of  the  20-mers  of 
(D2d)4  >  (D2dS4)2  >  (S4)4  >  Ring5b  >  Q  >  Cage.  The  exception  is  that  pm3  predicts 
that  the  Q  structure  is  more  stable  [(D2d)4  >  C;  >  (D2dS4)2  >  (S4)4  >  Ring5b  > 
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Cage],  perhaps  because  the  tip4p  potential  exaggerates  the  stability  of  the  fused 
cubic  species  relative  to  structures  containing  five-  and  six-membered  rings  [9]. 
The  40-mers  displayed  in  Figure  6  include  the  (Cage)2  parent  cluster  for  the 
(H20)4o(H30+)2  cluster  discussed  above,  and  two  long  repeating  cubes  and  penta¬ 
gons.  The  (D2d)9  structure  has  one  more  hydrogen  bond  and  is  0.192  kcal  mol-1 
more  stable  than  the  (Ring5b)2  structure.  The  (Ring5b)2  cluster  has  the  interesting 
feature  that  it  is  starting  to  twist  around  the  long  axis  of  the  cluster. 

Benson’s  Two-Structure  Model  for  Liquid  Water 

In  their  recent  paper,  Benson  and  Siebert  argue  that  the  heat  capacity  data  for 
liquid  water  can  only  be  modeled  if  fluid  water  is  composed  of  discrete  units, 
(H20)„,  which  can  move  independently  of  each  other  [10].  They  argue  that  to 
account  for  the  energy  of  vaporization  of  water  (9.9  kcal  mol-1)  requires  breaking 
1.7  hydrogen  bonds  per  water  molecule  at  300  K.  Each  water  molecule  in  a  cubic 
octamer  (Fig.  4)  shares  three  hydrogen  bonds  with  other  waters,  for  a  net  of  1.5 
hydrogen  bonds  per  water  molecule  at  300  K.  Our  results  show  that  larger  clusters 
have  higher  net  hydrogen  bonds  per  water  molecule.  For  the  different  water  clusters 
displayed  in  Figure  5,  Table  II  tabulates  the  number  of  hydrogen  bonds  per  water 
molecule.  Dividing  this  number  by  2  yields  the  net  number  of  hydrogen  bonds. 
The  (D2d)4,  (D2dS4)2,  and  (S4)4  clusters  (made  from  octamers)  all  have  a  net  of  1.8 
hydrogen  bonds,  Ring5b  (pentamers)  has  1.75,  Cf  (pentamers  and  an  octamer)  has 
1 .7,  and  the  cage  has  1.5  net  hydrogen  bonds.  These  values  for  net  hydrogen  bonds 
fit  nicely  into  the  Benson/Siebert  model,  which  requires  breaking  1.7  hydrogen 
bonds  to  vaporize  water. 

Clusters  based  on  40  waters  have  higher  net  hydrogen  bonds  than  smaller  clusters. 
The  (Cage)2,  (D2d)9,  and  (Ring5b)2  clusters  have  net  hydrogen  bonds  of  1.625,  1.9, 
and  1.875,  respectively.  An  80  water  cluster  made  of  pentamers  (Ring5b)4  would 
have  3.875  hydrogen  bonds  per  water  and  a  net  of  1 .94.  This  increase  in  net  hydrogen 
bonds  with  increases  in  cluster  size  is  because  there  are  four  shared  hydrogen  bonds 
for  each  water  molecule  in  the  inside  region  of  water  clusters  made  from  pentamers 
and  tetramers,  but  at  the  ends  there  are  only  three.  Increasing  the  cluster  size  will 
always  increase  the  number  of  hydrogen  bonds  per  water  and  thus  the  net  hydrogen 
bonds.  These  results  suggest  that  if  the  Benson/Siebert  model  is  applicable,  water 
clusters  will  be  limited  in  size  as  the  vaporization  of  water  requires  the  breaking  of 
1.7  hydrogen  bonds  per  water  molecule. 


Conclusions 

The  PM 3  quantum-mechanical  method  is  a  low-cost  yet  reasonably  accurate 
method  for  modeling  water  clusters.  The  two  main  results  of  our  research  are:  First, 
PM  3  results  on  the  (H2O)40(H3O+)2  ion  indicate  that  mass  spectrometry  combined 
with  trimethylamine  tags  can  be  used  to  uniquely  indicate  the  conformation  of  the 
20  water  pentagonal  dodecahedron  cage  present  in  this  ion  and  for  other  multiply 
charged,  larger  clusters.  Second,  the  number  of  net  hydrogen  bonds  per  water  in¬ 
crease  with  the  cluster  size.  This  places  a  limit  on  the  size  of  clusters  that  would 
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fit  the  Benson  model  of  liquid  water.  Many  clusters  of  20  waters  fit  the  Benson 
model  well. 
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Abstract 

The  extreme  sensitivity  of  the  absorption  spectrum  to  small  changes  in  the  medium  polarity  has  made 
Reichardt’s  dyes  useful  molecular  probes  in  the  study  of  micelle /solution  interfaces  and  phospholipid 
bilayers.  This  work  reports  preliminary  results  of  semiempirical  quantum  chemical  calculations  on  some 
conformations  of  2,6-diphenyl- ( 2,4,6-triphenyl- l-pyridinium)-A-phenoxide  betaine  (Reichardt’s  betaine, 
rb),  which  exhibits  negative  solvatochromic  effects.  We  have  used  the  amI  Hamiltonian  of  Dewar  in 
the  geometry  optimizations,  and  the  Intermediate  Neglect  of  Differential  Overlap  method  parameterized 
for  spectroscopy  (indo/S).  For  RB,  two  low-lying  conformations  have  been  found.  The  small  difference 
in  energy  between  them  suggests  that  both  forms  may  be  present  in  solution,  an  observation  confirmed 
by  calculations  on  the  spectra  using  the  SCRF  model:  the  superposition  of  the  calculated  spectra  for  these 
two  forms  matches  the  experimental  spectra  very  well.  For  nonpolar  solvents,  the  general  pattern  consists 
of  variation  of  Et(30)  concurrent  with  variation  of  the  dielectric  constant.  We  have  also  carried  out 
calculations  for  solvents  which  form  specific  (e.g.,  H-bond)  binding  to  the  solute,  namely  methanol  and 
water,  using  a  supermolecule  approach.  Our  results  are  in  excellent  agreement  with  the  experiment  and 
present  an  accurate  description  of  the  spectra.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

The  absorption  and  emission  spectra  of  a  substance  change  characteristics  (po¬ 
sitions  and  intensities)  on  going  from  the  gas  phase  to  solution.  These  differences 
largely  depend  on  the  solvent  and,  although  they  are  generally  small,  they  can  be 
very  large  in  some  cases  [1—3].  This  phenomenon  is  widely  known  as  solvatochro- 
mism  and  was  first  recognized  more  than  100  years  ago  by  Kundt  [1],  It  occurs 
because  dissolved  molecules  interact  differently  with  the  environment  in  their  ground 
and  several  excited  states.  These  interactions  are  not  always  easily  described  and 
this  subject  has  been  intensely  pursued  in  the  last  40  years:  dozens  of  books  and 
accounts,  and  innumerable  reports  have  been  devoted  to  it.  For  empirical  ap¬ 
proaches,  the  interested  reader  will  find  useful  material  in  Refs.  [3-5],  and  for  a 
thorough  description  of  theoretical  and  computational  aspects,  in  Refs.  [6-7], 
among  many  others.  From  the  empirical  point  of  view,  solvatochromism  has  been 
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used  to  construct  linear  solvation  free  energy  relationships  (lser)  that  give  access 
to  solvent  properties  important  to  explain  several  important  physicochemical  phe¬ 
nomena  as,  for  example,  chemical  kinetics,  conformation  or  tautomeric  equilibria, 
and  even  partition  coefficients  [3,4]. 

The  problem  of  theoretically  predicting  the  electronic  properties  of  organic  sub¬ 
stances  and  their  observed  spectra  in  solution  has  interested  and  challenged  theo¬ 
retical  chemists  for  more  than  four  decades.  This  field  has  been  very  active  especially 
in  the  last  few  years  and  Rauhut,  Clark,  and  Steinke  [8]  have  recently  given  a 
summary  of  recent  literature.  Following  recent  work  in  this  lab  on  solvent  effects 
[9]  we  decided  to  examine  the  electronic  spectrum  of  one  of  the  most  widely  used 
solvatochromic  dyes,  2,6-diphenyl-  ( 2,4,6-triphenyl- 1  -pyridinium)  - TV-phenoxide 
(Reichardt’s  betaine  #1,  a.k.a.  Reichardt’s  dye  #1,  RB,  Fig.  1 )  [10-18].  The  lowest 
energy  band  of  RB  undergoes  a  negative  solvatochromism  (hipsochromic  shift)  of 
about  360  nm  (approximately  10,000  cm-1)  in  going  from  diethyl  ether  to  water. 
This  is  the  basis  of  the  very  popular  £’r(30)  scale  (or  its  equivalent  Ej-  scale) 
[3-5,10-16]. 


dye  #1,  R  =  H 
dye  #2,  R  =  SO2CH3 

Figure  1 .  Reichardt’s  dye  #1  (  rb  ) . 
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Method 

We  initially  performed  a  search  of  low-lying  conformations  of  rb  with  geometry 
optimizations  at  the  SCF  level  using  the  AMl  [19-21]  model  Hamiltonian,  within 
the  ampac  package  [22].  Keywords  precise  and  GNORM  =  0.05  were  used  in 
order  to  obtain  the  smallest  residual  gradient  possible.  Solvent  effects  were  included 
in  our  study  at  two  levels:  in  a  first  approximation,  the  bulk  was  simulated  by  a 
self-consistent  reaction  field  (SCRF)  within  the  continuum  model  [23-28  ] .  We  use 
in  this  work  the  SCRF  formulation  as  given  by  Karelson  and  Zerner  [9]:  in  model 
A  of  this  reference,  we  write  the  energy  of  the  universe  ( molecule  +  environment ) 


Eu  =  E°  1  -  (e,  «o)<^ImI^)<’AImI’A)  O) 

2  \  e  J  do  2 

with  Q  the  net  charge  of  the  solute  and  f  the  reaction  field  tensor.  We  now  form 
the  functional 


L  =  El 


i\  Q2  l 


C  a0  2 


l(e,  flo) 


x  <^|mI^)<^ImI^>  -  -  l)  (2) 

with  W the  Lagrange  multiplier  ensuring  normalization  and  E0  the  gas  phase  energy 

E°  =  (*\H0\t)  (3) 

where  H0  is  the  Hamiltonian  for  the  isolated  solute  molecule.  Applying  the  varia¬ 
tional  theorem  to  Eq.  (2)  leads  to  the  Schrodinger  equation  for  the  state  |^) 

(H0  -  ?<^I5I^X5»I^>  =  W*  |^>  (4) 

with  the  Fock  operator  for  the  electron  k  given  by 

f(k)  =fo(k)  -  (5) 

and  the  energy  of  the  universe  is  then 

Eu=  W+Ec,c=  W+X-t’  l<’/'l?l^>|2  (6) 

where  the  “solvent  cost”  Ec>c  is  a  correction  that  express  the  energy  lost  by  the 
solvent  in  dissolving  the  solute.  Note  that  the  nuclear  repulsion  energy  and  the 
Born  term, 

(7) 

A  B>A  RAB  2  \  C)  d0 


must  be  added.  In  this  work  we  use  an  alternative  procedure  that  includes  the 
solvent  relaxation,  model  B  of  Ref.  [9]:  we  put  a  second  Lagrange  constraint  in 
Eq.  (2)  such  that 
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and  the  Fock  operator  for  electron  k  turns  out  to  be 

f(k)=Mle)-yf-(t\Z\f)il(k)  (9) 

an  operator  that  yields  Eq.  (1)  directly  (since  the  interaction  term  includes  the 
1/2).  The  cavity  radius  a0  (6.02  A  for  isolated  rb)  was  determined  from  mass 
density.  In  addition,  when  necessary,  we  have  also  used  the  supermolecule  approach 
[29-31]  by  including  at  least  two  solvent  molecules  around  the  solute  (RB). 

The  absorption  spectra  were  obtained  (at  the  AMl  and  amI-scrf  geometries) 
by  the  spectroscopic  version  of  the  Intermediate  Neglect  of  Differential  Overlap 
(INDO/S)  method  [32-37]  .  The  two  center,  two  electron  integrals  are  obtained 
from  the  modified  Mataga-Nishimoto  formula  [38] 

7 ab  -  T777  T1  r;'n  ( 10) 

in  which  the  Weiss  parameter  fy  was  chosen  equal  to  1 .2  in  order  to  reproduce  the 
spectrum  of  aromatic  compounds  [32].  The  configuration  interaction  (Cl)  calcu¬ 
lations  included  all  single  excitations  from  the  19  highest  occupied  mo’s  to  the  12 
lowest  unoccupied  MO’s,  plus  the  ground  state,  a  total  of  229  configurations. 

Results  and  Discussion 


Results  in  Gas  Phase  (AMl ) 

Table  I  presents  a  summary  of  our  AMl  results  for  rb.  Conformers  A  (Fig.  2) 
and  A'  differ  only  by  a  rotation  of  one  of  the  phenyl  rings  by  about  180°,  and  have 
virtually  the  same  energy.  Conformer  B  (Fig.  3)  was  calculated  to  be  0.1  kcal/mol 
more  stable  in  gas  phase,  and  this  suggests  that  these  conformers  coexist  in  gas 
phase.  In  solution  (under  the  scrf  approximation  [23-28]),  differences  between 
A,  A',  and  B  conformers  are  less  than  1  kcal/mol  on  going  from  ethanol  to  water. 
This  suggests  that  these  conformers  likely  also  coexist  in  solution.  If  this  is  the  case, 
then  the  observed  spectral  properties  must  arise  from  a  combination  of  the  spectral 
properties  of  all  conformers.  We  assumed  that  the  contributions  of  both  forms 
should  be  approximately  the  same  ( regardless  of  the  existence  of  other  conforma¬ 
tions),  so  the  observed  spectra  are  compared  in  this  article  with  the  average  of  the 
spectra  of  the  two  forms,  A  and  B. 

The  INDO/S-CIS  calculated  spectrum  of  RB  in  gas  phase  is  shown  in  Tables  II 
(Conformer  A,  Fig.  2)  and  III  (Conformer  B,  Fig.  3).  There  is  no  gas  phase  spectrum 
available  in  the  literature  (there  is  little  vapor  pressure),  so  we  compare  our  gas 
phase  calculations  to  the  results  obtained  for  a  nonpolar  solvent,  namely,  1,4-dioxane 
[3],  and  recognize  the  limitations  of  this  comparison.  In  gas  phase,  the  lowest 
excited  singlet  state  is  mostly  HOMO  -►  lumo  and  was  calculated  at  1 1,480  cm-1 
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Table  I.  Summary  of  amI-scrf  results  for  Reichardt’s  dye  #1  (rb). 


Medium 

e 

Dihedral  angles  (°) 

C  X 

33-32-9-8  15-14-2-1  21-20-4-3 

A  Hf 

(kcal/mol) 

m(£) 

IP  (eV) 

Vacuum 

A 

1.000 

-39.0 

128.4 

-37.9 

202.8 

13.005 

6.938 

A' 

1.000 

-39.1 

-55.2 

-37.8 

202.8 

12.979 

6.939 

B 

1.000 

-140.8 

-54.1 

-37.9 

202.9 

13.094 

6.923 

A  Conformation 
dioxane 

2.209 

-38.4 

131.2 

-36.9 

200.2 

15.883 

6.950 

ethanol 

24.30 

-38.1 

130.0 

-34.4 

195.4 

19.064 

7.046 

CH3OH 

32.63 

-38.1 

130.0 

-34.4 

195.3 

19.121 

7.052 

CH3CN 

35.94 

-38.0 

130.0 

-34.4 

195.2 

19.134 

7.054 

H20 

78.54 

-38.1 

129.9 

-34.4 

195.0 

19.225 

7.062 

A'  Conformation 
CH3OH 

32.63 

-38.1 

-52.9 

-34.5 

195.3 

19.116 

7.053 

CH3CN 

35.94 

-38.1 

-52.9 

-34.5 

195.2 

19.129 

7.054 

H20 

78.54 

-38.1 

-54.3 

-34.4 

195.1 

19.191 

7.062 

B  Conformation 
dioxane 

2.209 

-140.8 

-53.9 

-37.9 

200.0 

14.905 

6.963 

CH3OH 

32.63 

-141.7 

-52.7 

-34.6 

195.1 

19.131 

7.043 

CH3CN 

35.94 

-141.7 

-52.7 

-34.6 

195.1 

19.146 

7.044 

h2o 

78.54 

-141.7 

-52.9 

-34.4 

194.8 

19.267 

7.052 

(Conformer  A)  and  1 1,220  cm-1  (Conformer  B),  to  be  compared  to  the  experi¬ 
mental  value  of  12,580  cm-1  in  1,4-dioxane  [3,10-18]. 

A  broad  shoulder  is  observed  between  about  20,000-22,200  cm"1 .  We  assign  it 
to  our  next  two  calculated  transitions:  HOMO  —►  LUMO  +  1,  calculated  at  17,140 
cm"1  (conformer  A)  and  16,920  cm"1  (conformer  B),  and  HOMO-1  -►  LUMO, 
calculated  at  24,180  cm"1  (conformer  A)  and  24,690  cm"1  (conformer  B).  Both 
of  these  excitations  are  of  medium  strength.  Their  oscillator  strength  average  is  at 
about  19,400  cm-1 ,  to  be  compared  to  the  observed  maximum  of  the  shoulder  at 
about  21,000  cm"1  in  1,4-dioxane  [3,10-18].  The  next  observed  peak  is  at  25,000 
cm"1  and  is  strong.  This  compares  very  well  with  calculated  values  of  25,560  cm  1 
(conformer  A)  and  25,190  cm"1  (conformer  B). 

Finally,  we  consider  a  broad,  intense  band,  observed  with  maximum  at  32,260 
cm"1  with  a  slight  shoulder  at  30,300  cm"1 .  Both  conformers  are  calculated  to  have 
a  great  many  transitions  in  this  region.  At  about  31,500  cm  1  (317  nm),  both  have 
strong  transitions  and  both  have  strong  transitions  at  about  34,500  cm  1 .  Experi¬ 
mentally  this  band  shows  little  solvent  shift  [  3  ] . 

We  conclude  that  the  absorption  spectrum  as  a  whole  is  well  described,  as  one 
can  seen  by  comparing  Tables  II  and  III  with  the  observed  spectrum  of  RB  in  1,4- 
dioxane  [3].  The  most  important  peak  for  our  purposes  is  the  first,  observed  at 
10,500  cm"1  [3].  We  calculate  1 1,480  cm"1  for  conformer  A  and  1 1,220  cm"1  for 
conformer  B. 
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Figure  2.  Reichardt’s  dye  #1,  conformer  A,  in  gas  phase,  am  1  optimized  geometry. 


Solvent  Effects:  1,4-Dioxane 

Our  next  step  was  the  consideration  of  solvent  effects.  One  of  our  immediate 
goals  was  to  establish  the  procedure  that  could  be  applied  to  a  large  number  of 
solvents,  in  order  to  reproduce  Reichardt’s  £>(30)  scale  [3-5,10-16]  and  to  check 
our  understanding  of  this  scale.  For  the  nonpolar  solvents,  that  have  no  specific 
bonding  with  the  solute  (RB),  the  continuum  approximation  [23-28]  should  be 
enough  to  reach  agreement  with  the  observed  spectra.  However,  we  have  to  apply 
some  caution,  because  even  for  the  nonpolar  solvents  the  dielectric  constant  vari¬ 
ation  does  not  concur  with  the  polarity  of  the  solvent  that  is  expressed  by  the 
Et(  30)  parameter.  For  example,  1,4-dioxane  (e  =  2.209)  is  more  polar  than  benzene 
( e  =  2.284)  and  acetonitrile  (c  =  35.94)  is  more  polar  than  dmso  (e  =  46.45)  on 
the  £r(30)  scale. 

The  indo/S-CIS  calculated  spectra  of  RB  (conformations  A  and  B)  in  1,4-dioxane 
treated  by  the  SCRF  approach  [9]  are  shown  in  Tables  IV  and  V.  The  first  band  is 
calculated  at  10,591  cm-1  (for  A)  and  11,871  cm-1  (for  B).  If  we  take  the  result 
for  B  (based  on  the  fact  that  the  intensity  is  twice  as  large),  the  agreement  with  the 
experiment  (12,600  cm-1  [3,10-16])  is  very  good. 

A  second  band  was  calculated  at  567  nm  =  17,640  cm-1  (conformer  A)  and  550 
nm  =  18, 193  cm-1  (conformer  B),  and  accounts  for  a  shift  of  about  600-900  cm-1 
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Figure  3.  Reichardt’s  dye  #1,  conformer  B,  in  gas  phase,  am  1  optimized  geometry. 


from  the  gas  phase  result  (see  above).  Again,  the  relation  between  oscillator  strengths 
enables  us  to  assign  this  band  to  the  shoulder  with  maximum  at  about  2 1 ,000  cm  1 . 

Comparison  of  Tables  II- V  show  that  the  third  and  fourth  transitions  undergo 
an  inversion  from  gas  phase  to  solution.  A  third  transition  (homo  -►  lumo  +  4, 
HOMO  -►  lumo  +  8)  is  now  calculated  at  25,220  cm-1  (conformer  A)  and  24,925 
cm-1  (conformer  B).  Both  numbers  match  very  well  the  sharp  peak  observed  at 
approximately  25,000  cm"1  =  400  nm. 

Again  we  found  a  series  of  five  very  weak  transitions  beginning  at  370  nm  (in¬ 
cluding  a  n  7T*  transition  from  the  lone  pair  of  oxygen).  This  system  is  not 
observed  in  1 ,4-dioxane,  but  the  band  appears  in  acetonitrile,  and  helps  demonstrate 
the  consistency  of  our  assignments.  Finally,  the  large,  broad  band  calculated  to 
begin  at  330  nm  is  nearly  unshifted  with  respect  to  the  gas  phase  spectrum  (compare 
Table  II  with  IV,  and  Table  III  with  V). 


Other  Nonpolar  Solvents:  The  Special  Case  of  Chloroform 

Other  solvents  in  this  range  of  ET(  30 )  follow  the  same  pattern:  in  general,  increase 
of  the  dielectric  constant  is  followed  by  an  increase  in  the  transition  energy  of  the 
first  band.  Therefore,  we  simply  give  the  calculated  values  in  Table  VI,  in  order  to 
avoid  repetition.  It  is  observed  that  the  value  of  ET(  30)  for  benzene  is  underestimated 
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Table  II.  Calculated  (gas  phase)  absorption  spectrum  of  Reichardt’s  dye  #1, 
conformer  A. 


A £a  (cm"1) 

f  b 

J  osc. 

Main  Cl  contributions  and  single  excitations 

1 1 483 c 

0.405 

+0.96901  (102  —  103)  (homo  —  lumo) 

171 38  d 

0.045 

-0.97468  (102  —  104)  homo  —  lumo  +  1) 

24180 

0.027 

-0.89879  (101  —  103)  (homo-1  —  lumo) 

25556® 

0.478 

-0.69212(102  —  107),  -0.59668(102—  111) 

26418 

0.054 

+0.61817  (102  —  105),  +0.51282  (102  —  108) 

26951 

0.022 

+0.75531  (102  —  1 10),  +0.49626  (102  —  109) 

27550 

0.016 

-0.74336  (102  —  108),  -0.41459  (102  —  109) 

27784 

0.038 

+0.63383  (102  —  111),  -0.54800  (102  —  107) 

27900 

0.058 

+0.83022  (100  —  103),  -0.35859  (102  —  1 1 1) 

30085 f 

0.049 

+0.53846  (102  —  105),  +0.52160  (99  —  103) 

31572f 

0.305 

-0.73333  (96  —  103),  +0.41061  (94  —  103) 

317 19f 

0.084 

-0.53730  (99  —  103),  +0.46658  (102  —  105) 

31901 

0.006 

+0.95613  (102  —  106) 

335 1 5 8 

0.012 

+0.67122  (101  —  104),  +0.48958  (102  —  113) 

33603 8 

0.047 

-0.58691  (102  —  1 13),  +0.43957  (102  —  1 12) 

33871 8 

0.015 

+0.86539  (102  —  114) 

34264® 

0.464 

+0.83022  (100  —  103),  -0.35859  (102  —  111) 

a  Transition  energies. 

b  Oscillator  strengths:  ftj  =  4.7092  X  A£,y<(/'|ji|y)2. 

Experimental  values  in  1,4-dioxane  [1],  (c)-(g). 
c  12,600  cm"1  =  795  nm. 
d  21,500  cm-1  =  465  nm. 
e  25,000  cm"1  =  400  nm. 
f  A  shoulder  at  31,000  cm"1  =  322  nm. 

8  A  broad  band  centered  at  approximately  33,330  cm"1  =  300  nm. 


by  ca.  2.0  kcal/mol  (700  cm-1 ).  The  solvents  1,4-dioxane  and  chloroform  present 
us  with  a  larger  error  (4.0-5.0  kcal/mole)  in  £V(30).  Since  this  error  is  not  systematic 
with  that  of  benzene,  we  might  speculate  that  very  weak  specific  interactions  begin 
to  affect  the  pattern  of  the  nonpolar  solvents.  To  check  this  hypothesis,  we  carried 
out  an  additional  SM  calculation  on  rb  (conformer  B)  surrounded  by  four  chlo¬ 
roform  molecules  (Fig.  4)  (we  have  chosen  chloroform  because  the  specific  inter¬ 
actions — H-bond  involving  the  CHCI3  proton — were  easier  to  figure  out  than  for 
the  case  of  1,4-dioxane).  The  resulting  spectrum  is  depicted  in  Table  VII.  Clearly 
the  H-bond  affects  the  n  -►  ir*  CT  transition,  and  the  corrected  value  of  ET(  30)  = 
38.8  kcal/mol  (see  Table  VII)  closely  matches  the  experiment.  The  sharp  peak 
observed  at  25,000  cm-1  for  1,4-dioxane  is  now  blue  shifted  by  1,000  cm-1 .  Thus, 
we  conclude  that  results  from  the  SCRF  are  indeed  perturbed  for  CHC13  (and  prob¬ 
ably  also  for  1,4-dioxane)  due  to  specific  binding,  even  though  the  H-bonding  is 
quite  weak. 
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Table  III.  Calculated  (gas  phase)  absorption  spectrum  of  Reichardt’s  dye  #  1 , 

conformer  B. 


A E*  (cm"1) 

f  b 

J  osc. 

Main  Cl  contributions  and  single  excitations 

1 1 2 1 8C 

0.392 

+0.96687  (102  —  103)  (homo  —  lumo) 

169 17d 

0.045 

-0.97398  (102  —  104)  (HOMO  —  LUMO  +  1) 

24688 

0.016 

+0.85674  (101  —  103)  (homo-1  —  LUMO) 

25 1 87 e 

0.500 

+0.57067  (102  -►111),  +0.59781  (102  —  107) 

26220 

0.056 

-0.62569  (102  —  105),  +0.52804  (102  —  108) 

26797 

0.016 

+0.76846  (102  —  1 10),  -0.45084  (102  —  109) 

27364 

0.011 

-0.75341  (102  —  108),  -0.44406  (102  —  109) 

27498 

0.001 

-0.91681  (100  —  103) 

27558 

0.052 

+0.74652  (102  —  1 1 1),  -0.61248  (102  —  107) 

29895 f 

0.047 

+0.54909  (102  —  105),  +0.50071  (99  —  103) 

31434f 

0.291 

-0.79422  (96  —  103),  -0.32512  (94  —  103) 

31567f 

0.125 

+0.60065  (99  —  103),  -0.43290  (102  —  105) 

31699 

0.005 

+0.96039  (102  —  106)  (homo  —  lumo  +  3) 

331408 

0.052 

-0.87124(102  —  112) 

33592e 

0.002 

+0.77394  (102  —  114) 

33592e 

0.002 

+0.87672  (101  —  104)  (HOMO-1  —  lumo  +  1) 

33887B 

0.012 

-0.74600  (102  ^  113),  -0.41668  (102  —  114) 

34658® 

0.529 

-0.31631  (95  —  107) 

a  Transition  energies. 

b  Oscillator  strengths:  fj  =  4.7092  X  kEij(i\ii\j')2. 

Experimental  values  in  1,4-dioxane  [1],  (c)-(g). 
c  12,600  cm-1  =  795  nm. 
d  21,500  cm-1  =  465  nm. 
e  25,000  cm'1  =  400  nm. 
f  A  shoulder  at  31,000  cm-1  =  322  nm. 

8  A  broad  band  centered  at  approximately  33,330  cm-1  =  300  nm. 


H-Bonding  Solvents:  Methanol 

Inspecting  the  case  of  methanol  (Fig.  5)  is  interesting  because  its  dielectric  con¬ 
stant  ( e  =  32.63 )  is  very  close  to  that  of  acetonitrile  ( e  =  35.94).  Thus,  at  the  SCRF 
approximation  [23-28  ] ,  the  INDO/S-Cis  calculations  should  yield  almost  the  same 
spectra.  This  is,  indeed,  the  case  as  one  can  see  from  inspection  of  Table  VI.  The 
calculated  (SCRF  only)  transition  energy  of  acetonitrile  (12,830  cm-1)  is  slightly 
larger  than  that  of  methanol  (12,250  cm-1),  while  experimentally  the  first  n  -►  tt* 
band  is  observed  at  19,420  cm-1  =  515  nm  (the  solution  is  red)  for  methanol  and 
16,080  cm-1  =622  nm  (green-blue)  for  acetonitrile  [3] .  These  calculations  illustrate 
the  well-known  phenomenon  of  the  saturation  in  the  SCRF  model  as  5(e)  approach 
1/2  as  e  increases.  Clearly  the  continuum  model  [23-28]  alone  is  not  satisfactory. 
To  correct  this  we  decided  to  include  two  methanol  molecules  near  the  oxide  ter¬ 
ming  of  RB,  and  two  molecules  near  the  positively  charged  nitrogen.  This  simple 
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Table  IV. 

Calculated  absorption  spectrum  of  Reichardt’s  dye  #1,  conformer 

A  in  1,4-dioxane. 

A E*  (cm"1) 

f  b 

Jose. 

Main  Cl  contributions  and  single  excitations 

1059 lc 

0.153 

-0.97127  (102  —•  103)  (homo  -►  lumo) 

17640d 

0.030 

-0.98076  (102  —  104)  (homo  -►  lumo  +  1) 

25217e 

0.414 

-0.75027  (102  — ►  111),  -0.43925  (102  —  107) 

2561  r 

0.151 

+0.82156  (101  —  103),  +0.37918  (100  —  103) 

27069 

0.040 

+0.731 19  (102  —  105),  +0.48779  (102  110) 

27772 

0.001 

-0.72349  (102  —  109),  -0.59091  (102  —  107) 

27536 

0.057 

-0.50022  (102  —  108),  -0.49728  (102  —  109) 

27339 

0.030 

+0.71390  (102  —  108),  +0.34673  (102  -►  111) 

28305 

0.058 

-0.82669  (100  —  103),  +0.45851  (101  —  103) 

31307f 

0.342 

+0.75436  (96  —  103),  -0.35847  (93  —  103) 

3 1 842f 

0.065 

+0.64745  (99  —  103),  -0.35764  (95  —  103) 

31941 

0.069 

+0.72591  (102  —  112),  +0.47590  (102  —  113) 

32186 

0.037 

+0.86755  (102  -►  1 14)  (HOMO  —  lumo  +  1 1) 

32815s 

0.429 

+0.65087  (102  -►  113),  -0.39108  (102  —  112) 

32655s 

0.062 

-0.61304  (102  —  110),  +0.47997  (102  —  105) 

33945s 

0.057 

-0.74460  (101  —  104),  -0.34662  (99  —  103) 

33290s 

0.003 

-0.81896  (102  —  106),  (homo  —  lumo  +  3) 

34740s 

0.319 

+0.56390  (95  —  103),  -0.52880  (94  —  103) 

a  Transition  energies. 

b  Oscillator  strengths:  fj  =  4.7092  X  AEy(i\n\jy. 

Experimental  values  in  1,4-dioxane  [1],  (c)-(g). 

c  12,600  cm"1  =  795  nm. 

d  21,500  cm"1  =  465  nm. 

e  A  sharp  peak  at  25,000  cm-1  =  400  nm. 

f  A  slight  shoulder  at  31,000  cm-1  =  322  nm. 

s  A  broad  band  centered  at  approximately  33,330  cm-1  =  300  nm. 


arrangement  “fixes”  the  calculated  spectrum  for  methanol,  as  shown  in  Tables  VIII 
and  IX.  At  the  SCRF-SM  level  of  approximation,  the  n  -►  7r*,  ct  band  is  located 
at  15,070  cm”1  for  conformer  A  and  14,140  cm-1  for  conformer  B.  We  further 
observe  that  the  oscillator  strength  for  this  transition  is  drastically  reduced  (compared 
to  the  gas  phase  calculation).  Also,  the  now  stronger  homo-1  -►  lumo  transition 
was  calculated  at  19,220  cm-1  for  conformer  A  and  19,320  cm-1  for  conformer  B. 
This  accounts  for  a  blue  shift  of  7,000  cm”1  with  respect  to  the  scrf  result.  The 
agreement  with  the  experimental  value  of  19,420  cm-1  is  very  good. 


Water 

The  absorption  spectrum  of  RB  in  water  (Fig.  6)  should  be  the  most  difficult  to 
reproduce,  due  to  the  special  properties  of  the  solvent.  The  INDO/S-Cis  calculated 
spectra  of  RB  surrounded  by  four  water  molecules  are  shown  in  Tables  X  (conformer 
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Table  V.  Calculated  absorption  spectrum  of  Reichardt’s  dye  #1,  conformer 
B  in  1,4-dioxane. 


A Ea  (cm-1)  fosch  Main  ci  contributions  and  single  excitations 


1 1 87 1 c 

0.304 

-0.96922 

(102  — 

103)  (HOMO  —  LUMO) 

18193d 

0.041 

+0.97441 

(102  — 

104)  (HOMO  —  LUMO  +  1) 

24925 e 

0.401 

+0.65478 

(102  — 

111),  -0.63700(102  —  106) 

26027 e 

0.162 

-0.91968 

(101  — 

103) 

27469 

0.049 

-0.71689 

(102  — 

105),  -0.53121  (102  —  110) 

27917 

0.013 

+0.90477 

(102  — 

109)  (HOMO  —  LUMO  +  6) 

27495 

0.027 

-0.55251 

(102  — 

108),  -0.53296  (102  —  111) 

27523 

0.021 

+0.66714 

(102  — 

108),  +0.45593  (102  —  111) 

27856 

0.003 

-0.96844 

(100  — 

103)  (HOMO-2  —  LUMO) 

31562f 

0.145 

+0.68260 

(99  —  103),  -0.40201  (95  —  103) 

3 1 938 f 

0.281 

-0.78945 

(96  —  103)  (homo-6  —  lumo) 

32015 

0.066 

-0.87926 

(102  — 

1 12)  (HOMO  —  LUMO  +  9) 

32602 

0.050 

+0.78754 

(102  — 

113)  (HOMO  —  LUMO  +  10) 

33428 8 

0.195 

-0.57387 

(102  — 

114),  +0.49714(102—  110) 

33474s 

0.002 

+0.82964 

(102  — 

107)  (HOMO  —  LUMO  +  4) 

342 1 5 8 

0.238 

-0.67225 

(102  — 

107),  -0.40388  (102  —  110) 

34863s 

0.018 

+0.85915 

(101  — 

104)  (HOMO-1  —  LUMO  +  1) 

35599s 

0.201 

-0.46040 

(94  —  103),  -0.39471  (95  —  103) 

a  Transition  energies. 

b  Oscillator  strengths:  f0  =  4.7092  X  AEg(i\n\jy2. 

Experimental  values  in  1,4-dioxane  [1],  (c)-(g). 
c  12,600  cm-1  =  795  nm. 
d  21,500  cm-1  =  465  nm. 
e  A  sharp  peak  at  25,000  cm-1  =  400  nm. 
f  A  slight  shoulder  at  31,000  cm-1  =  322  nm. 

8  A  broad  band  centered  at  approximately  33,330  cm-1  =  300  nm. 


Table  VI.  First  CT  band  in  the  uv-visible  absorption  spectrum  of  pyridinium-iV-phenoxide  betaines. 


Solvent 

e 

nD 

^calcd.  (cm  ) 

^expti.  (cm  l) 
[3] 

Et  (30)  (kcal) 

SCRF 

SCRF— SM 

Calcd. 

Exptl.  [3} 

hexane 

1.890 

1.3751 

11,050 

_ 

10,850 

31.6 

31.0 

CC\4 

2.238 

1.460 

11,260 

— 

11,338 

32.2 

32.4 

benzene 

2.284 

1.501 

11,297 

— 

11,990 

32.3 

34.3 

1 ,4-dioxane 

2.209 

1.4224 

11,231 

— 

12,600 

32.1 

36.0 

CHC13 

4.806 

1.446 

12,008 

13,555 

13,680 

38.8 

39.1 

CH3CN 

35.94 

1.3442 

12,831 

— 

16,077 

— 

45.6 

methanol 

32.63 

1.3288 

12,246 

19,272 

19,420 

55.1 

55.4 

water 

78.54 

1.3328 

12,370 

21,790 

22,075 

62.3 

63.0 
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Figure  4.  Reichardt’s  dye  #1  (conformer  B),  in  chloroform.  am1-scrf-sm  geometry. 


A)  and  XI  (conformer  B).  The  n  -►  t*,  ct  band  was  calculated  at  14,060  cm-1 
for  conformer  A  and  13,980  cm-1  for  conformer  B.  As  previously  observed  for 
methanol,  this  band  loses  its  oscillator  strength  by  at  least  one  order  of  magnitude 
(compared  to  the  gas  phase  result).  The  experimental  spectrum  shows  a  peak  at 
453  nm,  and  this  peak  should  be  the  one  related  to  the  i^(30)  scale.  However, 


Table  VII.  Calculated  absorption  spectrum  of  Reichardt’s  dye  #1 ,  conformer 
B  in  chloroform,  sm  approximation. 


A E*  (cm-1) 

f  b 

Jose, 

Main  ci  contributions  and  single  excitations 

1 3555 c 

0.013 

-0.95917  ( 1 54  — ►  155)  (HOMO  -►  lumo) 

16260 

0.074 

+0.98861  (154  -►  158)  (homo  lumo  +  3) 

16451 

0.101 

+  0.98535  ( 1 54  — ►  159)  (homo  lumo  +  4) 

17943 

0.009 

+0.98031  (154  — ►  156)  (homo  -►  lumo  +  1) 

19979 

0.002 

-0.96406  (154  -►  157)  (homo  -*  lumo  +  2) 

22236 

0.019 

+0.97694  (154  —►  160)  (homo  -*►  lumo  +  5) 

26355 

0.484 

-0.79421  (154  — ►  165)  (homo  -►  lumo  +  10) 

26465 

0.013 

-0.71339  (153  155)  (homo-1  lumo) 

a  Transition  energies. 

b  Oscillator  strengths:  fj  =  4.7092  X  A Ey  (i\n\j}2. 
cExptl.  value  13,680  cm-1  =  833  nm. 
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Figure  5.  Reichardt’s  dye  #1  (conformer  B),  in  methanol.  amI-SCRF-sm  geometry. 


based  on  the  results  of  this  article,  we  now  conclude  that  the  observed  peak  (for 
methanol  and  water,  and  probably  also  for  ethanol)  rather  corresponds  to  the  second 
n  7r*  transitions,  homo-1  -►  lumo  for  methanol  (see  Tables  VIII  and  IX)  and 
HOMO  -►  lumo  +  1  for  water  (see  Tables  X  and  XI).  We  also  observe  that  for 
both  solvents  the  sharp  peak  (observed  close  to  25,000  cm"1  for  1,4-dioxane)  is 
red  shifted  by  about  1,000  cm"1 . 


Table  VIII.  Calculated  absorption  spectrum  of  RB,  conformer  A  in  methanol, 
sm  approximation. 


A Ea  (cm"1) 

f  b 

Jose, 

Main  Cl  contributions  and  single  excitations 

15071 

0.038 

+0.96651  (130  —  131)  (homo  —■  lumo) 

19224c 

0.103 

+0.88285  (129  —  131)  (HOMO-1  -►  LUMO) 

20815 

0.077 

-0.89996  (130  —  132)  (homo  —  lumo  +  1) 

23335 

0.037 

+0.83438  (129  —  132)  (homo-1  —  lumo  +  1) 

24431 

0.442 

-0.76913  (130  -►  136)  (homo  —  lumo  +  5) 

27449 

0.032 

-0.92785  (130  —134)  (homo  —  lumo  +  3) 

a  Transition  energies. 
b  Oscillator  strengths. 
c  Exptl.  19,420  cm"1  =  515  nm. 
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Table  IX.  Calculated  absorption  spectrum  of  rb,  conformer  B  in  methanol, 
sm  approximation. 


A£a(cm  ') 

f  b 

Jose. 

Main  Cl  contributions  and  single  excitations 

14138 

0.020 

-0.96546  (130  -►  131)  (homo  -*►  lumo) 

19320c 

0.164 

+0.94352  (129  -►  131)  (homo-1  -►  lumo) 

21343 

0.044 

-0.96346  (130  -►  132)  (homo  -*►  lumo  +  1) 

24052 

0.442 

-0.76055  (130  -►  136)  (homo  -►  lumo  +  5) 

24929 

0.031 

+0.81435  (129  —  132)  (homo-1  lumo  +  1) 

28427 

0.040 

+0.69131  ( 1 30  — ►  133)  (homo  lumo  +  2) 

a  Transition  energies. 
b  Oscillator  strengths. 
c  Exptl.  19,420  cm-1  =  515  nm. 


Concluding  Remarks 

In  this  work  we  have  examined  the  absorption  spectra  of  a  compound  which 
presents  negative  solvatochromism,  Reichardt’s  dye  #1  (RB).  The  challenge  was  to 
obtain  a  correct  description  of  the  negative  solvatochromism  exhibited  by  rb.  For 
nonpolar  solvents,  which  make  no  specific  binding  to  the  solute,  the  continuum 
model  [23-28]  was  enough  to  reproduce  the  experimental  spectra.  For  solvents 
which  form  specific  binding  (e.g.,  H-bond)  with  rb,  we  had  to  apply  the  super- 


Figure  6.  Reichardt’s  dye  #1  (conformer  B),  in  water.  AMl-SCRF-SM  geometry. 
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Table  X.  Calculated  absorption  spectrum  of  RB,  conformer  A  in  water,  SM 
approximation. 


AFa  (cm  ') 

f  b 

Jose. 

Main  Cl  contributions  and  single  excitations 

14061 

0.017 

+0.97387  (1 18  -►  119)  (homo  -►  lumo) 

21851° 

0.035 

-0.97906  (118  -*•  120)  (homo  -►  lumo  +  1) 

23000 

0.168 

+0.89137  (115  — ►  1 19)  (homo-3  -►  lumo) 

24377 

0.470 

-0.83016  (1 18  ^  124)  (homo  -►  lumo  +  5) 

26658 

0.003 

+0.76182  (1 15  -*  120)  (homo-3  lumo  +  1) 

28836 

0.070 

+0.74919  (118  -►  122)  (homo  -►  lumo  +  3) 

29450 

0.016 

-0.66650  (1 18  -*■  121)  (homo  ->  lumo  +  2) 

a  Transition  energies. 
b  Oscillator  strengths. 
c  Exptl.  22,075  cm-1  =  543  nm. 


molecule  approach  [29-31].  This  work  provided  basis  from  drawing  some  conclu¬ 
sions,  that  we  now  present. 

(i)  am  1  calculations  have  provided  two  low-lying  conformers  very  close  in  heats 
of  formation,  which  suggests  that  they  coexist,  both  in  gas  phase  and  in  solution. 

(ii)  We  have  confirmed  that  the  first  CT  band  in  the  spectrum  of  RB  has  n  -►  tv* 
character  and  arises  from  an  excitation  from  the  lone  pair  of  the  phenoxide  oxygen. 

(iii)  For  nonpolar  solvents  increase  of  the  dielectric  constant  gives  rise  to  increase 
of  £,(30).  (iv)  For  some  nonpolar  solvents  such  as  chloroform  and  1,4-dioxane, 
specific  binding,  although  quite  weak,  is  enough  to  disturb  the  mentioned  pattern 
and  the  CT  band  is  further  blue  shifted,  (v)  One  of  the  effects  of  the  SCRF  on  the 
Cl  calculations  is  that  the  lowest  energy  n  -►  tt*,  CT  band  loses  its  intensity  as  the 
dielectric  constant  increases,  (vi)  We  observe  that  this  effect  holds  even  for  sm 
calculations,  so  the  bands  used  in  the  £,(30)  scale  for  methanol  and  water  are 


Table  XI.  Calculated  absorption  spectrum  of  RB,  conformer  B  in  water,  sm 
approximation. 


A£a  (cm-1)  /oscb  Main  Cl  contributions  and  single  excitations 


13978 

0.018 

+0.97253  (118 

1 19)  (HOMO  -►  LUMO) 

21728° 

0.037 

-0.97587  (118 

-► 

120)  (HOMO  -►  LUMO  +  1) 

23029 

0.172 

+0.91992(115 

-► 

1 19)  (HOMO-3  -►  LUMO) 

24056 

0.450 

+0.87774  (118 

-► 

124)  (HOMO  -►  LUMO  +  5) 

26674 

0.003 

+0.78160  (115 

-* 

120)  (HOMO-3  -►  LUMO  +  1) 

28325 

0.072 

+0.87366  (118 

-> 

122)  (HOMO  -*»  LUMO  +  3) 

29288 

0.009 

-0.83416  (118 

121)  (HOMO  -*■  LUMO  +  2) 

a  Transition  energies. 
b  Oscillator  strengths. 
c  Exptl.  22,075  cm-1  =  543  nm. 
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Et(30)  as  calculated  by  INDO/S 


Figure  7.  Plot  of  theoretical  versus  experimental  results  for  the  £t(30)  parameter. 


actually  due  to  the  second  n  tt*  transition.  Finally,  (vii)  the  indo/S-cis  technique 
proved  a  successful  tool  in  reproducing  the  empirical  ^(30)  scale.  We  summarize 
these  results  in  Table  VI  and  Figure  7.  Perhaps  slight  discrepancies  could  be  removed 
by  increasing  the  number  of  solvent  molecules,  but  we  are  more  likely  at  the  limit 
of  the  model  we  use  for  structure  and  spectroscopy. 
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Determination  of  Higher  Electric  Polarizability 
Tensors  from  Unrelaxed  Coupled  Cluster  Density 
Matrix  Calculations  of  Electric  Multipole  Moments 

TADEUSZ  PLUTA,*  JOZEF  NOGA, :  and  RODNEY  J.  BARTLETT* 

Quantum  Theory  Project,  University  of  Florida,  Gainesville,  Florida  32611-2085 


Abstract 

The  ccsdt-  1  method  is  used  to  calculate  electric  dipole,  quadrupole,  and  octopole  moments  for  the 
HF  and  H20  molecule  and  the  F"  anion,  together  with  the  associated  higher-order  polarizabilities.  All 
multipole  moments  are  computed  as  expectation  values.  Externally  perturbed  moments  are  used  to 
determine  the  components  of  the  electric  tensors  of  second  and  higher  order.  It  is  shown  that  the  correlated 
electric  moments  are  in  good  agreement  with  experiment  and  that  they  may  be  used  to  obtain  the  electric 
polarizability  and  hyperpolarizability  tensors  in  an  effective  way.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Electric  multipole  moments  are  of  major  importance  in  many  areas  of  physical 
chemistry,  including  the  evaluation  of  intermolecular  interactions  [1-3].  The  in¬ 
teraction  among  permanent  multipoles  represents  the  coulombic  part  of  the  inter¬ 
action  energy,  while  the  induction  contribution  can  be  represented  as  an  interaction 
of  permanent  moments  of  a  molecule  A  with  the  distorted  charge  density  of  B.  The 
ability  of  molecule  B  to  be  distorted  by  the  permanent  moments  of  A  is  measured 
by  the  polarizability  tensors  of  B.  Unlike  the  multipole  moments,  polarizabilities 
are  of  second-  or  higher-order  in  the  perturbation  and  demand  much  more  com¬ 
putational  effort  to  calculate  than  first-order  electric  properties.  In  principle,  one 
can  use  analytical  differentiation  methods  [2],  but  in  practice  the  absence  of  an¬ 
alytical,  mixed  higher  derivatives  at  the  correlated  level  dictates  the  use  of  more 
straightforward  finite-field  techniques. 

In  this  article  we  present  the  results  of  our  calculations  of  electric  moments  and 
polarizabilities  for  model  systems:  HF,  H2O,  and  the  F  anion.  We  have  chosen 
to  employ  the  charge  perturbation  variant  of  the  finite-field  method  together  with 
direct  expectation  value  calculations  of  electric  first-order  moments.  Electron  cor¬ 
relation  effects  have  been  accounted  for  using  the  coupled  cluster  method  (ccm) 
[4-7]  with  the  inclusion  of  single,  double,  and  triple  excitation  operators,  namely, 


*  Department  of  Chemistry,  Silesian  University,  PL-40-006  Katowice,  Poland. 

t  institute  of  Inorganic  Chemistry,  Slovak  Academy  of  Sciences,  SK-842  36  Bratislava,  Slovakia. 

*  To  whom  correspondence  should  be  addressed  at  the  University  of  Florida. 


International  Journal  of  Quantum  Chemistry:  Quantum  Chemistry  Symposium  28,  379-393  (1994) 

©  1994  John  Wiley  &  Sons,  Inc.  CCC  0360-8832/94/010379-15 


380 


PLUTA,  NOGA,  AND  BARTLETT 


CCSDT-1  [8],  The  choice  of  our  systems  follows  from  the  fact  that  there  are  many 
calculations  of  higher-order  properties  available  at  the  SCF  level  to  compare.  How¬ 
ever,  at  the  correlated  level  some  components  of  the  polarizability  tensors  have,  to 
our  knowledge,  never  been  obtained  before.  In  particular,  F  is  an  extremely  difficult 
case  for  electric  properties  evaluation,  due  to  the  diffuse  character  of  the  charge 
density  and  the  need  to  account  for  a  substantial  part  of  the  electron  correlation 
effects. 

We  would  also  like  to  emphasize  that  the  high  quality  CCSDT-1  approach  satisfies 
the  Hellmann-Feynman  theorem  to  a  great  accuracy  [9,10]  and  in  conjunction 
with  the  charge  perturbation  method  provides  an  effective  and  accurate  tool  for 
calculating  electric  tensors  of  small  molecules.  At  the  same  time,  the  ability  of 
medium  size  polarized  basis  sets  suited  for  calculations  of  electric  properties,  as 
suggested  by  Sadlej  [11],  has  been  tested. 

Method 

The  finite-field  technique  consists  of  placing  a  molecule  (atom)  in  a  weak  external 
electric  field  and  then  calculating  the  total  energy  of  the  system.  In  principle,  the 
first-,  second-,  and  higher-order  electric  properties  of  the  system  can  be  calculated 
by  numerical  differentiation  of  the  total  perturbed  energy  [e.g.,  12].  Since  the  only 
property  used  in  these  calculations  to  determine  polarizabilities  is  the  total  energy, 
there  is  no  fundamental  difficulty  in  using  correlated  methods  like  mbpt  or  CCM . 
The  charge  perturbation  method  [13,14]  is  a  variant  of  the  conventional  finite- 
field  technique.  The  external  electric  field  is  generated  by  a  charge  Q  (or  charges) 
placed  at  a  distance  R  from  the  molecule  and  hence  do  not  provide  a  homogeneous 
field,  but  instead  a  presence  of  field  gradients.  Hence,  the  method  can  be  used  to 
determine  higher  moment  polarizabilities.  The  Hamiltonian  of  such  a  system  can 
be  written  as 


H(Q,R)  =  H(0)  +  H(l) 


with  the  perturbation  H(  1  ]  given  by 


«el 

2 


i=i 


Q  ZjQ 

\r,~R\  "  |  0-R\’ 


(1) 

(2) 


where  the  symbols  have  their  usual  meaning. 

Expanding  the  Hamiltonian  about  the  unperturbed  H(0)  in  terms  of  the  molecular 
multipole  moments,  one  obtains  the  following  expression: 

H  =  i/(0)  —  fiaFa  —  —  ]5  £lapyFapy  —  •  •  •  (3) 


while  the  corresponding  total  perturbed  energy  E  reads 

E  =  E^  —  flaFa  —  jQapFap  —  15  ®at0yFa0y  —  •  •  • 

-  2<WvF,s  -  iPeiSyFaFpFy  -  STa/S-yS FJFgFyFs  -  •  •  . 

3  ^4  afiyF aF {jy  6  ^  ot(3,ydFctF fiFyfc  ^Ca^^y^F ctftFyfi  1 5  ^'a,{35yFaF[ 
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Cartesian  coordinates  are  denoted  as  a,  ft  and  y  suffixes  and  the  repeated  index 
summation  convention  is  employed  throughout  the  paper.  Electric  field  Fa  and  the 
field  gradient  Faj3  are  defined  as  the  derivatives  of  the  electrostatic  potential  <p  at 
the  origin  of  the  multipole  moments,  here  taken  to  be  at  the  center  of  mass: 


F„  = 


F<x&  — 


d(f) 

fa’ 

d2<t> 

drarp 


(5a) 

(5b) 


fi ,  0,  and  U  denote  the  dipole,  quadrupole,  and  octopole  moments  of  the  unperturbed 
molecule,  a,  ft  and  y  are  the  dipole  polarizability,  hyperpolarizability,  and  second 
dipole  hyperpolarizability,  respectively.  Dipole-quadrupole,  quadrupole,  dipole- 
octopole,  and  mixed  dipole-dipole-quadrupole  hyperpolarizability  tensors  are  de¬ 
noted  by  the  symbols  A,  C,  E,  and  B,  respectively.  Analogous  relations  like  Eq. 
(4)  hold  for  multipole  moments: 

M«(  Q  fF. )  fia  F  ocapFp  T"  3^a,/?7  T  3  Fap ,y&F sFyb ,  ( 6a ) 

®ap(Q,R)  =  0ajS  F  AytCtpFy  F  CafaysFy8  F  ^Fy^apFy F&,  (6b) 

=  ^a(3y  F  E8^a(3yF8 .  (6c) 

The  above  relations  along  with  the  energy  expansion  provide  enough  information 
to  completely  determine  the  components  of  the  electric  tensors  and  form  the  basis 
of  the  charge  perturbation  method.  For  a  given  system  of  atoms  and  external  charges, 
one  can  get  a  system  of  equations  relating  the  known  quantities,  like  the  perturbed 
moments  to  the  unknown  tensors  a,  ft  A,  B,  etc.  The  presence  of  a  small  external 
charge  Q  does  not  change  the  electron  distribution  dramatically  and,  with  the  proper 
choice  of  its  magnitude,  is  sufficient  to  produce  a  field  that  can  generate  the  induced 
multipoles.  Bishop  and  his  coworkers  systematically  employed  this  method  to  study 
atoms  [15],  diatomics  [16],  and  small  polyatomic  systems  [17].  The  working  equa¬ 
tions  relating  the  components  of  the  second-  and  higher-order  tensors  were  derived 
in  [  16, 1 7  ]  and  need  not  be  repeated  here.  Bishop  performed  his  calculations  mostly 
at  the  scf  level  using  extended,  carefully  prepared  basis  sets.  For  systems  like  H20 
[  1 7  ] ,  he  also  included  electron  correlation  by  performing  limited  mcscf  calculations. 
Both  the  SCF  and  MCSCF  methods  obey  the  Hellmann-Feynman  theorem  and  con¬ 
sequently  all  first-order  moments  are  rigorously  given  as  expectation  values.  Others 
[18]  have  used  the  charge  perturbation  method  with  mbpt  and  limited  coupled 
cluster  applications  using  the  approximate  variant,  CCD  +  ST(CCD)  [19].  Since 
neither  mbpt  nor  CC  wavefunctions  satisfy  the  Hellmann-Feynman  theorem,  po¬ 
larizability  tensors  must  be  determined  directly  from  perturbed  energies.  In  this 
case  the  charge  perturbation  method  becomes  a  rather  minor  modification  of  the 
traditional  finite  field  technique  and  many  potential  advantages  are  lost.  Alterna¬ 
tively,  calculations  can  be  based  on  the  multipole  moments  calculated  as  expectation 
values  from  CC/mbpt  wavefunctions  at  various  electric  fields.  Such  an  approach 
proved  to  be  efficient  for  the  calculation  of  dipole  polarizability  of  Be  [10].  This 
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follows  from  the  fact  that  the  expectation  value  error  at  higher  cc  levels  (like  ccsdt  ) 
can  be  expected  to  be  quite  small  [9,10].  This  may  be  explained  as  follows: 

Let  us  have  a  system  under  the  influence  of  an  external  one-electron  perturbation 
\0  (with  the  operator  O  and  its  strength  X)  described  by  the  Hamiltonian  (in 
normal  ordered  form) 

Hn(X)  =  +  \On.  (7) 


Then  the  first  derivative  of  the  expectation  value  (correlation)  energy 


A  E(X)  = 


C*!*) 


(8) 


with  respect  to  A  at  A  =  0,  defining  the  correlation  contribution  to  the  one-electron 
property,  is  given  as 


(dAE\  >  <»|Q|»)  2(»|/fo-Ag|»x) 

l  3A  )x__0  <*|*>  <*!*> 


<¥|/7W-  A£|tfx>  =  (0\e^(HN- AE)(U  +  T( l))e7'|0> 

=  <0|(t/+  +  T^(\))e^(HN-  AE)eT\0).  (9b) 

For  simplicity  from  this  point  we  have  omitted  superscript  (0)  connected  with 
unperturbed  quantities.  Operator 

U  =  2  Uua'i  ( 10) 


follows  from  the  CPHF  theory  [20],  and  T(  1 )  is  the  first-order  change  to  the  exci¬ 
tation  operator  T(X)  expanded  in  powers  of  X 

\2 

T(\)  =  T+  X7X1)  +  y  T{2)  +  •  •  •  (11) 

Finally  inserting  1  =  eT  2  \<pm)(<f>m\e~T to  (9b)  yields 
<0|(C/t  +  TH\))e'*er'2  \4>m){<S>m\e-T(HN  -  A£)er|0> 

m 

=  <0| ( C/+  +  7’t(l))er,c:r|0>(0|  e~T(HK—  A£)cr|0)  (12a) 

+  2  <0|(f/t+  THl))eT'eT\<t>m)(<l>m\(HNeT)c\0).  (12b) 

m¥=0 


Now  one  immediately  sees  that  the  last  term  (12b)  vanishes  for  those  excited  con¬ 
figurations  |  <£w)  which  are  generated  by  the  action  of  the  actually  chosen  T  on 
1 0  ) ,  since  within  the  conventional  CCM 

(4>m\(HNeT)c\0)  =  0  (13) 

are  the  defining  equations  to  determine  Tm.  The  energy  dependent  term  (12a) 
would  only  vanish  if  one  assumes  the  full  CC  method  with  all  possible  excitation 
operators  included  in  T.  For  a  truncated  T  operator  the  expectation  value  ( 8 )  is 
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no  longer  equal  to  the  usual  cc  energy  calculated  from  the  asymmetric  formula 
[21],  i.e., 

A E(X)  ±  (0\(HNMe™)c\0).  (14) 

Nevertheless,  the  difference  between  the  latter  two  values  is  again  due  to  contri¬ 
butions  from  those  excited  configurations  that  are  not  included  in  the  CC  method 
actually  used  [21] .  The  similar  conclusion  for  non-Hellmann-Feynman  terms  can 
be  drawn  for  the  truncated  Cl  [22].  Hence  for  T  =  T2  the  error  due  to  the  “non- 
Hellmann-Feynman”  or  “relaxation”  correction  [Eq.  ( 12)]  is  of  second  order  (in 
T),  for  ccsd  [23]  of  third  order,  while  for  CCSDT  [24]  with  T  =  T i  +  T2  +  T3 
only  of  fifth  order  in  a  generalized  sense  [25].  As  the  error  of  the  energy  is  for 
CCSDT  of  the  same  order,  it  is  quite  appropriate  to  assume  the  validity  of  the 
Hellmann-Feynman  theorem  and  to  calculate  one  electron  properties  as  expectation 
values.  Let  us  stress  that  the  difference  between  the  conventional  CCSDT  energy 
and  the  expectation  value  actually  defines  its  error.  Therefore,  within  the  limits  of 
this  error,  starting  our  analysis  from  the  expectation  value  was  appropriate,  too. 
Finally,  if  the  difference  between  the  full  CCSDT  and  its  less  accurate  variants  CCSDT- 
n  [26]  is  expected  to  be  small,  one  can  safely  use  the  above  conclusions.  For  alter¬ 
native  CC  ansatz  like  ucc(«)  [27]  even  the  term  with  T(  1 )  in  Eq.  ( 12)  disappear, 
since  the  method  satisfies  the  generalized  Hellmann-Feynman  theorem  [27,28]. 

Cizek  has  shown  [4]  that  the  expectation  value  of  an  arbitrary  operator  in  CC 
theory  is 


<0>  =  <*|0|*>c,  (15) 

which  is,  however,  infinite.  Considering  only  terms,  which  contribute  to  lower  order 
than  is  the  actual  error  inherent  to  the  method,  this  expansion  can  be  truncated 
after  a  few  terms.  In  [9]  it  was  suggested  to  truncate  the  wave  function  in  Eq.  ( 15) 
after  the  second-order  terms,  since  the  wave  function  is  anyway  only  accurate  to 
this  order.  One  can,  of  course,  formulate  Eq.  ( 1 5 )  in  terms  of  the  density  matrix 

Ppq  =  (*\p*q\*)c,  (16> 

which  is  independent  of  O ,  and,  then,  as  usual,  contract  this  density  matrix  with 
integrals  of  the  operator  O.  Instead  of  truncating  the  wave  function,  one  can  insist 
on  the  accuracy  of  the  density  matrix  and  terminate  the  expansion  of  the  latter 
after  fourth-order  terms  [10],  which  is  more  precise.  We  denote  in  this  paper  the 
former  approach  as  approach  A  and  the  latter  as  B.  Let  us  recall  that  in  approach 
A  we  have  in  Eq.  ( 16) 


eT~\  +  r,  +  r2  +  r3  +  \t2t2 


(17) 


and  this  differs  from  B  by  a  fourth  order  term  (<0 1  T\a?i  TXT2  |0)c  +  h.c.). 

The  alternative  “response”  density  approach  avoids  truncation  and  has  been 
extensively  applied  to  properties  [29],  but  requires  determination  of  the  N  operator 
as  well  as  T. 
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Results  and  Discussion 

As  mentioned,  we  have  selected  rather  well-known  systems  to  test  the  approach 
described  in  the  previous  sections:  HF,  H20  molecules,  and  the  F“  anion.  Electric 
multipoles  were  calculated  at  the  CCSDT-1  level  of  theory  as  expectation  values  at 
various  charges,  and  from  the  perturbed  moments  higher  polarizabilities  have  then 
been  evaluated.  Since  the  charge  perturbation  method  requires  many  repetitions 
of  time-consuming  calculations,  we  have  decided  to  use  relatively  modest  basis  sets 
still  expected  to  work  well  for  such  calculations.  Such  basis  sets,  in  particular  designed 
for  electric  properties  calculations  (POL  bases)  have  been  recently  proposed  by 
Sadlej  [11].  The  POL  basis  which  consists  of  the  contracted  set  [5s3p2d]  for  O  and 
F  and  [3s2p]  for  H  has  been  used  in  our  calculations.  We  have  partially  lifted  the 
contraction  of  the  d  shell  on  O  and  F  and  p  shell  on  H  to  obtain,  finally,  a  [5s3p3d] 
set  for  O,  F,  and  [3s3p]  for  the  hydrogen  atom.  The  small  differences  between  our 
SCF  results  and  the  results  of  the  recent  study  by  Sekino  and  Bartlett  (SB)  [29] 
reflects  the  fact  that  the  latter  study  employed  the  original  POL  basis:  [5s3p2d/ 
3s2p] .  Our  calculations  for  hf  have  been  performed  for  the  bond  length  of  1 .7330 
a.u.  and  at  the  experimental  geometry  for  the  H20  molecule  as  in  [  29  ] .  The  charges 
and  distances  have  been  selected  to  produce  the  field  strength  of  near  0.005  a.u. 
and  gradients  0.001  to  0.002  a.u..  The  T-amplitudes  have  been  converged  to  at 
least  10-7,  which  means  that  the  accuracy  of  density  matrices  was  the  same  or 
higher. 

hf:  The  results  for  the  dipole,  quadrupole,  and  octopole  moments  are  given  in 
Table  I.  Numerical  Hartree-Fock  (nhf)  results  for  the  dipole  and  quadrupole 
moments  [  30,3 1  ]  are  0.756  and  1 .733  a.u.,  respectively,  and  are  in  very  good  agree¬ 
ment  with  our  SCF  values.  Other  comparisons  can  be  made  with  Bishop  and  Maroulis 
(BM)  [16],  who  systematically  constructed  and  employed  eight  basis  sets  up  to  a 
set  of  96  contracted  Gaussian  functions  denoted  as  B8  [9s6p5dlf/5s4p2d].  Their 
B8  SCF  dipole,  quadrupole,  and  octopole  moments  are:  0.757133,  1.74071,  and 
2.62645  a.u.,  respectively.  The  overall  agreement  of  these  near  HF  limit  values  with 
our  results  is  very  good,  too,  and  confirmes  the  reliability  of  the  basis  set  used  for 
the  pertinent  properties  (Table  II) .  At  the  correlated  CCSDT-1  level  the  results  agree 
very  well  with  experimental  values  (where  available).  The  octopole  moment  has 
not  been  determined  experimentally  so  we  compare  our  results  with  the  CISD  cal¬ 
culations  performed  with  a  [6s5p3d/4s3p]  basis:  2.4466  a.u.  [32]. 


Table  I.  Electric  moments  of  the  hf  molecule  (in  a.u.). 


Moment 

SCF 

CCSDT-1 

(A) 

CCSDT-1 

(B) 

Experiment 

0.757 

0.700 

0.700 

0.7078 

0 

1.754 

1.727 

1.727 

1.72  ±  0.0b 

2.593 

2.461 

2.460 

a  Experimental  estimate  taken  from  [37]. 
b  Experimental  estimate  taken  from  [32]. 
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Table  II.  Electric  tensors  for  the  hf  molecule  (in  a.u.) 


Tensor 

SCF 

CCSDT-1 

CCSDT-1 

Other  scf 

Exp.  or  Correlated 

«Z2 

5.72 

6.40 

6.40 

5.75a 

6.428b,  6.438f 

«xx 

4.49 

5.33 

5.32 

4.48  lc 

5.253b,  5.340f 

a 

4.90 

5.69 

5.68 

Aa 

1.23 

1.07 

1.08 

0ZZZ 

-8.94 

-10.26 

-10.36 

-8.3a 

— 9.838b,  -9.62f 

Pzxx 

0.005 

-0.54 

-0.64 

— 0.4C 

— 0.802b,  -1.27f 

0 

-5.36 

-6.80 

-6.94 

— 7.30f 

Az,zz 

3.98 

4.56 

4.56 

3.986c 

4.508d 

AXyZX 

0.76 

1.07 

1.07 

0.928c 

0.688d 

c«.„ 

6.96 

8.16 

8.15 

7.36c 

8.63e 

c„„ 

3.15 

3.71 

3.71 

4.34c 

5.15e 

(-x.r  jiv 

3.87 

4.70 

4.70 

5.43c 

6.53e 

B„.r. 

-52.6 

-64.5 

-64.5 

-56c 

Bxw 

-28.2 

-40.6 

-40.5 

-35c 

Bw-_ 

26.3 

41.9 

41.6 

26c 

-39.6 

-62.4 

-62.1 

~45c 

V 

6.84 

13.30 

13.33 

6.68e 

Exjax 

-0.49 

-0.11 

-0.19 

— 0.76c 

7zzzz 

279 

411 

413 

294c 

400b,  390f 

Txxxx 

299 

599 

597 

340c 

540b,  650f 

Ixxzz 

120 

177 

178 

1 17c 

150b,  180f 

a  NHF  [30]. 
b  ccsd+T  (4)  [33]. 
c  scf  [16]. 
d  MCSCF  [36]. 
eMBPT(4)  [37]. 
fCCSD<T)  [29]. 

The  dipole  polarizability  a  has  been  calculated  many  times  at  different  levels  of 
sophistication.  The  BM  paper  [16],  the  review  [3],  and  a  recent  study  [29]  can 
serve  as  sources  of  references.  Our  scf  result  for  the  parallel  component  agrees  very 
well  with  the  nhf  value  [31]  while  the  perpendicular  component  is  very  close  to 
the  B8  value  of  BM:  4.48 1  a.u.  The  CCSDT-1  results  compare  very  well  to  the  values 
obtained  by  Sekino  and  Bartlett  [33]  with  the  CCSD+T(4)  method  and  the  extended 
basis  set  [5s3p4d2f/5s3p].  Electron  correlation  contributes  1 1%  to  the  total  value 
of  the  parallel  component  and  8%  for  the  perpendicular  case.  The  average  value  of 
a  is  close  to  the  experimental  value  of  5.52  a.u.  [34]  or  5.60  a.u.  [  35] .  The  correlated 
anisotropy  Aa  of  1.07  or  1.08  a.u.  does  not  agree  that  well  with  experiment  1.32 
a.u.  [  34  ]  and  like  recent  results  by  SB  [29] :  1.10  a.u.  is  slightly  different  from  some 
previous  correlated  calculations,  e.g.,  the  cepa  value  of  1.27  a.u.  obtained,  however, 
with  a  larger  basis  set  [35].  The  error  in  the  anisotropy  of  a  indicates  a  delicate 
imbalance  in  treating  z  and  x;  components  by  the  POL  basis  set.  The  diagonal  hy¬ 
perpolarizability  pzzz  is  below  the  nhf  value  of -8. 3  a.u.  and  recent  ccsd(T)  results 
[29]  obtained  with  the  original  and  augmented  POL  basis  set:  -8.14  to  8.40  a.u., 
but  very  close  to  -9.1  a.u.  B8  basis  results  [16].  The  correlated  results  are  in  good 
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agreement  with  previous  CC  calculations  -9.838  (CCSD  +  T(4))  [33]  and  -9.62 
a.u.  for  the  more  accurate  ccsd(T)  method  [29]  with  an  extended  POL  basis  set. 
The  zxx  component  is  more  difficult  to  be  determined,  scf  calculations  yielded 
0.005  a.u.  while  SB  reported  the  range  from  —0.02  to  —0.28  a.u.  Among  eight  results 
of  BM,  the  range  is  very  broad  from  - 1 .4  to  +0.2  a.u. 

The  basis  sets  effects  are  manifested  even  more  at  the  CC  level.  Our  ccsdt-1 
values  agree  well  with  the  older  CCSD  +  T (4)  results  of —0.802  a.u.,  while  the  most 
recent  study  [29]  reported  —1.27  a.u.,  i.e.,  almost  twice  our  value.  One  should 
notice  the  difference  in  handling  triple  excitations  in  the  CCSD(T)  model  (nonit- 
eratively)  and  in  ccsdt-1,  fully  iterative  but  in  an  approximate  way.  These  differ¬ 
ences  and  usage  of  slightly  different  basis  sets  lead  to  differences  in  the  final  value 
of  this  component  reported  in  the  present  study  and  in  [29].  The  values  of  /? 
defined  for  a  linear  molecule  as 


0  =  I  (fez  +  202XX)  (18) 

is  fortuitously  close  to  the  SCF  value  of  SB  [29]  obtained  with  an  extended  POL 
basis  and  higher  than  any  of  the  eight  values  reported  in  [16].  The  correlated  j6 
value  is  close  to  ccsd(T):  -7.30  [29],  again  obtained  with  a  larger  basis  set.  The 
dipole-quadrupole  polarizability  A,  quadrupole  polarizability  C,  and  dipole- 
octopole  polarizability  tensor  E  are  all  second-order  electric  tensors.  The  knowledge 
of  these  tensors  can  become  important  in  considering  ion-molecule  interactions 
where  the  electric  field  created  by  an  ion  is  not  homogeneous.  The  dipole- 
quadrupole  tensor  A  was  determined  from  the  perturbed  quadrupole  moments. 
The  z,zz  component,  3.98  a.u.,  is  very  close  to  the  best  bm  value  of  3.986  a.u. 
While  the  CCSDT-1  values  can  be  compared  to  the  MCSCF  result  [36]:  4.508  a.u. 
The  permanent  Sxz  moment  is  zero  so  the  Ax,zx  must  be  obtained  from  a  rather 
weak  induced  quadrupole  moment.  Clearly,  higher  angular  momentum  and  diffuse 
functions  are  needed  to  describe  situations  like  this.  Therefore,  there  is  almost  a 
20%  discrepancy  between  our  SCF  value  and  the  B8  result  from  [16] .  The  correlation 
contribution  amounts  to  ~30%  for  this  component,  and  its  value  is  almost  two 
times  larger  than  mcscf  value  [36] .  For  the  quadrupole  polarizability  C,  the  zz,zz 
component  is  in  good  agreement  with  the  bm  value.  Electron  correlation  adds 
about  15%  raising  this  value  to  8.15  a.u.,  in  reasonably  good  agreement  with  the 
MBPT(4)  result  of  Diercksen  et  al.  [37]:  8.63  a.u..  The  perpendicular  component 
differs  more  from  the  bm  value  while  ccsdt-1  disagree  with  the  mbpt(4)  value. 

For  the  “mixed”  xz,xz  component  the  pertinent  quadrupole  moment  is  zero; 
therefore  again  our  results  disagree  with  the  B8  BM  value,  but  is  surprisingly  close 
to  the  rather  extended  B7  basis  set  result  of  3.29  a.u.  (80  contracted  GTOs).  The  / 
functions  are  necessary  to  describe  properly  the  xz  components.  A  similar  situation 
occurs  for  the  dipole-octopole  polarizability  tensor  E.  The  diagonal  EZ)ZZZ  agrees 
well  with  the  SCF  results  of  bm  while  the  difference  for  the  component  is 

larger.  The  situation  can  be  attributed  to  the  POL  basis  deficiency.  Electron  corre¬ 
lation  contributes  about  50%  to  the  total  value  of  the  EZtZZZ  tensor,  ccsdt-1  results 
for  EXtXXX  show  also  a  significant  role  of  electron  correlation,  but  the  basis  set  lim- 
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itations  make  our  values  rather  approximate.  The  dipole-dipole-quadrupole  hy¬ 
perpolarizability  B  is  a  third-order  property  that  can  be  formally  defined  as 

d  -(  d'E  \  (19) 

Our  scf  results  are  in  general  agreement  with  the  BM  SCF  values.  Electron  cor¬ 
relation  contributes:  18%*  30%,  37%,  and  36%  for  zz,zz,  xz,zz,  xx,zz,  and  xx,xx 
components,  respectively.  Unlike  the  previous  four  tensors,  y  has  attracted  a  lot 
of  attention.  A  recent  study  [29]  contains  a  discussion  of  the  relation  of  y  to  fre¬ 
quency-dependent  experiments.  The  SCF  result  for  the  zzzz  component  is  close  to 
the  B8  basis  BM  value.  The  xxxx  component  is  alfected  more  by  the  quality  of  the 
basis  set  and  the  difference  between  our  value  and  the  BM  and  SB  (with  expanded 
POL  basis)  is  40  and  50  a.u.,  respectively.  The  correlated  results  for  zzzz  and  xxzz 
are  close  to  the  CCSD  +  T(4)  results  of  Sekino  and  Bartlett  [33]  obtained  with  a 
larger  basis  set.  The  largest  discrepancy  occurs  for  the  xxxx  component;  597  versus 
6  50  a.u.  by  SB  [  29  ]  and  can  be  attributed  to  the  basis  set  deficiency  in  the  x  direction. 
Despite  the  modest  basis  set  employed  in  our  calculations  the  CCSDT-1  density 
matrix  results  based  on  perturbed  correlated  moments  compare  very  well  with  all 
the  standard  finite  field  highly  correlated  calculations  available. 

H20:  Complete  SCF  calculations  of  the  electric  polarizability  tensors  were  per¬ 
formed  by  Bishop  and  Pipin  (bp)  [17].  We  have  used  the  POL  basis  set  with  un¬ 
contracted  d  functions  on  O  and  p  on  the  H  atom;  the  contracted  set  [5s3p3d/ 
3s3d]  of  56  functions.  The  C2  axis  coincides  with  the  z  axis  and  the  molecule  lies 
in  the  xz  plane.  The  multipole  moments  are  presented  in  Table  III  and  compared 
to  experimental  values.  Electron  correlation  calculated  by  the  CCSDT-1  method 
changes  the  value  of  ^  by  about  9%  in  perfect  agreement  with  experiment,  while 
the  correlation  contribution  to  the  quadrupole  moment  is  small  except  for  the  zz 
component  where  it  amounts  to  -19%.  Second-  and  third-order  electric  properties 
are  displayed  in  Table  IV. 

The  values  of  the  dipole  polarizability  tensor  a  agree  very  well  with  the  SCF  results 
of  Werner  and  Meyer  [35]  who  employed  an  extended,  uncontracted  (Ils6p3d/ 
5s2p)  carefully  optimized  basis  set:  9.04,  7.93,  and  8.47  a.u.  for  the  xx,  yy,  and 
zz.  Electron  correlation  improves  the  results:  The  ccsdt-1  average  polarizability 
a  perfectly  matches  experimental  values  9.82  [35]  and  9.81  [38].  As  for  the  HF 

Table  III.  Electric  moments  of  the  H20  molecule  (in  a.u.). 


Moment  SCF  CCSDT-1  CCSDT-1  Experiment 


0.793  0.725  0.725  0.7268a 

1.921  1.919  1.918  1.96  ±  0.02b 

-1.823  -1.837  -1.835  -1.86  ±0.02b 

-0.099  -0.083  -0.083  -0.10  ±0.02b 


a  Taken  from  [46]. 
b  Taken  from  [47]. 
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Table  IV.  Electric  tensors  for  the  H2Q  molecule. 


Tensor 

SCF 

CCSDT-1 

(A) 

CCSDT-1 

(B) 

Other  scf 

Exp.  or  correlated 

OCxx 

9.10 

10.05 

10.06 

9.24a 

9.8  lb,  9.87c,  9.64e 

ayy 

7.91 

9.71 

9.70 

7.9  la 

9.59b,  9.30c,  10.02® 

azz 

8.42 

9.71 

9.71 

8.55a 

9.64b,  9.54c,  9.73® 

a 

8.48 

9.82 

9.82 

8.57 

9.68b,  9.79® 

A  a 

1.03 

0.34 

0.35 

1.15* 

0.34® 

ftzxx 

-9.45 

-11.2 

-11.3 

— 7.87a 

-10.0®,  -6.2® 

fizyy 

-0.39 

-4.13 

-4.41 

1.41a 

-3.7®,  -10.2® 

fez 

-6.03 

-11.2 

-11.4 

— 4.68a 

-9.2®,  -13.7® 

0 

-9.52 

-15.9 

-16.3 

— 6.68a 

-18.0® 

■^XyXX 

3.21 

3.55 

3.55 

6.27a 

6.742d 

■dy,zy 

0.10 

0.44 

0.44 

1.40a 

1.786d 

■dz.zz 

2.30 

2.31 

2.30 

1.62a 

2.54f 

4z,zz 

2.29 

2.28 

2.29 

1.68a 

2.194d,  2.44f 

Cxx,xx 

11.6 

13.4 

13.4 

1 1.78a 

1 1.65f 

Cyy,yy 

11.5 

14.0 

14.0 

12.02a 

12.58f 

C2Zy2z 

9.10 

10.6 

10.6 

10.10a 

8.76f 

Cxyjcy 

5.45 

6.91 

6.90 

8.48a 

8.48f 

Cxzjcz 

7.07 

8.28 

8.29 

1 1.06a 

1 1.33f 

c 

^ yztyz 

4.47 

6.02 

5.96 

4.72a 

4.41f 

BxXyXX 

-81.5 

-94.9 

-94.5 

-76.613 

Bxxjz 

36.9 

50.2 

49.5 

39.14s 

Byy\xx 

90.3 

144.7 

144.3 

234.06a 

Byy,zz 

66.7 

109.8 

109.1 

67.99a 

Bzz,zz 

54.9 

82.8 

82.3 

55.78® 

BZ2yZZ 

-94.8 

-133 

-132 

-90.91® 

BXyjcy 

-57.1 

-91.6 

-91.1 

-83.61® 

Bxz>xz 

-65.0 

-88.4 

-88.2 

-87.11® 

BZy,yz 

-54.8 

-86.5 

-86.0 

-80.54® 

a  SCF  [17]. 
b  Taken  from  [35]. 
c  MBPT-SDQ  (4)  [45]. 
d  CISD  [48]. 
e  CCSD(T)  [29]. 
fMCSCF  [17]. 


molecule  the  difference  between  ccsdt-1  results  correct  through  the  second-order 
in  the  wave  function  and  correct  through  the  full  fourth-order  in  the  corresponding 
density  matrix  are  negligible.  The  recent  CCSD(T)  calculations  employing  an  aug¬ 
mented  POL  basis  yielded  9.79  a.u.  for  the  average  value  of  a  [29].  The  anisotropy 
of  a  defined  as: 

Aa  =  y=  [( otxx  -  Oiyy)2  +  (oLyy  -  azz)2  +  ( azz  -  axx)2]l/2  (20) 

and  reflecting  the  balanced  description  of  all  components  is  very  sensitive  to  basis 
set  effects.  Our  correlated  A  a  agree  with  the  SB  [29]  value  of  0.34  a.u.  and  mbpt 
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(4)  0.32  a.u.  [39]  and  are  off  the  experimental  estimate  by  a  factor  of  2.  Lazzeretti 
and  Zanasi  [40]  reported  a  broad  range  of  &zyy  tensor  values  from  -0.299  to  1.098 
a.u.  and  Maroulis  [39]  constructed  five  optimized  basis  sets  and  obtained  values 
from  —1.46  to  —131  a.u.  so  that  it  may  be  concluded  that  our  SCF  value  is  most 
probably  off  the  HF  limit  by  a  factor  of  about  3.  Our  SCF  value  for  the  xxz  component 
fits  well  into  the  range  of  previously  obtained  results:  -7.87  [17],  -9.5  to  -10.6 
[40],  and  -9.53  to  -9.35  a.u.  [39].  The  zzz  component  is  also  consistent  with 
previous  results,  and,  again,  the  range  of  results  is  rather  broad:  -3.574  a.u.  [38], 
-4.68  [17],  -8.11  to  -7.44  a.u.  [39]. 

To  conclude  the  discussion  of  the  SCF  results,  it  may  be  interesting  to  compare 
Derivative  HF  (DHF  [3])  calculations  with  the  contracted  elp  (electric  properties) 
basis  set  [3,36].  The  (3  components  read:  -0.5445,  -10.029,  and  -5.4715  a.u.  for 
the  zyy ,  zxx,  and  zzz,  respectively,  all  in  a  good  agreement  with  our  results.  (3  is 
not  as  sensitive  to  the  choise  of  basis  set  as  some  of  its  components.  Our  (3  equals 
-9.52  a.u.  and  agrees  very  well  with  DHF  elp  calculations,  -9.63  a.u.,  and  Maroulis 
[39]  reported  -11.15  to  -10.86  a.u.  and  BP  obtained  -6.68  a.u..  The  CCSDT-1 
calculations  give  results  fairly  close  to  the  previous  mbpt-SDQ(4)  [45  ]  with  [6s5p4d/ 
4s2p]:  -13.7  a.u.  [12]  and  CCSDT  +  T(4)  [33]  values.  The  recent  accurate  CCSD(T) 
calculations  [29]  were  performed  with  the  POL  basis  augmented  with  lone-pair 
functions.  Our  CCSDT-1  0  agree  well  with  —18.0  a.u.  of  SB  [29]  and  lies  within 
rather  broad  experimental  limits  quoted  in  [  29  ] ,  —22  ±  6  a.u.  The  difference  between 
the  CCSDT- 1  results  is  not  negligible  reflecting  the  importance  of  the  fourth  order 
component  <0 1  T\T\aHT2 1 0)  for  the  hyperpolarizability  ff.  The  A  tensor  was  pre¬ 
viously  obtained  using  the  DHF  method  with  the  elp  basis  [3],  and  bp,  who  also 
performed  limited  MCSCF  calculations.  The  value  of  our  x,zx  component  is  in  fair 
agreement  with  the  DHF  result  (4.0176  a.u.),  but  almost  half  of  the  corresponding 
value  reported  by  bp  (6.27  a.u.).  This  difference  can  be  attributed  to  basis  set 
limitations.  The  POL  basis  set  is  simply  not  flexible  enough  to  describe  properly 
the  very  small  induced  component  of  the  quadrupole  ©xz,  which  is  zero  by  symmetry 
for  the  unperturbed  H20  molecule. 

The  same  effect  manifests  itself  even  more  for  the  y,yz  component  for  which 
both  the  SCF  and  the  CCSDT-1  results  are  by  an  order  of  magnitude  smaller  than 
the  SCF  results  of  bp.  The  z,xx  component  is  on  the  contrary  larger  than  the  values 
of  bp.  Our  SCF  value  for  the  z,zz  component  lies  between  the  bp  (1.68)  and  the 
dhf  result  (2.7406  a.u.).  The  electron  contribution  as  given  by  the  CCSDT-1  com¬ 
putations  is  negligible  for  the  z,xx  and  z,zz  components,  and  it  is  a  moderate 
— 10%  for  the  AXjZX.  For  the  small  Ay>zy  component  correlation  changes  the  value 
by  a  factor  of  4.  One  can  notice  that,  although  our  z,xx  and  z,zz  results  are  different 
from  the  corresponding  results  of  bp,  we  observe  the  same  approximate  equality 
of  these  two  components.  This  internal  consistency  shows  that  the  discrepancy  for 
the  y,zy  as  well  as  other  smaller  differences  are  indeed  due  to  basis  set  limitations. 
The  few  existing  SCF  results  suggest  that  only  the  z,xx  and  z,zz  components  are 
probably  fairly  well  predicted.  Electron  correlation  seems  to  be  less  important 
than  the  use  of  good  quality  basis  sets.  There  are  six  independent  components  of 
the  second-order  quadrupole  polarizability  tensor  C.  For  all  pure  components, 
our  SCF  results  are  in  very  good  agreement  with  the  results  of  bp  but  DHF  values 
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are  systematically  higher:  12.8394,  13.5368,  and  11.766  a.u.  for  xx,yy  and  zz,zz, 
respectively.  The  correlation  amounts  to  — 13%,  18%,  and  14%  for  the  xx,xx, 
yy,yy,  and  zz,zz  components.  The  “mixed”  components  are  all  smaller  than  the 
corresponding  results  of  BP.  The  inflexibility  of  the  basis  set  can  be  blamed  for 
these  differences.  The  inclusion  of  electron  correlation  improves  our  results,  but 
they  are  still  probably  too  small.  The  DHF  calculations  yield  4.010  a.u.  for  the  yz,yz 
4.776  a.u.  for  the  CxytXy  and  7.014  a.u.  for  the  xz,xz  component.  All  results  are 
smaller  than  our  scf. 

To  the  best  of  the  author’s  knowledge,  the  dipole-dipole-quadrupole  hyperpo¬ 
larizability  B  tensor  has  not  been  calculated  at  any  correlated  level.  Only  bp  scf 
results  can  be  compared  to  our  results.  Dykstra  uses  a  different  sign  convention  in 
his  calculations,  thus  obscuring  the  comparison.  There  are  nine  independent  com¬ 
ponents  of  the  B  tensor  (Dykstra  reports  12  values).  Our  scf  results  are  consistent 
with  the  Bishop  and  Pipin  results.  The  xx,xx,  zz,zz,  xx,zz,  zz,xx,  and  yy,zz 
components  agree  very  well  with  the  Bishop  and  Pipin  values.  Again,  the  negative 
“mixed”  components  are  more  difficult  to  determine.  Our  SCF  values  are  higher 
than  Bishop  and  Pipin  by  approximately  30%.  The  correlation  always  lowers  the 
results  beyond  the  Bishop  and  Pipin  SCF  results.  In  conclusion  we  may  say  that 
five  “pure”  components  are  probably  accurately  predicted  and  that  the  three 
“mixed”  components  of  the  B  tensor  are  fairly  well  described.  The  yy,xx  component 
is  an  exception.  It  depends  critically  on  the  basis  set  quality  and  our  SCF  results 
amount  to  only  ~40%  of  the  previously  reported  value.  The  electron  contribution 
is  also  substantial  so  that  making  any  predictions  about  this  component  is  not 
reliable  at  this  point. 

F“:  Despite  the  apparent  simplicity  of  the  closed-shell  10-electron  system,  electric 
properties  of  this  anion  are  extremely  difficult  to  calculate  correctly.  The  reason  is 
the  diffuse  electronic  charge  distribution  requiring  extended  basis  sets  and  a  large 
portion  of  the  electron  correlation  energy.  For  any  spherical  system  in  its  ground 
state  all  permanent  multipoles  are  zero  and  calculations  of  the  nonzero  even  po¬ 
larizabilities  a,  7,  C,  and  B  must  rely  on  a  proper  description  of  small  induced 
moments.  Requirements  for  a  basis  set  led  us  to  the  ELP  basis  set  of  Dykstra  [3,41] 
which  consists  of  the  contracted  [6s5p3d]  set,  denoted  as  basis  II.  I  is  the  pol 
basis.  We  have  also  employed  a  version  of  the  elp  augmented  by  one  /  function 
with  the  exponent  taken  from  [42],  basis  III.  Table  V  summarizes  our  results  and 
shows  some  literature  values.  The  SCF  calculations  of  Maroulis  and  Bishop  [42] 
were  performed  with  two  extended,  optimized  sets  of  67  contracted  or  87 
[17sl0p5dlf]  uncontracted  functions,  cisd  calculations  of  Diercksen  and  Sadlej 
[43  ]  used  a  contracted  [12s8p5d]  set  derived  from  a  set  used  by  Werner  and  Meyer 
[35]  in  their  fluoride  atom  calculations.  All  these  bases  are  much  larger,  and  what 
is  more  important,  are  individually  optimized  for  the  fluoride  anion.  Nevertheless, 
as  can  be  seen  from  Table  V  our  SCF  results  for  a  and  y  compare  well  with  the 
contracted  set  results  of  [42], 

However,  tensors  C  and  B  proved  rather  sensitive  to  the  quality  of  basis  set,  and 
our  scf  results  differ  from  the  uncontracted  Maroulis  and  Bishop  values  by  40% 
and  15%,  respectively.  Electron  correlation  is  very  important  for  all  tensors.  For 
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Table  V.  Electric  polarizabilities  of  the  fluoride  anion  (a.u.). 


Tensor 

Basis 

SCF 

CCSDT-1 

(A) 

CCSDT-1 

(B) 

Other  scf 

Exp.  or 
correlated 

a 

I 

10.21 

16.27 

16.26 

10.66a 

18.99c 

a 

II 

10.39 

18.82 

18.82 

a 

III 

10.40 

18.84 

18.84 

C 

I 

10.29 

15.09 

15.04 

17.35b 

C 

II 

12.91 

32.23 

32.05 

23.46b 

c 

III 

16.82 

36.22 

36.05 

B 

I 

-275.3 

-643.8 

-638.2 

-47  lb 

B 

II 

-422.5 

-2,157 

-2,140 

-552b 

B 

III 

-441.2 

-2,124 

-2,108 

23, 100d 

y 

2,638 

9,063 

9,110 

y 

II 

10,030 

103,000 

102,700 

10,900b 

79,500e 

y 

III 

10,120 

99,700 

99,450 

12,900b 

a  NHF  [49]. 
b  scf  [42]. 
c  CCSD  +  T  (4)  [50], 
d  CISD  [43]. 
e  MBPT  (4)  [44], 

the  augmented  ELP  basis  set  (basis  III)  it  amounts  to  ~45%  for  the  dipole  polar¬ 
izability,  53%  for  the  quadrupole  polarizability,  and  almost  80%  for  the  dipole- 
dipole-quadrupole  hyperpolarizability  B .  The  second  dipole  hyperpolarizability  y 
is  known  to  be  difficult  to  determine  by  numerical  differentiation.  The  charge  per¬ 
turbation  method  allows  using  the  perturbed  dipole  moment  and  thus  may  be 
numerically  more  stable.  Basis  elp  and  III  yield  results  which  are  reasonably  close 
to  the  estimated  hf  limit  of  13,000  a.u.  The  correlation  contribution  is  enormous 
even  by  anionic  standards.  The  ccsdt-1  values  are  about  nine  times  larger  than 
the  corresponding  SCF  values.  The  mbpt(4)  results  with  a  larger  basis  set  [44] 
yielded  79,500  a.u.,  in  fair  agreement  with  our  results. 

Conclusions 

We  have  combined  the  coupled  cluster  method  (the  ccsdt-1  model)  to  calculate 
electric  multipole  moments  as  expectation  values,  with  the  charge  perturbation 
version  of  the  finite  field  method  to  calculate  electric  response  properties.  This 
approach  has  been  applied  to  determine  the  electric  properties  of  the  small  systems: 
HF,  H20,  and  F”.  The  approach  has  been  shown  to  provide  values  of  the  higher- 
order  static  polarizabilities  associated  with  the  higher  multipole  moments:  the 
quadrupole  polarizability  C,  dipole-quadrupole  polarizability  A ,  dipole-octopole 
polarizability  E ,  and  hyperpolarizabilities  ft  B,  y  in  an  effective  way.  The  relatively 
modest  basis  sets,  not  flexible  enough  to  yield  accurate  results  for  hyperpolariza¬ 
bilities,  has  nevertheless  given  accurate  results  for  the  multipoles  and  the  dipole 
polarizability  a,  while  providing  good  estimates  for  other  tensors.  The  fluoride 
anion  requires  an  extended  basis  set  to  obtain  accurate  electric  properties,  especially 
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since  our  results  point  to  very  significant  electron  correlation  effects  in  the  hyper¬ 
polarizabilities  B  and  y. 
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Abstract 

The  field  of  nonlinear  optics  in  molecules  and  polymers  offers  challenging  opportunities  for  theorists 
on  both  molecular  modeling  of  new  structures  with  enhanced  microscopic  response  and  manifestations 
of  bulk  effect  in  nonlinearity.  This  article  presents  an  experimental  point  of  view  in  assessing  the  current 
status  of  various  theoretical  approaches.  Experimental  results  are  presented  for  two  model  compounds 
together  with  predictions  by  various  theoretical  models  to  discuss  issues  and  future  opportunities.  Three 
different  types  of  nonlinear  optical  processes  are  discussed:  (i)  second  order,  (ii)  third  order,  and  (iii) 
photorefractivity.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Nonlinear  optical  (nlo)  effects  form  the  backbone  of  photonics,  a  rapidly  growing 
technology  that  utilizes  photons  to  transmit,  process,  and  store  information  [1]. 
The  largest  immediate  application  of  photonics  is  in  the  telecommunication  industry 
for  fiber-optic  communication.  Here,  NLO  effects  will  be  used  to  perform  the  func¬ 
tions  of  optical  switching,  optical  data  storage  and  processing  of  information  by 
wavelength  multiplexing,  and  modulating  the  phase  as  well  as  frequencies  of  the 
optical  signal.  The  real  challenge  facing  the  industrial  application  of  photonic  tech¬ 
nology  is  the  search  for  appropriate  materials,  which,  in  addition  to  being  highly 
NLO  active,  must  be  easily  processible,  mechanically  and  thermally  stable,  and  most 
importantly,  economically  viable  to  compete  with  the  commonly  used  electronic 
materials.  One  such  class  of  materials  which  fulfill,  to  a  large  extent,  these  require¬ 
ments  is  formed  by  organic  molecules  and  polymers  and  constitute  the  topic  of 
this  article. 

In  the  past  several  years,  extensive  experimental  as  well  as  theoretical  investigations 
have  been  performed  at  the  Photonics  Research  Laboratory  [2-12]  and  by  other 
groups*  to  establish  structure-NLO  property  relationship  and  to  identify  techno¬ 
logically  important  organic  and  polymeric  materials.  Such  investigations  have  tre¬ 
mendously  improved  our  understanding  of  the  mechanism  and  origin  of  optical 


*  A  recent  issue  (February  1994)  of  the  Chem.  Rev.  contains  several  excellent  reviews  of  the  experimental 
and  theoretical  investigations  of  NLO  properties  of  organic  molecules  and  polymers. 


International  Journal  of  Quantum  Chemistry:  Quantum  Chemistry  Symposium  28,  395-410  (1994) 

©  1994  John  Wiley  &  Sons,  Inc.  CCC  0360-8832/94/010395-16 
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nonlinearity  in  organics  and  helped  identify  crucial  structural  factors  to  control 
and  optimize  material  nlo  response.  However,  there  still  remain  a  number  of 
outstanding  issues,  particularly  of  correlation  between  experiment  and  theory  and 
that  among  various  theoretical  methods,  understanding  and  establishing  a  definite 
relationship  between  the  microscopic  (molecular)  and  the  macroscopic  (bulk)  nlo 
response,  and  finally  applying  the  available  knowledge  of  structure-property  rela¬ 
tionship  to  model  new  materials  that  invite  further  investigation.  In  order  to  highlight 
some  of  these  issues,  we  report  here  the  results  of  the  experimental  characterization 
and  quantum  chemical  calculations  performed  in  our  laboratory  on  the  second- 
and  third-order  nlo  properties  of  two  potentially  important  organic  molecules.  We 
also  briefly  describe  the  structural  requirements  and  our  investigation  of  organic 
photorefractive  materials.  The  photorefractivity  in  molecular  system  is  a  new  field 
and  offers  challenging  opportunities  as  the  effect  is  produced  by  the  combined 
action  of  both  electrical  and  nonlinear  optical  properties  of  a  material. 

In  what  follows,  a  brief  account  of  the  underlying  theory  of  NLO  processes  in 
molecular  systems  is  presented.  The  various  experimental  and  theoretical  approaches 
for  the  study  of  nonlinear  optical  response  of  molecules  are  then  summarized,  and 
the  results  of  our  experimental  and  theoretical  studies  of  second-  and  third-order 
NLO  properties  of  2-nitro-l-( 2-thienyl )ethene  (NTE)  and  4-nitro-l-( 2-thienyl)- 1,3- 
butadiene  (ntb)  (Fig.  1 )  are  presented  with  a  view  to  compare  different  theoretical 
techniques.  An  attempt  is  made  to  identify  the  issues  and  opportunities  for  theo¬ 
retical  modeling  of  second-  and  third-order  nonlinearities.  The  final  section  discusses 
the  photorefractive  effect  in  polymeric  composite  systems,  again  identifying  some 
of  the  relevant  issues  and  opportunities  in  this  field. 

Theory  of  the  nlo  Processes  in  Molecular  Systems 

The  NLO  effects  are  best  described  as  a  manifestation  of  the  response  of  materials 
electron  cloud  to  an  external  electric  field,  E(r,t).  This  electric  field,  E(r,t),  can 
be  taken  in  a  general  sense  so  that  it  can  be  a  low-frequency  (hv  -►  0)  field,  such 
as  that  of  a  conventional  dc-electric  field,  or  a  very  high-frequency  oscillating  field 
derived  from  a  short  laser  pulse.  The  material  response  to  the  external  electric  field 
can  be  described  in  terms  of  a  generalized  polarization  P(E),  which  is  proportional 
to  the  strength  of  the  electric  field  if  the  disturbance  caused  by  it  in  the  material 
electronic  environment  is  small  and  can  be  written  as 

P(E)  =  X-E(r,t)  (1) 

Here  X  is  the  linear  susceptibility  of  the  system.  If,  however,  the  materials  electronic 
environment  experiences  appreciable  changes,  the  linear  dependence  of  P(E)  on 
E(r,t)  [Eq.  ( 1 )]  is  no  longer  valid.  In  such  cases,  assuming  that  the  force  on  an  7V- 
particle  system  exerted  by  the  external  electric  field  is  still  smaller  than  the  inter¬ 
particle  forces,  the  generalized  polarization,  P(E),  can  be  expressed  as  a  power 
series  of  E{r,t)  as, 

P(E)  =  X(1)  •  E(rj)  +  X(2) :  E(r,t)E(r,t) 

+  X (3) :  E(r,t)E(r,t)E(r,t)  +  •  •  •  (2) 


MOLECULES  AND  POLYMERS 


397 


where  now,  P(E)  does  not  only  depend  on  E(rJ)  but  also  on  its  higher  powers. 
The  quantities  X {n)  in  Eq.  (2)  represent  the  nth  order  susceptibility  and  are  tensor 
quantities  of  rank  (n  +  1).  It  is  the  nonlinear  susceptibility  terms  X(2),  X(3),  . . . 
which  constitute  the  topic  of  interest  in  this  article.  It  is  trivial  to  show  that  for 
systems  possessing  a  center  of  inversion,  the  even  order  susceptibility  tensors  (X (2), 
X (4),  etc.)  vanish  and  though  the  X {n)  has  3"  elements,  not  all  of  them  are  nonzero 
and  independent,  i.e.,  in  practice,  only  a  few  elements  need  to  be  evaluated  inde¬ 
pendently  to  get  the  pertinent  information. 

A  similar  expression  as  Eq.  (2)  can  be  used  to  describe  the  polarization  of  an 
atom  or  a  molecule,  written  as, 

p(E)  =  a-E(r,t)  +  13  :  E(r,t)E(r,t)  +  y  \  E(r,t)E(r,t)E(r,t)  +  •  •  *  (3) 

where  a,  7,  .  . .  are  now  the  first-order,  second-order,  third-order,  etc.  polariza¬ 
bilities  and  are  tensor  quantities  of  ranks  2,  3,  4,  ... ,  respectively.  The  higher- 
order  polarizabilities  are  also  called  hyperpolarizabilities,  for  example,  0  is  known 
as  the  first-hyperpolarizability,  7  is  known  as  the  second-hyperpolarizability,  and 
so  on.  Of  course,  for  atoms  and  centrosymmetric  molecules,  the  even-order  polar¬ 
izabilities  (for  example,  jS)  vanish. 

If  the  physical  state  of  a  material  is  such  that  the  interparticle  interaction  is 
insignificant,  as  is  the  case,  for  example,  for  gases  under  extremely  low  pressure, 
the  macroscopic  susceptibilities  (X(n))  and  the  microscopic  polarizabilities  (a,  0, 
7)  are  equal.  However,  in  other  phases,  e.g.,  solids  or  liquids,  where  the  electronic 
environment  is  substantially  different  from  that  of  the  isolated  systems  (atoms/ 
molecules),  there  is  no  comfortable  way  to  relate  the  microscopic  and  macroscopic 
nlo  properties.  An  intermediate  case  is  those  of  the  “molecular  materials,”  which 
include  most  organic  compounds  and  where,  due  to  weak  intermolecular  interaction, 
the  bulk  properties  can  be  considered  as  the  geometrical  superposition  of  the  cor¬ 
responding  microscopic  properties.  This  very  property  of  the  organic  materials 
constitutes  the  motivation  for  the  experimental  as  well  as  theoretical  investigation 
of  their  molecular  nlo  property  and  gives  hope  that  by  understanding  and  modifying 
the  microscopic  nonlinearity,  it  would  be  possible  to  ultimately  control  and  tailor 
the  macroscopic  NLO  behavior  as  required  for  a  particular  technological  application. 

Let  N  represent  the  number  per  unit  volume  of  an  organic  molecule  in  a  dilute 
solution  or  a  molecular  crystal,  then  its  bulk  susceptibilities  (assuming  an  uni- 
oriented  system)  can  be  approximated  as: 


x(l)  =  f(  i)Na 

(3a) 

X<2>  =  /(2)#/? 

(3b) 

x(3)  =  y<3)  Ny 

(3c) 

where  f{n\  popularly  known  as  the  “local  field  factor,”  accounts  for  the  overall 
effect  of  the  intermolecular  interaction  in  the  presence  of  the  external  electric  field. 
Although  the  set  of  Eqs.  (3)  provide  a  convenient  means  to  obtain  the  values  of 
microscopic  nlo  properties  from  experimental  measurement  of  X {n)  or  vice  versa, 
in  practice  it  is  a  nontrivial  task  due  to  the  complicated  nature  and  often  a  complete 
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lack  of  the  accurate  knowledge  of f(n).  The /(n)  in  Eqs.  (3)  depend,  in  addition  to 
the  strength  and  the  frequencies  of  the  input  and  output  signals,  on  the  nature  and 
magnitude  of  the  intermolecular  interaction,  which  in  turn  depends  on  the  overall 
charge  distribution,  that  is  itself  a  function  of  the  spatial  and  electronic  structure 
of  the  system.  Unfortunately,  there  is  no  rigorous  theory  to  describe  the  local  field 
effect  in  nlo  materials  and  generally  an  approximate  estimate  of  the  /"  terms  is 
used  to  extract  the  microscopic  NLO  coefficients,  a,  (3,  7,  from  the  experimentally 
measured  X (n)  values.  Usually,  the  estimates  of  the  /(n)  values  are  made  either  by 
Onsager  formula, 


or  by  the  Lorentz  formula, 


f(n)  —  £°(£u  2) 

Jo  "(2  c0  +  ej 


f 


<«)  = 
mo> 


+  2 


3 


(4) 


(5) 


In  the  above  equations,  the  ea  and  emu  terms  represent  the  dielectric  constant  of  the 
medium  at  frequencies  o  and  moo.  Eq.  (4),  that  also  accounts  for  the  reorientation 
of  the  molecules  in  the  presence  of  dc-e lectric  field  ( hv  =  0),  is  used  for  low- 
frequency  electric  fields  (or  dc  field),  whereas  Eq.  (5)  is  used  for  an  optical  field 
with  the  angular  frequency  moo. 


Experimental  Measurements 

The  two  important  manifestations  of  second-order  nonlinearities  are:  (a)  the 
second  harmonic  generation  ( SHG ) ,  whereby  from  the  two  input  waves  of  angular 
frequency  00,  the  system  generates  a  third  wave  at  the  frequency  2oo.  The  associated 
second-order  nlo  coefficients  for  sgh  are  designated  as  x(— 2o >;  00,  00)  for  macro¬ 
scopic  nonlinearity  and  /3(  — 2co;  00,  00)  for  microscopic  nonlinearity,  (b)  The  linear 
electrooptic  effect,  (eoe)  also  known  as  the  Pockels  effect,  which  is  described  by 
the  phase  shift  of  an  optical  wave  with  angular  frequency  a;  by  a  low  frequency  or 
dc-electric  field  (00  =  0).  The  output  optical  wave  has  the  same  frequency  (w)  as 
the  input  wave.  The  associated  macroscopic  and  the  microscopic  NLO  coefficients 
for  linear  eoe  are  designated  by  X(— 00;  w,  o)  and  (3(  —  00;  00,  o),  respectively.  (The 
convention  for  using  the  frequency  arguments  is  such  that  the  quantity  to  the  left 
of  the  semicolon  represents  the  frequency  of  the  “output”  beam,  while  those  at  the 
right  are  the  frequencies  of  the  “input”  beams  and  the  sum  of  the  frequencies  of 
the  input  and  output  beams  is  zero.) 

The  experimental  measurements  of  the  molecular  second-order  NLO  coefficients 
for  organic  materials  are  conventionally  made  by  the  dc-electric  field  induced  second 
harmonic  (efish)  generation  in  solution.  This  technique  utilizes  the  orientation 
of  the  nonlinear  molecules  by  dc-electric  field  in  solution,  and  therefore,  one  obtains 
ix  •  £(— 2 o>;  00, 00),  which,  together  with  the  knowledge  of  the  dipole  moment,  fx ,  of 
the  system  yields  a  vector  quantity,  (3vec,  i.e.,  the  component  of  (3  tensor  along  the 
dipole  axis. 
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It  is  important  to  note  here  that  in  an  efish  measurement  (in  solution)  the  actual 
quantity  measured  is  the  intensity,  J2w,  of  the  second  harmonic  wave  normalized 
with  respect  to  a  certain  reference  (often  quartz).  Therefore,  the  accuracy  of  the 
subsequent  results  in  these  experiments  is  subject  to  the  accuracy  of  the  reference 
value.  The  intensity,  /2a>,  which  is  related  to  the  third-order  nlo  coefficients,  finally 
yields  x (3)  for  efish  process,  from  which  a  combination  of  the  relations  similar  to 
(3b)  and 


y( EFISH)  =  (ye(-2oo;  0,  co,  co))  + 


IL- ff(-2co;  co,  co) 
5kT 


(6) 


yields  the  vector.  In  the  above  equation,  ye(-2w;  o,  co,  co)  represents  the 
electronic  part  of  the  y  value  for  efish  process,  k  is  the  Boltzman  constant  and  T 
is  the  temperature.  It  is  clear  then  that  the  two  important  factors  influencing  the 
experimental  (3  values  obtained  by  EFISH  technique  are  the  (a)  local  field  factors 
and  (b)  reference  value  of  X (3)  (efish) . 

For  materials  having  small  dipole  moment  or  where  fi  makes  large  angle  with  0, 
a  more  suitable  method  to  obtain  (3  is  hyper-Rayleigh  scattering  (HRS).  Often  it  is 
useful  to  measure  (3  by  both  the  EFISH  and  HRS  techniques,  which  provides  a  means 
to  obtain  the  various  tensorial  components  of  /?. 

For  the  second-order  NLO  characterization  of  the  bulk  materials  such  as  solids 
or  thin  films,  one  often  uses  either  (a)  the  second  harmonic  generation  (SHG) 
technique,  in  which  the  SH  efficiency  of  the  material  is  obtained  relative  to  a  reference 
and,  therefore,  the  resultant  X(2)(-2co;  co,  co)  is  subject  to  the  assumption  of  the 
reference  data,  or  (b)  electrooptic  modulation,  whereby  the  shift  in  the  phase  of 
an  optical  wave  produced  by  a  dc  field  is  measured,  yielding  X (2*(  —  co; a >,o).  Unlike 
the  SHG  measurement,  the  electrooptic  measurement  yields  an  absolute  value  of 
X (2).  In  the  low  frequency  limit  of  the  optical  wave,  X  (2)(-2co;  co,  co)  and  X  (2)(- co; 
co,  o)  are  not  much  different  from  each  other  and  one  can  use  the  electrooptic 
modulation  to  complement  the  SHG  measurements.  However,  when  the  material 
displays  strong  dispersion,  as  happens,  for  example,  close  to  a  resonance  in  the 
system,  the  two  X (2)  values  [X  (2)(-2co;  co,  co)  and  X  (2)(-co;  co,  <?)]  will  be  substantially 
different  from  each  other. 

The  microscopic  third-order  nonlinearity  of  organic  materials  is  also  characterized 
through  the  measurement  of  the  macroscopic  NLO  coefficients,  X(3),  in  solution. 
One  method,  as  described  above,  is  the  efish  technique,  which  yields  y(-2<a\  o, 
co,  co).  The  other  methods  which  involve  technologically  important  nlo  effects 
allow  one  to  measure  optically  induced  phase  shift  or  changes  in  the  refractive 
index.  Two  such  methods  generally  used  to  characterize  y  values  are:  (a)  Degenerate 
four- wave  mixing  (dfwm),  which  allows  one  to  measure  7(— «;  co,  co,  -co)  by 
monitoring  the  interaction  of  three  input  beams  whereby  a  resulting  output  beam 
of  same  frequency  is  produced,  (b)  Optical  Kerr  gate  (OKG)  technique,  which 
allows  one  to  measure  the  changes  in  the  refractive  index  of  the  material  by  mon¬ 
itoring  the  phase  shift  of  an  optical  probe  pulse  in  the  presence  of  a  strong  pump 
pulse. 
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Once  again,  the  local  field  factor  plays  a  crucial  role  in  yielding  the  values  of  y 
from  measured  X (3).  The  refractive  index  change  measurements  yield  the  value  of 
X (3)  which,  in  addition  to  the  true  third-order  nlo  contribution  xi3)  that  has  elec¬ 
tronic  origin,  may  also  include  contributions  xi3)  which  we  call  “incoherent”  con¬ 
tribution.  The  incoherent  contribution  arises  from  a  change  in  the  linear  suscep¬ 
tibility,  X(1),  by  populating  the  excited  states  which  in  turn  changes  the  refractive 
index  of  the  system.  Thus,  it  is  important  to  separate  the  X{3)  and  X -3)  terms  in  a 
measurement  based  on  optically  induced  refractive  index  change. 

Theoretical  Calculations 

Quantum  chemical  calculations  of  molecular  nlo  properties  are  conveniently 
performed  by  the  use  of  the  perturbation  theory  [13].  The  two  commonly  used 
methods  are  (a)  the  uncoupled  perturbation  method  and  (b)  the  coupled  pertur¬ 
bation  method.  For  the  organic  systems,  both  the  uncoupled  and  the  coupled  meth¬ 
ods  have  been  used  with  varying  degrees  of  success.  The  uncoupled  method  leading 
to  the  popularly  known  sum-over-states  (SOS)  method  [14]  has  been  extensively 
used  with  semi-empirical  Hamiltonian  of  the  complete  neglect  of  the  diatomic 
differential  overlap  (CNDO)  type  or  one  of  its  variations,  such  as  the  intermediate 
neglect  of  the  diatomic  differential  overlap  (indo)  type  and  with  7r-electron  Ham¬ 
iltonian.1’  The  need  to  obtain  the  dipole  transition  moment  integrals,  (^jImI’Aj) 
between  various  electronic  states  \pi  and  \f/j  and  the  energy  spectrum  Ej  up  to  the 
continuum  in  the  SOS  method  prohibits  its  application  to  organic  systems  in  the 
framework  of  the  nonempirical  techniques. 

A  popular  form  of  the  coupled  method,  known  as  the  time-dependent  coupled 
perturbed  Hartree-Fock  (tdcphf)  method  [  15-18],  has  also  been  used  extensively 
to  investigate  molecular  nonlinearity  of  organic  molecules.  The  fact  that  the  per¬ 
turbed  quantities  in  this  method  are  derived  from  the  ground  state  wave  function 
alone,  makes  it  ideally  suited  for  nonempirical  as  well  as  semi-empirical  type  Ham¬ 
iltonians.  Indeed,  tdcphf  method  has  been  used  with  the  ab  initio  [  6-9, 1 1 , 1 2, 1 7  ] 
indo,  [10]  and  mindo  [19]  type  Hamiltonians  for  the  calculation  of  nlo  properties 
of  organic  molecules. 

The  single  most  important  factor  to  be  considered  in  the  theoretical  investigation 
of  molecular  nlo  properties  is  the  selection  of  the  basis  function.  Since  NLO  effects 
arise  from  the  perturbation  of  the  electronic  charge  density  by  an  external  electric 
field,  the  region  of  space  most  influenced  is  the  outer  valence  part  or  the ‘region 
with  loosely  bound  electron  charges.  Physical  intuition  then  suggests  that  it  is  ex¬ 
tremely  important  that  the  wave  function  should  be  able  to  provide  a  good  descrip¬ 
tion  of  the  outer  valence  or  the  most  easily  polarizable  regions  of  space.  Of  course, 
the  ability  of  a  molecular  wave  function  generated  by  the  linear  combination  of 
atomic  functions  to  describe  a  particular  region  of  space  will  depend  on  the  involved 
atomic  functions  or  the  basis  set  themselves.  It  is  well  known  in  quantum  chemistry 


f  A  thorough  review  of  the  application  of  the  SOS  method  with  semi-empirical  Hamiltonian  has  been 
given  by  Kanis  et  al.,  Chem.  Rev.  94,  195  (1994). 
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that  the  atomic  functions,  whether  the  Slater  type  or  the  Gaussian  type,  used  in 
molecular  calculations  that  are  obtained  from  the  self-consistent-field  solution  of 
the  Schrodinger  equation  for  the  atomic  ground  state,  are  unsuitable  to  describe 
the  extra  valence  region  of  space. 

In  molecular  calculations,  often  this  discrepancy  in  the  atomic  functions  is  com¬ 
pensated  by  including  additional  functions  with  appropriately  optimized  radial 
part.  For  calculations  involving  Gaussian  functions,  a  variety  of  such  functions 
with  “diffuse”  and/or  “semi  diffuse”  radial  part  and  appropriate  angular  momentum 
have  been  optimized  for  calculating  molecular  wave  function,  energy,  and  electronic 
and  optical  properties  [20].  However,  such  functions  are  not  readily  available  for 
molecular  calculations  involving  Slater  type  orbitals  (STO).  Thus,  almost  all  the 
SOS  calculations  of  molecular  nlo  properties,  which  employ  STO,  are  performed 
without  auxiliary  functions  that  may  be  essential  to  describe  the  “semi-diffuse”  and 
“diffuse”  states  involved  in  the  perturbation  expression. 

The  next  important  factor  in  the  theoretical  calculation  of  molecular  nlo  prop¬ 
erties  is  the  treatment  of  electron  correlation.  The  tdcphf  methods  developed  for 
the  closed  shell  system  [  16-18]  do  not  adequately  account  for  the  correlation  be¬ 
tween  electrons  of  anti-parallel  spins,  though  some  electron  correlation  is  included 
via  mixing  of  the  occupied  and  unoccupied  molecular  orbitals  (MO)  in  the  pertur¬ 
bation  expression. 

The  SOS  method  does  include  some  correlation  effect  via  configuration  interaction 
(ci).  However,  since  calculations  performed  by  SOS  method  essentially  employ  a 
“valence  only”  one-electron  basis  set,  it  is  difficult  to  understand  the  impact  of 
such  Cl  calculations.  It  is  well  documented  in  quantum  chemistry  literature  [  2 1 ,22  ] 
that  a  poor  one-electron  basis  may  lead  to  unpredictable  and  often  erroneous  many- 
electron  wave  functions. 

Linear  and  Nonlinear  Optical  Properties  of  NTE  and  ntb 

To  illustrate  the  relative  merits  of  different  theoretical  techniques  in  terms  of  the 
ease  in  their  application  and  accuracy  of  the  calculated  results  with  respect  to  the 
experimental  measurements,  we  have  selected  two  model  compounds  nte  and  NTB 
whose  structures  are  shown  in  Figure  1.  We  present  in  Table  1  the  dipole  moment, 


Figure  1.  (a)  NTE,  (b)  NTB. 
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Table  I.  Dipole  moment,  n(D),  and  linear  polarizability,  a(  10  24  esu), 
of  nte  and  ntb  obtained  by  different  methods. 


H  <«(  <o)>a 


Method 

NTE 

NTB 

NTE 

NTB 

cndo-sos: 

SCI 

7.69 

8.23 

11.31 

15.96 

SDCI 

6.87 

7.46 

6.37 

9.00 

indo-sos: 

SCI 

8.36 

8.89 

12.20 

17.33 

SDCI 

7.34 

8.71 

6.66 

11.84 

indo-tdhf: 

sz 

7.21 

7.92 

10.85 

16.30 

DZ 

7.44 

8.11 

12.20 

19.32 

Ab  initio- tdhf: 

sv  +  s.d. 

7.54 

8.28 

15.52 

23.03 

Exp. 

5.17 

5.44 

13.2 

25.5 

a  All  calculations  of  «(co)  performed  at  a  fundamental  optical  wave¬ 
length  of  1064  nm. 


lx,  and  linear  polarizability,  a(w);  and  in  Table  II,  the  first  hyperpolarizability, 
/?(~2o;;  a),  to),  calculated  by  the  SOS  and  the  tdcphf  methods,  along  with  the 
corresponding  experimental  results  also  obtained  in  our  laboratory. 

All  theoretical  calculations  were  performed  at  the  ab  initio  self-consistent-field 
(SCF)  optimized  geometry  reported  earlier  [12].  The  SOS  calculations  were  per¬ 
formed  by  suitably  modifying  the  zindo  (Zerner  indo)  semi-empirical  programs.* 
Two  sets  of  calculations  were  performed:  One  with  the  CNDO  Hamiltonian  and  the 
other  with  the  INDO  Hamiltonian.  In  both  sets  the  excited  states  were  generated 
from  a  single  reference  ground  state  Hartree-Fock  configuration  by  two  different 
Cl  treatments:  ( 1 )  A  singles  only  ci,  in  which  all  electrons  from  the  highest  ten 
doubly  occupied  molecular  orbitals  (MO)  were  allowed  to  be  singly  excited  into 
the  lowest  ten  unoccupied  molecular  orbitals.  This  resulted  in  101  singly  excited 
configurations  and  35  electronic  states,  which  were  all  included  in  the  SOS  expres¬ 
sions.  The  results  obtained  from  this  calculation  are  listed  as  SCI.  (2)  Next,  in 
addition  to  the  above  SCI,  double  excitations  were  allowed  from  the  three  highest 
occupied  mo  to  the  lowest  three  unoccupied  mo.  This  resulted  in  a  total  of  155 
singly  and  doubly  excited  configurations  and  23  electronic  states,  which  were  all 
included  in  the  SOS  expressions. 

The  tdcphf  calculations  were  performed  using  the  multiple-zeta  indo  (mzindo) 
[  10,23]  program  developed  at  the  Photonics  Research  Laboratory.  The  indo-tdhf 
calculations  were  performed  using  a  single  zeta  (sz)  as  well  as  a  double  zeta  (dz) 
basis  set  taken  from  the  work  of  dementi  and  Roetti  [24].  The  ab  initio  results 


*  ZINDO  program  system  has  been  developed  by  Professor  M.  C.  Zerner  at  the  University  of  Florida. 
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Table  II.  First-hyperpolarizability,  j8(— 2a>;  a>,  <o),  ( 10”30  esu)  and  second- 
hyperpolarizability  oj,  a),  -oj)  (10-36  esu)  of  nte  and  ntb  obtained  by  different 

methods.0 


1/51 

<?>‘ 

Method 

NTE 

NTB 

NTE 

NTB 

CNDO-SOS: 


SCI 

19.43 

36.21 

88.45 

199.34 

SDCI 

12.36 

23.11 

73.93 

208.21 

indo-sos: 

SCI 

49.23 

90.54 

365.55 

859.37 

SDCI 

24.31 

65.91 

178.12 

762.01 

indo-tdhf: 

SZ 

DZ 

11.95 

7.21 

23.31 

15.26 

Ab  initio- tdhf 

sv  +  s.d. 

7.82 

18.30 

2.66 

5.47 

Exp. 

20.2 

66.2 

24 

84 

0  All  calculations  performed  at  a  fundamental  optical  wavelength  of  1064  nm. 

b  III  =  (s  0?) ' ;  ft  -  12  (ft#  +  ft#  +  ft#);  Ui  =  y,  * 

C  ‘(‘Y^  =  1/15  2  (yajj  'Yur/  -b  Ti/t/)j  Uj  ~ 

U 

d  Experimental  /?  values  measured  at  a  fundamental  wavelength  of  1064  nm  and 
the  y  values  measured  at  a  fundamental  wavelength  of  602  nm. 


were  obtained  from  tdcphf  calculations  using  the  HONDO  suite  of  programs  [25  ] . 
In  the  ab  initio  calculations,  a  4-31 G  split  valence  (sv)  set  augmented  by  semi- 
diffuse  functions  on  the  heavy  atoms  was  employed.  This  basis  set,  called  sv  +  s.d., 
is  listed  as  basis  D  in  Ref.  [  12] . 

The  experimental  measurements  have  been  described  in  detail  in  Ref.  [  12].  In 
brief,  the  experimental  measurements  were  performed  in  solution  with  xylene, 
chloroform,  or  tetrahydrofuran  (thf)  as  solvent.  The  linear  polarizability  was  ob¬ 
tained  from  the  refractive  index  measurement  by  Abbe  refractometer  at  the  wave¬ 
length  of  sodium  D-line.  The  results  for  /?(-2o>;  a>,  a?)  were  obtained  from  the  efish 
measurement  at  the  fundamental  wavelength  of  a  Nd-YAG  laser  (X  =  1064  nm) 
in  chloroform  (X(3)(efish)  =  0.88  X  10-13  esu)  solution  which  was  also  used  as 
reference.  The  y(-o>;  oj,  w,  co)  values  were  obtained  from  dfwm  measurements  at 
a  fundamental  wavelength  X  of  602  nm  and  thf  (X(3)  =  0.37  X  10-13  esu)  was 
used  as  the  reference. 

An  examination  of  Table  I  reveals  that  all  theoretical  calculations  yield  larger 
values  of  the  dipole  moment  compared  to  the  experiment.  The  sdci  results  of  p 
show  a  better  accord  with  the  experiment  than  do  the  SCI  results.  In  the  indo- 
tdhf  method,  improving  the  basis  function  seems  to  worsen  the  agreement  with 
experiment,  though  the  results  get  closer  to  the  ab  initio  values. 
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For  the  linear  polarizability,  a(w),  the  SOS  method  worsens  the  agreement  with 
the  experiment  in  going  from  the  SCI  to  SDCI,  by  reducing  the  value  of  a(a>)  by  as 
much  as  50%.  Both  the  cndo  and  the  indo  methods  yield  comparable  results  of 
a(o))  in  the  framework  of  the  SOS  technique.  The  INDO-TDHF  results  in  the  SZ 
basis,  which  are  comparable  to  the  SCI  (cndo  +  indo)  results,  show  less  spectacular 
changes  due  to  basis  set.  Similar  to  the  dipole  moment  results,  the  value  of  (  a:(  «)) 
increases  upon  improving  the  basis  set,  but  now  getting  closer  to  the  experiment 
and  also  to  the  ab  initio  results.  The  ab  initio  results  for  (a)  show  the  best  agreement 
with  the  experiment. 

In  contrast  to  the  linear  polarizability,  the  nonlinear  coefficients,  (3  and  y  show 
a  large  variation  in  different  theoretical  treatment.  For  example,  including  the  dou¬ 
bles  ci  reduces  the  calculated  values  of  p  of  nte  by  about  a  factor  of  half  and  that 
of  NTB  by  a  factor  of  1  /  3  or  more  in  the  SOS  calculations.  Further,  the  difference 
between  the  cndo  and  indo  results  range  from  a  factor  of  2  in  the  case  of  nte 
and  sdci  treatment  to  almost  a  factor  of  3  for  ntb,  with  the  INDO  results  being 
larger.  In  comparison  to  the  experiment,  the  indo-sdci  results  seem  to  have  the 
best  values  for  both  nte  and  ntb  followed  by  the  cndo-SCI  results. 

In  the  tdhf  calculations,  both  the  ab  initio  and  the  indo  results  of  0,  which  are 
not  too  different  from  each  other,  are  too  small  by  a  factor  of  3  or  more  compared 
to  the  experimental  values.  Once  again,  improving  the  basis  set  in  indo-tdhf 
calculations  brings  the  results  closer  to  the  ab  initio  values,  though  the  agreement 
with  the  experimental  values  worsens. 

The  calculated  values  of  (7)  show  a  poor  agreement  with  the  experiment.  The 
SOS  results  of  the  cndo  and  indo  calculations  are  larger  than  the  experimental 
values  by  a  factor  of  4  to  almost  an  order  of  magnitude.  On  the  other  hand,  the 
tdhf  results  obtained  from  the  ab  initio  calculations  are  too  small  by  an  order  of 
magnitude  or  more  compared  to  the  experiment.  In  the  SOS  calculations  of  7,  the 
inclusion  of  doubles  Cl  has  less  spectacular  effect  in  CNDO  method,  where  the  values 
between  the  SCI  and  sdci  treatments  differ  from  each  other  by  less  than  10%. 
Though  a  similar  difference  in  the  SCI  and  sdci  results  of  indo-sos  calculations 
of  7  for  ntb  is  noted,  that  for  nte  is  more  than  a  factor  of  2,  with  the  values  in 
sdci  always  being  smaller  than  in  the  SCI  treatment. 

Issues  and  Opportunities  for  Theoretical  Modeling 
of  Second-  and  Third-Order  Nonlinearities 

Although  far  from  exhaustive,  the  results  discussed  in  the  previous  section  provide 
some  insight  into  the  problems  associated  with  the  theoretical  characterization  of 
the  linear  and  nlo  properties  of  organic  systems.  It  is  clear  that  all  the  available 
theoretical  methods  can  provide  reasonably  accurate  values  of  the  linear  polariz¬ 
ability,  a,  though  two  important  points  can  be  noted  from  Table  I:  (a)  The  TDHF 
method  ( ab  initio  as  well  as  semi-empirical)  gives  a  better  agreement  with  experiment 
than  does  SOS.  (b)  The  singles  only  treatment  in  the  SOS  method  consistently  gives 
better  result  than  the  singles  and  doubles  treatment. 

For  /?,  there  is  no  consistency  among  the  results  obtained  by  different  theoretical 
methods,  though  the  indo-sdci  values  are  closest  to  their  experimental  counter- 
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parts.  The  (3  value  in  donor-acceptor  systems,  such  as  nte  and  ntb  (Table  II),  is 
dominated  by  the  contribution  from  the  lowest-lying  charge  transfer  state,  in  which 
case  even  a  two-level  model  often  gives  a  reasonable  description  of  second-order 
effects  [  1  ] .  Therefore,  the  SOS  method  within  the  CNDO  and  indo  approximations 
are  expected  to  provide  reasonably  accurate  value  of  This  is  found  to  be  true  in 
the  present  study  for  INDO-SDCI.  The  SOS  semi-empirical  methods,  (cndo  as  well 
as  indo),  however,  do  not  seem  to  provide  reliable  values  of  the  third-order  non¬ 
linear  optical  properties. 

Our  studies  on  other  organic  molecules  [6-9,1 1-12]  have  revealed  that  the  ab 
initio- TDHF  results  of  (3  and  y  are  consistently  too  small  compared  to  experiment, 
as  can  also  be  seen  from  Table  II.  Furthermore,  we  have  demonstrated  [5]  that 
once  the  basis  set  has  been  sufficiently  saturated  by  including  a  relatively  small 
number  of  semidiffuse  and  diffuse  p  and  d  functions,  rather  small  changes  occur 
in  the  calculated  results  due  to  further  extension  of  the  basis  set.  This  leads  us  to 
recognize  the  absence  of  the  treatment  of  the  electron-correlation  effect  (ec)  as  a 
major  shortcoming  of  the  tdhf  approach.  There  has  already  been  some  effort  to 
correct  the  TDHF  results  either  indirectly  [26]  or  directly  [27]  via  perturbative 
treatment  of  EC  effect.  While  such  treatments  have  established  the  importance  of 
EC  effect  in  NLO  processes  and  have  provided  the  first  step  toward  improving  the 
ab  initio  based  calculations,  their  applicability  seems  to  be  limited  to  atoms  and 
small  molecules.  Further  development  in  the  treatment  of  the  EC  effect  in  the  ab 
initio  method  and  improvement  in  the  applicability  and  the  efficiency  of  computer 
algorithm  are  required  for  accurate  prediction  of  nlo  effects  in  organic  systems.  A 
similar  development  for  the  treatment  of  EC  effect  and  inclusion  of  polarization 
and  diffuse  functions  in  the  indo  method  may  perhaps  be  more  fruitful,  since  such 
a  method  will  be  considerably  economical  compared  to  the  ab  initio  method. 

With  regard  to  the  SOS  method,  there  are  a  number  of  factors  that  need  to  be 
addressed.  Since  it  is  clear  that  the  application  of  the  SOS  method  to  organic  mol¬ 
ecules  is  practical  only  with  the  semi-empirical  Hamiltonians,  it  would  be  useful 
to  examine  and  hopefully  improve  the  factors  affecting  the  accuracy  of  the  calculated 
results  in  these  methods.  It  should  be  recognized  that  even  the  most  versatile  suite 
of  semi-empirical  programs,  e.g.,  zindo  have  been  parameterized  for  properties 
other  than  the  linear  and  NLO  properties.  Since  ZINDO  is  the  most  widely  used 
semi-empirical  program  for  studying  structure-property  relationship  in  organics, 
it  would  be  useful  to  obtain  a  new  set  of  parameters  to  be  used  with  zindo  for  the 
calculation  of  NLO  properties.  At  the  same  time,  it  is  also  necessary  that  extended 
set  of  basis  functions  capable  of  describing  the  valence  and  diffuse  states  in  the 
cndo  and  indo  formalism  be  optimized.  Since  the  core  of  the  SOS  formalism 
involves  the  expectation  values  (\pi/v/h)  between  electronic  states  i pj,  \pj  of  mol¬ 
ecules,  where  I  and  J  could  easily  run,  even  in  the  limited  Cl  scheme,  into  the 
diffuse  manifold  of  the  spectrum,  a  basis  set  consisting  of  valence  only  functions 
would  not  provide  correct  description  of  the  excited  state  wave  functions  and  tran¬ 
sition  dipole  moments. 

Development  of  auxiliary  basis  sets  of  polarization  and  diffuse  type  functions 
would  also  be  beneficial  for  semi-empirical  tdhf  calculations.  As  noted  in  the 
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previous  section,  using  a  double  zeta  basis  set  in  INDO-TDHF  calculations  substan¬ 
tially  improves  the  calculated  results  of  linear  and  nlo  properties.  Extending  these 
basis  sets  with  diffuse  type  functions  would  hopefully  further  improve  the  perfor¬ 
mance  of  this  method. 

Another  problem  in  the  application  of  the  semi-empirical  SOS  methods  never 
discussed  in  the  literature  is  more  of  computational  nature  and  involves  the  program 
limitation  for  handling  the  excited  state  configurations.  The  excited  states  in  the 
SOS  methods  are  obtained  from  a  Cl  space  generated  from  the  single  and  sometimes 
also  double  excitation  of  a  limited  set  of  electrons  (except  in  a  few  7r-electron 
calculations  where  it  is  possible  to  perform  full  Cl)  from  the  occupied  MO  to  the 
unoccupied  MO  of  the  ground  state  configuration.  The  physical  question  involved 
in  such  a  scheme  is:  How  many  electrons  must  be  correlated  to  get  an  accurate 
description  of  the  many-electron  wave  function  from  which  the  excited  states  and 
transition  moments  are  derived?  Unlike  the  spectroscopic  studies  where  a  limited 
number  of  states  of  a  given  symmetry  are  required  and  the  Cl  diagonalization  on 
a  maximum  of  ten  low-lying  states  often  suffices  the  computation,  the  SOS  expres¬ 
sions  for  calculating  linear  and  nlo  properties  require  the  entire  manifold  of  the 
excited  state  spectrum  and,  therefore,  ideally,  one  would  need  to  correlate  all  elec¬ 
trons.  However,  assuming  that  the  core-core  and  core-valence  correlations  have 
only  minor  influence  on  nlo  effects  (which  may  not  be  true  for  molecules  containing 
first  row  atoms),  one  is  at  least  required  to  correlate  all  valence  electrons  of  the 
molecule.  Although,  in  small  molecules  containing  less  than  10  first-row  atoms, 
such  a  calculation  is  not  prohibitive,  the  program  limitations  to  handle  large  Cl 
space  at  the  present  time  forbids  such  calculations  on  organic  molecules.  For  ex¬ 
ample,  the  ZINDO  program  permits  a  maximum  of  250  singly  and  doubly  excited 
configurations  and  200  excited  states.  For  the  two  examples  given  in  the  previous 
section,  we  extended  the  maximum  configuration  limitation  (nbmax)  to  350  and 
yet  the  number  of  such  configurations  generated  by  including  any  additional  electron 
pair  over  the  10  electrons  already  included  for  single  excitation  and  six  electrons 
for  double  excitation  would  easily  go  beyond  400  to  cause  the  program  to  stop. 

In  a  preliminary  SOS  study  on  pflra-nitroaniline  and  other  related  molecules  [  23  ] 
we  have  found  that  the  calculated  values  of  (3  and  y  are  strongly  dependent  on  the 

( 1 )  number  of  electrons  to  be  included  in  ci,  (2)  the  type  of  excitation  (singles/ 
doubles)  for  generating  the  Cl  space,  and  (3)  the  number  of  excited  states  included 
in  the  SOS  expressions.  Our  study  indicates  that  with  the  increasing  Cl  space  and 
the  number  of  excited  states,  the  calculated  results  tend  to  saturate.  Therefore,  in 
order  to  include  a  large  enough  ci  space  in  the  SOS  calculation,  further  extension 
of  the  capability  of  popular  semi-empirical  programs  such  as  ZINDO  will  be  valuable 
in  the  theoretical  modeling  of  NLO  materials. 

Apart  from  the  limitation  of  the  presently  available  theoretical  methods,  there 
is  also  the  question  of  the  comparison  between  the  results  of  the  molecular  calcu¬ 
lations  and  those  obtained  from  the  bulk,  generally  solution,  measurements.  It  is 
important  to  recognize  that  the  molecular  calculations  are  performed  on  “gas-like” 
isolated  structures  that  ( 1 )  have  no  contribution  from  the  intermolecular  interaction, 

(2)  often  ignore  contributions  from  vibrational  effect  in  molecule,  and  (3)  assume 
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an  idealized  geometry  to  remain  fixed  during  the  entire  interaction  time.  The  ex¬ 
perimental  measurements,  however,  are  performed  in  environments  where  one  or 
more  of  these  effects  may  be  present  to  complicate  the  observed  values.  The  effect 
of  molecular  interaction,  for  example,  as  discussed  in  the  previous  section  is  included 
in  the  experimental  measurements  through  the  Onsager  or  the  Lorentz  “local  field 
factor”.  Experimental  studies  performed  on  the  “concentration  dependence”  of 
the  measured  y  values  in  dilute  solution  indicate,  through  a  linear  behavior  of  the 
concentration  vs.  X (3)  curve,  that  the  present  models  of  local  field  adequately  account 
for  molecular  interaction  for  “nonpolar”  or  low  dipole  moment  organic  materials. 
The  situation  is  not  so  simple  in  highly  polar  materials  with  large  0  values  which 
exhibit  strong  solvatochromic  effects  [  1  ] .  Therefore,  a  reliable  theory  of  the  “local 
field  effect”  would  be  extremely  useful  both  for  correctly  deriving  0  values  from 
measured  X (2)  and  also  for  theoretically  predicting  the  optical  nonlinearity  of  mo¬ 
lecular  materials  in  the  bulk  form.  Studies  directed  at  understanding  the  local  field 
effects  and  the  validity  of  the  Onsager  model  are  being  pursued  in  several  laboratories 
[28,29]  including  ours  at  the  present  time. 

The  vibrational  optical  nonlinearity  though  important  within  its  own  right  in 
processes  such  as  hyper-Raman  and  antistokes-Raman  effects,  also  becomes  sig¬ 
nificant  in  the  low-frequency  limit.  Recent  studies  by  Bishop  and  Kirtman  [30] 
and  Bishop  et  al.  [31  ],  on  FH,  H20,  C02,  and  NH3  indicate  that  the  vibrational 
contribution  to  the  static  hyperpolarizabilities  may  vary  from  620%  in  the  case  of 
NH3  to  -37%  in  the  case  of  H20  of  their  electronic  counterparts  for  0(o;  o,  o)  and 
from  15%  in  the  case  of  H20  to  84%  in  the  case  of  NH3  of  their  electronic  coun¬ 
terparts  for  y{o\o,o).  At  optical  frequencies,  the  calculated  vibrational  contributions 
to  electronic  nonlinearity  in  these  molecules  are  much  smaller.  Further  studies  and 
theoretical  analysis  of  this  effect  are,  however,  needed  to  establish  its  relative  con¬ 
tribution  as  well  as  to  understanding  the  mechanism  of  some  nuclear  level  nlo 
effects  in  organic  molecules. 

Another  nuclear  effect  which  makes  significant  contribution  to  the  measured  y 
values,  especially  those  obtained  from  the  dfwm  and  Kerr  gate  experiments,  arises 
from  the  reorientation  of  molecules  by  the  optical  field.  These  experiments  show 
both  the  electronic  as  well’  as  the  “reorientation”  contributions.  If  sufficient  care 
has  not  been  taken  in  the  experimental  measurement  to  separate  the  reorientation 
contribution  from  the  electronic  part,  ye :  the  quantity  obtainable  from  the  theo¬ 
retical  calculations,  a  comparison  between  the  two  sets  of  result  may  not  be 
meaningful. 

It  should  be  also  noted  that  the  idealized  geometry  of  a  molecule  assumed  in 
theoretical  calculations  may  not  represent  the  true  geometry  of  the  molecule  in 
bulk.  The  true  geometry  in  a  bulk  form  may  be  different  due  to  several  reasons. 
For  example,  a  conformational  chemical  defect  present  in  the  molecule,  as  in  the 
case  of  extended  conjugated  systems,  may  substantially  alter  the  effective  7r-electron 
conjugation,  and  consequently,  the  NLO  response.  This  may  further  complicate  the 
comparison  between  theory  and  experiment. 

Another  factor  often  responsible  for  discrepancy  between  theory  and  experiment 
is  the  dispersion  effect.  Theoretical  calculations  of  nlo  effects  are  often  performed 
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assuming  a  nonresonant  optical  frequency.  The  experimental  measurements,  where 
the  optical  frequency  is  close  to  one-  or  two-photon  absorption  in  the  molecule, 
may  contain  substantial  resonant  contributions  due  to  dispersion  of  NLO  coefficients. 

Finally,  the  measurements  are  susceptible  to  variations  of  experimental  condi¬ 
tions,  references,  and  the  method  used.  Often,  experimental  results  performed  on 
the  same  molecule  by  different  groups  greatly  vary  from  each  other.  One  such 
example  is  the  frequently  referenced  system,  /?ara-nitroaniline,  whose  reported  0 
value  varies  over  a  range  of  a  factor  of  two  [6].  Such  discrepancies  in  the  experi¬ 
mental  results  only  add  to  the  already  complicated  situation  facing  theoretical  char¬ 
acterization  of  nlo  properties. 


Photorefractive  Materials 

Photorefractivity  is  produced  by  a  combined  action  of  electrical  properties  and 
the  electrooptic  properties.  The  photorefractive  phenomenon  can  be  described  in 
terms  of  the  following  steps  [  32  ] .  The  light  is  absorbed  by  a  photosensitizing  group 
whereby  electron-hole  pairs  are  created.  One  of  the  carriers,  which  has  the  higher 
mobility,  moves  from  the  original  site,  gets  trapped  in  the  dark  region  and  creates 
a  space-charge  field  due  to  separation  of  charges.  This  resulting  space-charge  electric 
field  then  produces  a  refractive  index  change  due  to  the  electrooptic  activity  of  the 
material.  The  result  is  a  light-induced  refractive  index  change  in  the  material,  so 
the  overall  manifestation  is  similar  to  X (3)  except  that  it  is  not  purely  an  electronic 
effect  at  the  molecular  level.  This  is  an  effect  which  is  a  bulk  manifestation  because 
the  charges  have  to  move  spatially  on  macroscopic  distances  in  order  for  them  to 
set  up  a  space-charge  field  that  is  sufficiently  large  to  produce  refractive  index  change 
by  electrooptic  effect.  In  understanding  the  photorefractive  behavior,  therefore,  one 
has  to  consider  at  molecular  level  the  efficiency  of  the  charge  carrier  generation  by 
the  action  of  light,  the  mobilities  of  the  carriers  in  bulk  and  formation  of  the  resulting 
space-charge  field. 

The  organic  photorefractive  system  would  be  different  from  widely  investigated 
inorganic  ferroelectrics  and  semiconductors  in  the  sense  that  both  the  quantum 
efficiency  for  charge  carrier  generation  and  the  mobilities  of  the  carriers  are  strongly 
dependent  on  the  electric  field  [33],  The  efficiency  of  photogenerated  charge  carriers 
is  often  described  by  the  Onsager  model.  The  dependence  of  mobility  on  the  field 
is  in  turn  dependent  on  the  model  used  for  the  transport  of  charge  carriers.  In  a 
dispersive  model,  where  the  mobilities  are  dominated  by  the  multiple  trapping 
dynamics,  one  can  describe  the  dependence  and  mobility  of  the  field  by  some  power 
law  for  a  limited  range  of  electric  field  applied.  In  our  laboratory,  we  have  considered 
the  field  dependence  of  the  photogeneration  efficiency  by  using  the  Onsager  model 
and  the  dependence  of  mobility  on  the  electric  field  by  power  dependence  such  as 
Ep,  where  p  is  determined  by  experimental  measurements  [  34,35  ] .  With  numerical 
simulation  of  the  space  charge  field  using  this  model,  we  have  calculated  the  dif¬ 
fraction  efficiency.  The  agreement  between  the  experimentally  observed  field  applied 
dependence  of  diffraction  efficiency  and  that  calculated  is  satisfactory  [34,35]. 
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This  new  field  of  photorefractive  polymers  offers  opportunities  for  theoretical 
modeling  at  the  bulk  level.  A  more  complete  modeling  of  space-charge  field  and 
trapping  dynamics  to  understand  the  time  response  is  very  much  needed. 
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Abstract 

A  new  generation  of  nearly  first-principles  calculations  predicts  both  the  linear  and  second-harmonic 
susceptibilities  for  a  variety  of  insulating  crystals,  including  GaAs,  GaP,  AlAs,  A1P,  Se,  a-quartz,  and  c- 
urea.  The  results  are  typically  in  agreement  with  experimental  measurements.  The  calculations  have 
been  extended  to  optical  activity,  with  somewhat  less  success  to  date.  The  theory,  based  on  a  simple  self¬ 
energy  correction  to  the  local  density  approximation,  and  results  are  reviewed  herein.  ©  1994  John  Wiley 
&  Sons,  Inc. 


Introduction 

The  optical  properties  of  solids  have  both  scientific  and  technological  interest. 
For  solid-state  physics,  the  response  of  matter  to  an  electric  field  is  one  of  its  most 
fundamental  properties.  This  interaction  has  helped  to  elucidate  the  nature  of  light 
itself.  Historically,  crystalline  quartz  has  been  the  key  material,  including  the  dis¬ 
covery  of  birefringence  in  1690,  optical  activity  in  1811,  and  optical  second- 
harmonic  generation  in  1961.  The  present  scientific  interest  is  centered  on  the 
calculation  of  the  tensors  in  the  consitutative  equations  of  macroscopic  electro¬ 
magnetism  rather  than  the  electrodynamics. 

Technologically,  one  need  only  look  to  laser  technology  and  telecommunications 
to  see  applications  of  second-harmonic  generation  and  related  phenomena  such  as 
optical  parametric  oscillation  and  electro-optic  modulators.  The  scientific  motivation 
is  to  understand  what  is  required  to  model  these  phenomena;  if  the  understanding 
is  sufficiently  robust,  the  hope  is  the  predictions  may  help  select  materials  that  are 
technologically  superior. 

Because  of  its  relative  simplicity  and  accuracy,  the  local  density  approximation 
(lda)  is  the  most  widely  used  computational  method  for  determining  the  structural 
and  electronic  structure  properties  of  solids  using  little  or  no  experimental  input 
[1].  Nevertheless,  its  application  to  the  optical  response  of  solids  has  been  limited 
largely  to  linear  optical  response.  The  present  work  reviews  an  attempt  to  extend 
the  predictive  powers  of  the  LDA  (with  its  leading  error  corrected,  as  explained  in 
the  next  section)  to  higher  optical  response  tensors.  To  date,  the  second-harmonic 
susceptibility  and  the  optical  activity  tensor  have  been  considered.  Although  not 
discussed  here,  it  would  be  natural  to  ultimately  extend  the  present  work  to  yet 
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higher  tensors,  specifically  those  governing  third-harmonic  generation,  second- 
harmonic  generation  in  centrosymmetric  systems,  and  the  anisotropy  of  the  di¬ 
electric  constant  in  cubic  materials.  The  intent  of  this  paper  is  to  present  a  readable 
overview;  the  reader  is  referred  to  the  original  papers  for  technical  details. 

A  Self-Energy  Correction  to  lda 

By  1980,  it  became  apparent  that  the  local  density  approximation  systematically 
underestimated  band  gaps  in  essentially  all  semiconductors.  In  the  case  of  german¬ 
ium,  metallic  rather  than  semiconducting  behavior  is  predicted.  This  systematic 
failure  led  solid-state  theorists  to  consider  more  sophisticated  alternatives  to  lda. 
After  a  few  years,  it  became  evident  that  the  GW  theory  of  Hedin  [2]  did  correctly 
predict  the  band  gaps  of  semiconductors  to  within  perhaps  100  meV  [3,4].  Re¬ 
markably,  although  the  GW  eigenvalues  differ  from  lda  eigenvalues  by  about  1  eV 
the  GW  quasiparticle  wave  functions  are  little  different  than  the  lda  wave  func¬ 
tions  [  3  ] . 

In  the  mid-1980s,  it  also  became  apparent  that  the  LDA  was  overestimating  the 
calculated  dielectric  constants  of  semiconductors  [5,6].  This  is  a  natural  conse¬ 
quence  of  the  underestimate  of  the  band  gap:  The  simplest  model  for  dielectric 
response  that  is  consistent  with  causality,  and  the /sum  rule  is  the  single-pole  model. 
The  static  dielectric  constant  is  given  in  this  model  by 

e  =  1  +  «J/coo, 

where  oop  is  the  plasma  frequency  and  co0  is  position  of  the  pole.  An  underestimate 
of  the  gap  is  analogous  to  setting  oj0  too  small,  which  leads  to  an  unwanted  increase 
in  e. 

It  was  natural  to  suppose  that,  by  correcting  the  band  gap  with  the  GW  values, 
a  more  accurate  dielectric  constant  could  be  found  for  semiconductors.  Consider 
the  formula  the  static  dielectric  constant  for  insulators 

J*  occ 

d%  2  (n~fc\pGhip\nfc)>,  (1) 

BZ 

where  —  e  is  the  charge  on  an  electron,  %  is  the  crystal  momentum  whose  domain 
of  integration  is  the  Brillouin  zone  (  bz  ),  is  the  unit  cell  volume  divided  by  ( 2x ) 3 , 
p  is  the  momentum  operator,  and 

=  \mZ)(mTc\ 

'JnTc  Zj  > 
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where  cnt  the  energy  eigenvalue  associated  with  the  state  |  nH').  As  first  suggested 
[6]  and  later  shown  numerically  [7,8],  using  the  GW  eigenvalues  in  Eq.  ( 1 )  leads 
to  a  substantial  overcorrection  (i.e.,  underestimate)  of  the  static  dielectric  constant 
for  Si,  Ge,  and  other  semiconductors. 

The  intuitive  notion  that  improved  band  structure  should  lead  to  an  improved 
calculation  of  the  dielectric  constants  of  semiconductors  proved  to  be  correct,  but 
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a  deeper  analysis  was  required.  Levine  and  Allan  [8,9]  summarized  the  GW  results 
in  the  effective  crystal  Hamiltonian 

Hf  =  H\da  +  A Pc%,  (2) 

where  A  is  the  difference  between  the  GW  eigenvalues  and  the  LDA  eigenvalues  in 
the  notation  of  Sham  and  Schliiter  [10]  and  Pct  is  the  projection  operator  acting 
on  the  conduction  bands.  The  term  A Pct  along  with  the  usual  exchange-correlation 
term  of  the  LDA,  approximates  the  self-energy  suggested  by  GW  calculations. 

It  had  been  pointed  out  in  the  context  of  the  calculation  of  the  dielectric  constants 
of  solids  that  a  nonlocal  potential  required  a  correction  to  the  momentum  matrix 
elements  [5].  In  Ref.  [5]  the  nonlocal  potential  was  a  nonlocal  pseudopotential. 
This  term  may  be  thought  of  as  arising  either  from  the  commutator  of  r  and  H 
(Ref.  [5])  or  from  V%H%  [8,9].  These  forms  are  equivalent  [11].  Matrix  elements 
of  p  are  not  fundamental,  but  depend  upon  a  particular  form  of  the  Hamiltonian. 
Matrix  elements  of  are  more  general  for  crystalline  solids. 

In  the  calculation  of  Refs.  [8]  and  [9],  the  nonlocal  pseudopotential  appears  as 
well  as  an  additional  nonlocality  in  the  term  A Pc£.  Even  though  A  need  not  carry 
^-dependence,  because  the  conduction  bands  are  different  from  X-point  to  X-point 
in  the  Brillouin  zone,  the  projection  operator  has  X  dependence,  i.e.,  there  is  a 
nonzero  gradient  with  respect  to  X .  The  detailed  mathematical  consequences  are 
given  elsewhere  [7,8] .  Only  one  example  is  given  here.  The  static  dielectric  constant 
with  this  effective  Hamiltonian  is  given  by 
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Thus,  when  the  self-energy  correction  A Pc%  is  added  to  the  LDA  Hamiltonian,  only 
one  of  three  energy  denominators  participates  in  the  energy  shift.  The  resulting 
predictions  are  extremely  accurate  by  the  standards  of  solid-state  physics. 

The  calculation  proceeds  as  follows:  Well-converged  self-consistent  LDA  ground 
state  potentials  are  found  within  the  pseudopotential /plane  wave  method  using 
the  Corning  code  [12].  We  retain  about  600  plane  waves  for  a  small  system  such 
as  GaAs,  and  about  6000  plane  waves  for  c-urea,  the  largest  system  calculated  to 
date  (with  16  atoms  in  the  unit  cell).  Virtual  orbitals  are  found  in  the  (fixed) 
ground  state  potential.  These  are  used  in  a  “sums  over  states”  implementation  of 
perturbation  theory. 

The  calculated  dielectric  constants  are  given  for  a  number  of  materials  in  Table 
I,  and  their  size  derivatives  are  given  for  a  few  materials  in  this  table.  An  excellent 
account  of  the  experimental  values,  including  the  birefringence;  only  c-urea  is  ex¬ 
ceptional  in  that  there  is  an  underestimate  of  the  static  dielectric  constant,  in  contrast 
to  the  usual  LDA  overestimate.  The  size  derivatives  for  Si  and  Ge  differ  by  an  order 
of  magnitude  (in  both  theory  and  experiment),  yet  these  materials  are  chemically 
and  structurally  very  similar.  The  difference  is  largely  due  states  near  the  T-point 
in  the  Brillouin  zone.  In  Ge,  the  direct  gap  is  much  smaller  than  the  average  gap, 
and  this  region  dominates  the  optical  response.  In  Si,  the  bands  are  flat  throughout 
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Table  I.  The  static  dielectric  constants  and  their  size  derivatives  for  a  variety  of  materials. 


Material 

A  (eV) 

coo 

d  In  c^fd  In  a 

Ref. 

LDA 

Present 

Expt. 

LDA 

Present 

Expt. 

Si 

0.9 

13.5 

11.2 

11.4 

1.21 

0.77 

0.68,  0.83 

[8] 

Ge 

0.8 

21.3 

16.0 

15.3 

10.8 

7.2 

5.3,  7.8 

[8] 

AIP 

0.9 

8.3 

7.2 

7.4 

0.7 

0.4 

[7] 

AlAs 

0.9 

9.5 

8.1 

8.2 

2.0 

1.4 

[7] 

GaP 

0.9 

10.4 

8.8 

9.0 

2.7 

1.9 

1.6,  1.9 

[7] 

GaAs 

0.8 

13.7 

11.2 

10.8 

5.9 

4.2 

2.2,  3.1 

[7] 

Se  (o) 

1.1 

10.3 

7.9 

6.2-8.4 

[22] 

Se  (e) 

1.1 

15.2 

11.5 

10.2-13.7 

[22] 

a-Quartz  (o) 

1.8 

2.48 

2.30 

2.35 

[22] 

a-Quartz  (e) 

1.8 

2.51 

2.33 

2.38 

[22] 

c-Urea  (o) 

2.03 

2.17 

[23] 

c-Urea  (e) 

2.14 

2.49 

[23] 

The  experimental  data  are  cited  in  the  references.  For  the  uniaxial  crystals,  “o”  refers  to  the  ordinary 
ray,  and  “e”  refers  to  the  extraordinary  ray. 


the  Brillouin  zone,  so  that  no  single  region  dominates  the  optical  response  to  this 
extent.  A  two-band  model  fails  to  obtain  this  order  of  magnitude  difference  [13]. 


Second-Harmonic  Generation 

The  discovery  of  second-harmonic  generation  in  1961  [14]  led  to  a  great  deal  of 
theoretical  development  in  the  field.  Both  the  electrodynamics  and  the  band  theo¬ 
retical  description  were  elucidated  shortly  thereafter  [15,16].  Local-field  corrections 
were  first  given  in  this  period  as  well  [17].  Unfortunately,  the  formalism  was  far 
too  complicated  for  a  realistic  implementation  at  that  time. 

Perhaps  the  first  attempt  at  a  realistic  band  structure  calculation  came  in  1975, 
from  Fong  and  Shen  [18],  who  used  the  empirical  pseudopotential  method — already 
known  to  give  reasonable  results  for  linear  dielectric  response  [19] — to  calculate 
X(2)  for  GaAs  and  other  III-V  semiconductors.  Unfortunately,  their  results  were 
below  the  experimental  values  by  at  least  an  order  of  magnitude.  They  speculated 
that  local-field  effects  were  responsible,  but  did  not  attempt  to  include  these  explicitly. 
The  field  lay  fallow  for  over  a  decade,  when  it  was  suggested  by  Sipe  and  co-workers 
that  perhaps  it  was  the  matrix  elements  of  Fong  and  Shen  [18],  rather  than  the 
local-field  corrections,  which  were  principally  in  error  [20].  The  relevant  matrix 
elements  were  estimated  using  a  semiempirical  method,  leading  to  much  improved 
values  for  X (2)  compared  to  experiment.  Further  work  in  the  area  has  been  pursued 
by  Sipe  and  co-workers  [21],  ourselves  [7,22-24],  and  Huang  and  Ching  [25,26]. 
The  work  of  the  present  group  has  featured  great  attention  to  the  issue  of  numerical 
convergence,  local-field  corrections,  and  the  self-energy  correction  discussed  in  the 
previous  section  for  linear  response. 
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Table  II.  The  second-harmonic  susceptibility  in  the  static  limit  for  a  variety  of  materials. 


Material 

A  (eV) 

d  = 

Ix®  (pm/V) 

d  In  Xi2)/d  In  a 

Ref. 

LDA 

Present 

Expt. 

LDA 

Present  Expt. 

AIP 

0.9 

23 

15 

11 

10 

[7] 

AlAs 

0.9 

39 

24 

15 

13 

[7] 

GaP 

0.9 

60 

35 

37 

20 

18 

[7,24] 

GaAs 

0.8 

174 

86 

83 

30 

23 

[7,24] 

Se  (dn) 

1.1 

220 

97 

100 

[22] 

a-Quartz  (dn) 

1.8 

0.48 

0.33 

0.30 

[22] 

c-Urea  {d36) 

1.1 

1.2 

[23] 

When  more  than  one  tensor  component  exists  only  the  Kleinman-allowed  component  [35]  is  reported. 
The  experimental  data  are  cited  in  the  references.  For  GaP  and  GaAs,  the  experimental  values  given  are 
from  the  review  and  reanalysis  of  Roberts  [36]. 


Results  for  the  second-harmonic  susceptibility  for  the  static  limit  lim^o 
X(2)(— 2o;;co,a;)  are  given  in  Table  II.  Frequency-dependent  results  for  GaP  are 
shown  in  Figure  1;  the  frequency  dependence  of  GaAs  has  also  been  studied  [24]. 
The  results  are  seen  to  be  in  excellent  agreement  with  experiment  for  those  cases 
for  which  experimental  results  exist.  The  lda  calculation  has  a  larger  second- 


photon  energy  (eV) 

Figure  1.  Frequency  dependence  of  the  second-harmonic  susceptibility  as  a  function  of 
frequency.  The  open  symbols  are  original  data  from  Refs.  [  37  ]  and  [  38  ] ;  the  solid  symbols 
are  from  the  reanalysis  of  Roberts  [36].  The  present  self-energy-corrected  calculation  is 
given  by  the  solid  line,  and  the  present  lda  calculation  is  represented  by  the  dotted  line. 
The  dash  dot  line  is  from  the  calculation  of  Moss,  Sipe,  and  van  Driel  [20] ;  the  diamond 
is  from  the  calculation  of  Ghahramani  and  Sipe  [21].  The  dashed  line  is  due  to  the  cal¬ 
culation  of  Huang  and  Ching  [25  ] . 
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harmonic  susceptibility  than  the  self-energy  corrected  value,  and  also  has  a  larger 
dispersion.  In  contrast,  the  results  of  Huang  and  Ching  [25]  have  a  larger  second- 
harmonic  susceptibility,  but  a  smaller  dispersion.  So  the  differences  between  the 
present  calculation  and  those  of  Huang  and  Ching  cannot  be  summarized  by  a 
simple  shift  of  oscillator  strength. 

Local-field  corrections  are  given  in  Table  III.  For  the  semiconductors,  these  are 
modest.  However,  for  urea,  the  results  are  quite  large.  Urea,  (NH2)2CO  with  two 
molecules  per  unit  cell,  is  a  hydrogen  bonded  crystal;  the  intercell  charge  transfer 
is  therefore  much  lower  than  the  covalently  bonded  semiconductors.  The  CO  group 
is  a  tiny  conjugated  7 r  system  and  is  therefore  a  good  candidate  for  having  a  large 
nonlinear  optical  response.  Our  calculations  suggest  that  urea  is  in  a  “crossover” 
regime  in  which  both  molecular  (i.e.,  local-field)  and  solid-state  effects  are  important. 

Optical  Activity 

When  linearly  polarized  light  propagates  along  the  optic  axis  of  certain  uniaxial 
crystals,  the  plane  of  polarization  is  rotated  by  a  certain  angle  per  unit  length  of 
the  material.  Specifically,  when  the  eigenmodes  of  propagation  in  the  crystal  are 
left  and  right  circularly  polarized  light  waves,  the  optical  rotatory  power  p  is  related 
to  the  indices  of  refraction  nL  and  nR  by 

P  =  r-  (nL  -  nR). 

2c 

Optical  rotatory  power  is  the  best-known  manifestation  of  optical  activity  [  27  ] , 
which  encompasses  all  optical  phenomena  which  depend  upon  both  the  direction 
of  the  electric  field  and  the  direction  of  propagation. 

As  discussed  in  Ref.  [22],  the  linear  interaction  of  a  monochromatic  electro¬ 
magnetic  wave  (with  wave  vector  q)  traveling  through  a  medium  may  be  described 
through  the  joint  solution  of  the  wave  equations  for  the  vector  potential  A 


Table  III.  Local  field  corrections  for  a  variety  of  materials  for  the  linear  dielectric  response 
and  second-harmonic  susceptibility. 


Local  field/total  (%) 


Material 

A  (eV) 

X(2> 

Ref. 

AIP 

0.9 

-8 

-12 

[7] 

AlAs 

0.9 

-7 

-8 

[7] 

GaP 

0.9 

-6 

-13 

[7,24] 

GaAs 

0.8 

-5 

-8 

[7,24] 

Se 

1.1 

T 

irT 

T 

+22 

[22] 

a-Quartz 

1.8 

1 

L*1 

1 

-p- 

-6 

[22] 

oUrea 

0 

-17,  -12 

-89 

[23] 

If  two  values  are  given,  these  are  for  the  ordinary  and  extraordinary  rays,  respectively. 
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1  d2A  -. 

Vl4--— j~  V(V-A)  = 
c  dt 


4ir 


and  a  constitutive  relation 
Ji 


1  •  2 
0)  10)  , 

- —  (fiy  -  8ij)Aj  +  - —  VijiQj^i  +  0{q  ), 
47rc  4irc 


(4) 


(5) 


where  7  is  the  current,  i,  j,  and  /  are  Cartesian  indices,  and  5  is  the  Kronecker  d. 
Given  a  knowledge  of  e  and  77,  the  relationship  between  the  wave  vector  and  the 
frequency  can  be  determined,  and  from  this  any  optical  constant  may  be  extracted. 
Band  theory  can  supply  the  equations  for  the  dielectric  function  [8,28]  and  the 
optical  activity  tensor  [22].  The  formulas  are  quite  complicated,  so  will  not  be 
repeated  here.  Suffice  it  to  say  that  they  are  given  in  terms  of  matrix  elements  of 
direct  transitions  and  an  integral  over  the  Brillouin  zone. 

We  have  completed  calculations  of  the  optical  rotatory  power  for  selenium  [22,29] 
and  a-quartz  [  22  ] .  We  restrict  our  presentation  to  Figure  2,  the  case  of  a-quartz, 
for  which  the  experimental  data  are  more  reliable  than  selenium.  We  plot  the 
optical  rotatory  power  p  in  the  combination  o)/p1/2  because  this  brings  both  the 
experiment  and  the  calculation  into  a  straight  line  form,  as  suggested  by  the  single¬ 
oscillator  model  [  30  ] .  The  deviation  of  the  experimental  data  at  the  lowest  frequency 


Photon  Energy  Squared  (eV)2 


Figure  2.  Frequency  dependence  of  the  optical  rotatory  power  p  of  a-quartz.  The  com¬ 
bination  oj/p  1/2  is  chosen  so  that  the  data  and  predictions  will  be  approximately  straight 
lines.  The  lda  prediction  is  given  by  the  dotted  line,  and  the  self-energy  corrected  curve 
by  the  solid  line.  Local  fields  are  neglected  from  this  calculation.  The  calculation  under¬ 
estimates  p  by  about  a  factor  of  5. 
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plotted  is  almost  certainly  due  to  phonons.  It  is  troubling  that  the  predicted  optical 
rotatory  power  is  too  small  by  a  factor  of  5.  For  selenium,  the  calculation  is  too 
small  by  a  factor  of  about  1.5.  To  date,  local-field  corrections  have  not  been  included 
in  the  calculation.  However,  preliminary  results  indicate  these  will  represent  order 
1  corrections  to  the  values  reported  here  [31]. 

Concluding  Remarks 

The  local  density  approximation  modified  by  a  simple  self-energy  correction  has 
successfully  predicted  the  linear  and  second-harmonic  susceptibilities  for  crystalline 
solids  in  the  insulating  regime  in  a  series  of  calculations.  Systems  as  diverse  as 
simple  and  compound  semiconductors,  a-quartz  and  c-urea  have  been  investigated. 
The  strain  derivatives  have  been  considered  as  well  [32].  In  many  cases  these  are 
in  good  agreement  with  experiments;  in  some  cases,  the  predictions  are  made  in 
advance  of  experiments.  Optical  rotatory  power  has  been  calculated  as  well,  although 
with  less  quantitative  success  to  date.  Local-field  corrections  may  play  a  larger  role 
in  optical  rotatory  power  than  they  did  in  linear  optical  response  or  second-harmonic 
generation. 

While  the  ability  to  predict  optical  constants  of  solids  has  progressed  quite  a  bit 
recently,  it  is  worth  considering  some  of  the  principal  obstacles  to  further  progress. 
One  concerns  the  simple  approximation  of  the  GW  solutions.  Obviously,  including 
more  of  energy  dependence  and  genuine  GW  quasiparticle  wave  functions  rather 
than  lda  wave  functions  should  improve  matters.  However,  the  GW  quasiparticle 
peak  strengths  are  only  about  0.8  [  3,4] .  This  means  that  20%  of  the  total  oscillator 
strength  is  of  unknown  form  and  at  unknown  energies.  The  present  theory  simply 
places  the  “missing”  oscillator  strength  at  the  quasiparticle  energies.  A  better  scheme 
would  require  a  considerable  advance  in  the  theory  [33  ] .  The  gw  itself  is  in  principle 
a  theory  of  the  one-electron  Green’s  function.  Particle-hole  interactions  are  known 
to  be  important  in  the  optical  response  of  semiconductors  [34].  Yet  their  realistic 
treatment  is  a  post-state-of-the-art  task  in  1994.  Despite  these  difficulties  in  the 
theory,  personally  I  find  it  remarkable  just  how  well  a  simple  one-electron  ansatz, 
implemented  with  sufficient  variational  freedom,  works  in  comparison  to  the 
experiments. 
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Abstract 

Liquid  metallic  caesium,  taken  up  the  liquid-vapor  coexistence  curve  towards  the  critical  point,  exhibits 
a  crossover  from  Pauli  spin  paramagnetism  near  freezing  to  Curie-like  behavior  near  criticality.  This 
transition  is  discussed  in  terms  of  conventional  Fermi  liquid  theory  at  low  temperatures  and  heavy 
Fermion  behavior  on  approaching  the  critical  region.  The  maximum  in  the  magnetic  susceptibility 
permits  an  estimate  of  the  discontinuity  q  in  the  momentum  distribution  at  the  corresponding  density, 
and  this  has  recently  been  checked  independently  using  nmr  data.  An  argument  is  presented,  via  q, 
which  links  a  nonequilibrium  transport  property,  namely,  electronic  conductivity,  to  a  thermodynamic 
quantity,  the  magnetic  susceptibility.  With  the  above  as  background,  a  search  has  been  made  for  related 
correlations  in  the  normal  state  of  the  high  Tc  superconducting  copper  oxides.  Using  again  Fermi  liquid 
theory  (now  two-dimensional),  it  is  first  argued  that  there  should  be  an  intimate  correlation  between 
nuclear  spin-lattice  relaxation  time  Tx  and  electrical  resistivity  R .  This  is  borne  out,  at  least  partially, 
by  available  experiments.  In  the  same  context,  the  anti  ferromagnetic  susceptibility  X(0,  with  Q  the 
antiferromagnetic  wave  number,  is  linked  with  Tx  and  R.  Finally  some  brief  comments  are  made  on  the 
interpretation  of  Hall  and  thermopower  measurements.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

In  earlier  work  on  strongly  correlated  liquid  metals,  taken  along  the  liquid- vapor 
coexistence  curve  towards  the  critical  point,  a  relation  between  a  transport  property, 
electrical  resistivity,  and  a  thermodynamic  quantity,  magnetic  susceptibility,  has 
been  demonstrated.  The  basic  arguments  underlying  this  relation  are  first  sum¬ 
marized  in  the  next  section. 

This  same  type  of  correlation  is  then  pressed  here  for  electron  or  hole  liquids 
flowing  through  antiferromagnetic  assemblies  in  the  normal  state  of  some  high  Tc 
copper  oxide  materials.  Elimination  of  magnetic  susceptibility  in  a  Fermi  liquid 
framework  leads  to  the  prediction  that  electrical  resistivity  is  intimately  linked  with 
nuclear  spin-lattice  relaxation  time  and  empirical  data  is  employed  to  test  this 
relation. 

Experiments  on  Hall  constant  and  thermopower  are  also  briefly  referred  to.  The 
possible  role  of  interlayer  coupling  at  the  lowest  temperatures  encountered  in  the 
normal  state  is  considered,  as  is  a  crossover  from  a  two-dimensional  Fermi  liquid 
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behavior  at  the  higher  temperatures  to  a  phase  with  quite  different  behavior  at  lower 
temperatures.  Properties  near  the  crossover  may  perhaps  be  influenced  by  the 
strength  of  the  interlayer  coupling,  and  some  experiments  are  proposed  to  exam¬ 
ine  this. 


Summary  of  Properties  of  Expanded  Alkali  Metals 

When  heated  from  the  melting  point  to  the  liquid-gas  critical  point,  Cs  undergoes 
a  roughly  fivefold  volume  expansion.  Freyland  has  measured  the  total  magnetic 
susceptibility  along  the  liquid-vapor  coexistence  curve  from  the  melting  point  up 
to  the  critical  region  [1].  It  is  clear  from  these  experiments  that  the  magnetic  prop¬ 
erties  of  the  dilute  metal  exhibit  new  features.  Figure  1  already  shows  the  density 
dependence  of  the  volume  spin  susceptibility  X'(0,  0)  extracted  from  the  measured 
total  susceptibility  [1]  by  correcting  for  the  ionic  and  electronic  diamagnetism  [2,3], 
These  data  show  an  initial  decrease  of  the  susceptibility  with  decreasing  density, 
followed  by  increasing  enhancement  at  lower  densities  until  X'(0,  0)  reaches  a  limit 
imposed  by  the  Curie  susceptibility  evaluated  for  one  electron  per  atom.  133Cs 
Knight  shift  data  shown  later  will  be  seen  to  exhibit  a  similar  enhancement  effect. 

Turning  next  to  structure,  neutron  experiments  of  Jiingst  et  al.  [4]  have  recently 
yielded  the  liquid  structure  factors  of  Cs  at  various  thermodynamic  states  along 
the  liquid- vapor  coexistence  curve.  Motivated  by  this  data  and  its  interpretation, 
theory  and  results  will  be  summarized  on  electrical  resistivity,  in  relation  to  the 
magnetic  susceptibility  already  discussed,  over  a  substantial  density  range.  A  key 
quantity  in  the  interpretation  will  be  the  magnitude  q  of  the  discontinuity  in  the 
electronic  momentum  distribution  at  the  Fermi  surface,  q  being  unity  in  a  non¬ 
interacting  Fermi  gas  (cf.  Fig.  2).  Especially  from  the  magnetic  susceptibility,  when 
used  in  conjunction  with  either  (i)  phenomenology  or  (ii)  elevated  temperature 
heavy  Fermion  theory,  q  can  be  estimated  at  a  characteristic  density,  and  can  be 
compared  with  its  calculated  value  in  the  jellium  model.  This  demonstrates  the 


Figure  1 .  Magnetic  susceptibility  of  expanded  Cs  along  liquid-vapor  coexistence  curve. 
Dashed  curve  depicts  Curie  limiting  law  (after  Warren  [2]). 
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Figure  2.  Schematic  representation  of  electronic  momentum  distribution  n(k)  as  a  function 
of  k/kf,  where  kf  is  the  Fermi  wave  number,  (a)  Depicts  schematically  reduction  from 
unity  (for  Fermi  gas)  of  Fermi  surface  discontinuity  q  due  to  electron-electron  interactions, 
(b)  Shows  further  reduction  in  q,  at  same  density  as  in  (a),  when  jellium  background  is 
replaced  by  granular  ions. 


importance  of  electron-ion  interaction,  as  well  as  electron-electron  correlation,  as 
the  critical  point  of  Cs  is  approached.  It  is  then  shown  that  q  can  be  used  to  forge 
a  somewhat  surprising  link  between  a  transport  property,  electrical  resistivity,  and 
a  static  property,  magnetic  susceptibility.  Experimental  data  are  employed  to  confirm 
this  link  directly. 


Theory  of  Magnetic  Susceptibility 

A  brief  outline  will  first  be  given  of  an  extension  of  the  phenomenological  theory 
of  March  et  al.  [5],  which  was  posed  for  metal-insulator  transitions  occurring  at 
absolute  zero,  to  elevated  temperatures  (see  also  Chapman  and  March  [6],  where 
a  microscopic  theory  based  on  heavy  Fermion  ideas  at  T  +  0  was  developed). 

One  writes,  phenomenologically,  the  free  energy  per  atom  as  an  expansion  in 
the  magnetization  per  atom,  m ,  and  the  discontinuity  (quasiparticle  renormalization 
factor)  q  in  the  electronic  momentum  distribution  n(k).  Thus 

F(m,  q,  T)~E0  +  a(T)m2  +  b(T)q  +  dT)i f  +  •  •  ■  +  e(T)qm2  +  •  •  •  .  (1) 

Here,  the  interpretation  of  q  in  terms  of  the  average  number  of  doubly  occupied 
sites  (see  March  et  al  [5]  at  T  =  0)  is  more  appropriate,  since  the  discontinuity  in 
the  single-particle  occupation  number  will  not  be  such  a  well-defined  quantity  when 
T  ±  0.  Furthermore,  as  T  oo,  one  expects  (Warren  [2])  a  reversion  to  Curie-like 
behavior  in  the  (paramagnetic  contribution  to)  magnetic  susceptibility,  as  the  de¬ 
generacy  temperature  of  the  Fermi  liquid  is  exceeded.  This  is  achieved  if  a(T)  in 
Eq.  (1)  is  proportional  to  T,  with  the  coefficient  e(T)  much  less  strongly  dependent 
on  T,  and  remaining  finite  as  T  -►  0.  Thus,  if  a  =  aT,  then  as  T  -*•  oo,  X  -► 
noPofiB^aT,  and  for  lower  temperatures  X  is  always  less  than  this  limiting  value, 
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for  all  densities.  If  a  is  chosen  as  (\)kB  this  is  simply  the  Curie  law  for  the  electrons. 
In  the  opposite  T  0  limit,  one  has  X  ->  n0fiop2B/2eq  and  one  requires  \/2e 
N(E/),  the  density  of  states  at  the  Fermi  level  in  the  limit  of  high  density  (q  -►  1), 
to  regain  the  Pauli  susceptibility.  A  crossover  between  Curie-  and  Pauli-type  sus¬ 
ceptibility  will  therefore  correspond  to  the  condition  where  a  ~  eq ,  i.e.,  2 eq  ~  kBT. 

Chapman  and  March  have  used  heavy  Fermion  theory  to  estimate  q  at  the  max¬ 
imum  in  X  shown  in  Figure  1  (see  Warren  [2],  Freyland  [1],  and  Bottyan  et  al.  [7]). 
They  find  q  ~  0.18,  a  result  which  has  subsequently  been  confirmed  by  Warren 
[3]  using  nmr  data  (see  also  the  subsection  after  next). 

It  is  important  to  contrast  the  above  with  the  result  of  the  jellium  model  at  the 
same  density:  here,  from  the  work  of  Lantto  [8],  q  is  0.53.  The  conclusion  is  clearly 
that  at  the  density  of  0.8  g  cm-3  corresponding  to  the  maximum  in  X  in  Figure  1, 
both  electron-electron  interactions  (reducing  q  in  jellium  from  1  in  the  noninter¬ 
acting  case  to  0.53  at  0.8  g  cm-3)  and  electron-ion  interactions  (q  ~  0.18  at  same 
density)  are  important  in  determining  the  properties  of  expanded  fluid  Cs. 

Magnetic  Susceptibility  of  Strongly  Correlated  Expanded  Cs  Related  to  Electrical 
Resistivity  near  Criticality 

Having  discussed  theories  of  X  in  the  previous  subsection,  the  purpose  of  this 
section  is  to  expose  an  intimate  link  between  X  and  electrical  resistivity  R ,  which 
is  forged  via  the  discontinuity  q.  In  Figure  2(a),  a  schematic  representation  is  shown 
of  the  electronic  momentum  distribution  n(k)  as  a  function  of  k/kf  in  the  jellium 
model.  Figure  2(b)  depicts  schematically  the  reduction  in  q ,  at  the  same  density  as 
in  Figure  2(a),  due  to  the  introduction  of  granular  ions.  While  in  (a),  it  is  evident 
that  electron-electron  correlation  is  already  strong,  (b)  shows  that  this  is  also  the 
case  for  the  electron-ion  interaction  (at  a  density  ~  0.8  g  cm-3  in  expanded  Cs; 
see  Figure  2). 

To  relate  X  and  R ,  let  us  note  first  from  the  discussion  of  the  previous  subsection 
that  if  the  Curie  limiting  susceptibility  is  denoted  by  Xc,  then 


leading  to  X  tending  to  the  dashed  curve  in  Figure  1  as  q  is  reduced  on  approaching 
the  critical  point. 

Turning  to  the  electrical  resistivity  R ,  it  has  to  be  said  that  there  is  presently  no 
fully  satisfactory  electronic  transport  theory  for  strongly  correlated  electrons. 
Therefore,  to  gain  orientation,  as  well  as  to  provide  motivation  for  a  particular 
presentation  of  experimental  data  on  R  and  X  below,  let  us  employ  the  independent 
electron  force-force  correlation  function  formula  of  Rousseau,  Stoddart,  and  March 
(rsm)  [9].  The  one-body  nature  of  this  equation  is  reflected  in: 

(i)  The  appearance  of  a  one-body  scattering  potential  V{r). 

(ii)  The  use  of  the  Dirac  density  matrix  p,  which  satisfies  the  idempotency  con¬ 
dition  p2  =  p. 
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As  to  (i),  it  is  to  be  noted  that  one  has  the  equation  of  motion  for  the  Dirac  density 
matrix  as 


V?,p  -  %p  =  ^y[K(r,)-  V(r2  )]/>.  (3) 

Dividing  both  sides  by  p,  one  can  then  form  both  the  forces  dV(r\  )jd ti  and  dV(x2)l 
dr2  solely  in  terms  of  the  Dirac  density  matrix  p ,  and  hence  these  forces  can  be 
eliminated  from  the  rsm  formula.  This  leads  to  a  formula  for  the  electrical  resistivity 
Ry  which  is  solely  a  functional  of  the  Dirac  density  matrix. 

However,  for  orientation,  let  us  now  assume  that  R  can  be  characterized  in  the 
many-electron  case  by  replacing  the  Dirac  density  matrix  p  above  by  the  first-order 
density  matrix  7,  which  satisfies  y2  <  y,  essentially  because  of  the  Pauli  exclusion 
principle.  With  jellium  as  an  example,  one  has 

Pooc  2  exp(/kT!  -  r2),  (4) 

I  k  |  <,kf 

whereas 

To  oc  2  n(k)  exp(/k*  ri  -  r2).  (5) 

all  A: 

The  final  step  then  is  to  expand  R  =  R[y{ri ,  r2)]  when  q  is  small  as 

R  =  R0  +  qRi  +  •  •  •  .  (6) 

Since,  as  noted  in  Eq.  (2),  (1/X  -  \/Xc)  provides  an  empirical  measure  of  the 
discontinuity  q,  substitution  in  the  (assumed)  expansion  (6)  motivates  the  final  plot, 
using  experimental  data,  of  R  vs.  (1/X  -  1/XC).  There  is  a  striking  correlation,  as 
shown  in  Figure  3. 

Knight  Shift  and  Mass  Enhancement 

Independent  support  for  the  use  of  elevated  temperature  heavy  Fermion  theory 
(Chapman  and  March  [6])  for  expanded  Cs  comes  from  work  reported  very  recently 
by  Warren  [3],  who  analyzed  Knight  shift  data  on  this  system.  The  summary  below 
is  based  on  his  work,  and  strongly  supports  the  conclusions  of  Chapman  and  March. 

Brinkman  and  Rice  [10]  predicted  an  enhancement  of  the  effective  mass  (m*) 
as  a  consequence  of  the  reduced  instantaneous  fraction  of  doubly  occupied  sites. 
The  susceptibility  and  Knight  shift  data  shown  in  Figures  1  and  4  thus  immediately 
raise  the  question:  Given  the  conventional  description  of  the  magnetic  properties 
of  the  dense  metal,  is  the  enhancement  of  X'(0,  0)  observed  in  the  dilute  metal  a 
result  of  increase  of  the  interaction  parameter  a  or  an  enhancement  of  the  effec¬ 
tive  mass? 

The  apparent  limitation  of  the  susceptibility  by  the  Curie  value  suggests  the 
answer.  Because  the  Stoner  parameter  can,  in  principle,  approach  arbitrarily  close 
to  unity,  there  is  no  reason  for  the  Stoner-enhanced  susceptibility  to  be  limited  at 
the  Curie  value.  A  mass-enhanced  susceptibility,  on  the  other  hand,  cannot  exceed 
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Figure  3.  Shows  relation  between  measured  electrical  resistivity  R  and  deviation  from 
Curie  law  (I/x  —  1/XC).  Plot  is  motivated  by  elimination  of  Fermi  surface  discontinuity  q 
between  Eqs.  (2)  and  (6). 


the  Curie  value.  Increase  of  m*  is  necessarily  accompanied  by  a  decrease  in  the 
Fermi  energy  as  the  bands  narrow,  and,  when  EF  approaches  kBT ,  a  Curie  suscep¬ 
tibility  is  to  be  expected. 

The  enhancement  of  the  effective  mass  can  be  estimated  using  a  free  electron 
form  of  the  electronic  density  of  states  with  the  mass  ratio  m*  as  a  parameter. 
Results  of  this  simple  model  plotted  in  Figure  5  show  a  striking  increase  from 
m*  ~  1  at  a  mass  density  p  -  1.3  g  cm-3  to  m*  ~  5  at  p  =  0.8  g  cm"3.  As  discussed 


Figure  4.  Knight  shift  data  on  expanded  liquid  Cs  vs  density.  Data  obtained  at  different 
pressures  exhibited  on  plot.  (After  Warren  [3].) 
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Figure  5.  Enhancement  of  effective  electron  mass  estimated  using  a  free  electron  form 
of  the  electronic  density  of  states.  Circles  are  from  work  of  Warren  [3].  Cross  obtained 
earlier  by  Chapman  and  March  [6]  at  density  of  0.8  g  cm-3. 


above,  Chapman  and  March  [6]  earlier  obtained  a  similar  mass  enhancement  at 
0.8  g  cm-3  using  a  finite-temperature  extension  of  the  Brinkman-Rice  theory. 

Recent  measurements  [1 1]  of  the  optical  reflectivity  of  expanded  Cs  provide  an 
independent  confirmation  of  the  onset  of  mass  enhancement.  The  plasma  frequency 
wp  =  47 ml/mefr  is  determined  by  the  electron  density  ne  and  the  effective  mass  and 
can  be  extracted  from  the  Drude  contribution  of  the  optical  response.  “Optical” 
values  of  m*  have  been  derived  from  the  reflectivity  data  and  can  be  compared 
with  the  “magnetic”  masses  in  the  same  density  range.  While  there  is  a  systematic 
difference  of  roughly  20%  between  the  magnetic  and  optical  masses,  the  latter  show 
a  clear  onset  of  enhancement  at  the  same  density  as  the  susceptibility  enhancement 
onset.  From  two  distinctly  different  experiments,  therefore,  one  now  has  clear  ev¬ 
idence  for  the  enhancement  of  the  electron  effective  mass  in  the  dilute  metal  (see 
also  Warren  [3]). 

A  second  predicted  feature  of  the  highly  correlated  metal  is  the  development  of 
antiferromagnetism  [3].  This  behavior  is  quite  different  from  the  dense  metal  where, 
as  discussed  in  the  previous  section,  electron  interactions  push  the  system  in  the 
direction  of  ferromagnetism.  Korringa  ratios  tj  <  1  provide  clear  experimental 
signatures  of  ferromagnetic  correlation  effects  in  the  alkali  metals  at  normal  densities. 
Figure  6,  following  Warren  [3],  shows  the  behavior  of  the  Korringa  ratio  in  expanded 
Cs  as  a  function  of  density.  Below  a  mass  density  of  about  1 .4  g  cm-3,  y\  begins  to 
increase,  and  values  exceeding  unity  are  observed  at  the  lowest  densities.  This  result 
is  clearly  inconsistent  with  the  generalized  Stoner  picture,  for  which  increasing 
interaction  effects  would  lower  the  Korringa  ratio  (cf.  Warren  [3]). 

The  increase  in  Korringa  ratio  at  low  density  reveals  a  change  in  the  charac¬ 
ter  of  spin  correlations  from  the  ferromagnetic  tendency  in  the  dense  metal  to 
antiferromagnetic-like  correlation  in  the  dilute  conducting  phase.  Values  of  rj  >  1 
require  an  increase  in  X"(Q,  oo0)  at  nonzero  Q  relative  to  the  Q  =  0  value.  This 
implies  an  antiferromagnetic  fluctuation  peak  at  some  value  of  Q  which,  if  the  peak 
were  to  diverge,  would  correspond  to  the  antiferromagnetic  Q  vector.  It  is  not 
possible  to  determine  the  detailed  Q  dependence  of  X'XQ,  co0)  from  the  nmr  data 
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Figure  6.  Behavior  of  Korringa  ratio  in  expanded  liquid  metal  Cs  as  a  function  of  density. 
Increase  at  low  density  heralds  change  in  nature  of  spin  correlations  from  “ferromagnetic- 
like”  in  dense  metal  to  “antiferromagnetic-like”  behavior  in  dilute  Cs.  (Redrawn  from 

Warren  [3].) 


alone,  but  Warren  has  suggested  a  model  dependence  that  is  consistent  with  the 
observed  77  values  (see  Fig.  6).  It  seems  from  Warren’s  work  likely  that  the  system 
is  far  from  an  antiferromagnetically  ordered  state. 

High  Temperature  Superconductors:  Relation  between  Transport  and  Magnetism 

Concerning  high  Tc  copper  oxides,  the  author  [12]  has  recently  emphasized,  in 
a  related  context,  the  importance  for  the  properties  of  the  electron  or  hole  liquids 
in  the  normal  state  that  these  are  flowing  through  antiferromagnetic  assemblies  of 
copper  spins.  To  press  in  this  case  the  correlation  referred  to  above  between  electrical 
resistivity  and  magnetic  susceptibility,  a  useful  starting  point  is  afforded  by  the  two- 
dimensional  Fermi  liquid  study  of  Kohno  and  Yamada  [13].  These  workers  link 
the  electrical  resistivity  R  with  the  magnetic  susceptibility  X(Q),  where  Q  is  the 
antiferromagnetic  wave  vector.  In  particular,  the  analysis  of  Kohno  and  Yamada 
leads  to  the  proportionality 

R  oc  T2X( Q).  (7) 

They  note  then  that  if  the  further  assumption  is  made  of  Curie-Weiss  form  C/ 
(T  +  6)  for  x(Q)  with  0  >  0,  then  one  has  for  T  >  6  that  R  oc  T,  which  is  an 
experimental  finding  over  a  substantial  range  of  temperature  in  the  normal  state 
of  high  Tc  superconductors  (see,  for  example,  Fig.  3  of  Ref.  [12]). 

Following  Egorov  and  March  [14],  one  can  test  the  form  (7)  of  Kohno  and 
Yamada  [  1 3]  by  linking  it  with  a  quite  different  property,  namely,  the  nuclear  spin- 
lattice  relaxation  time  Tx  at  copper  sites.  The  above  workers  use  their  same  Fermi 
liquid  analysis  to  link  Tx  with  X(Q)  via 
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{TyT)~x  oc  X(Q).  (8) 

Substituting  the  form  X(Q)  in  Eq.  (8)  into  Eq.  (7),  one  is  led  to  the  further  prediction 

RTy  oc  T,  (9) 

and  experimental  values  of  R  and  T{  have  been  used  to  construct  Figure  7.  The 
prediction  (9)  is  well  borne  out  at  the  higher  temperatures  in  the  plot  of  Figure  7, 
but  there  is  a  marked  deviation  from  the  two-dimensional  Fermi  liquid  prediction 
(9)  at  the  lowest  temperatures  shown. 

This  behavior  will  now  be  shown  to  have  parallels  in  the  forms  of  Hall  coefficient 
and  thermopower  over  a  similar  temperature  range. 


Hall  Coefficient  and  Thermopower 

Returning  to  the  introductory  points  made  on  strongly  correlated  electrons  in 
the  liquid  alkalis  approaching  criticality,  the  author  [15]  has  argued  elsewhere, 
again  from  Fermi  liquid  theory,  that  the  Hall  coefficient  Rh  is  itself  closely  connected 
with  electrical  resistivity  R.  However,  though  Kohno  and  Yamada  [16]  have  given 
a  general  expression  for  the  Hall  coefficient  based  on  Fermi  liquid  theory,  the  sit¬ 
uation  is  not,  it  would  appear,  sufficiently  well  understood  to  lead  to  an  immediate 
relation  to  X(Q)  appearing  in  Eqs.  (7)  and  (8).  Therefore,  we  shall  here  content 
ourselves  with  noting  that  the  various  curves  of  RH  can  be  scaled  approximately. 
If,  as  noted  in  [12],  for  example,  one  considers  Rj/  instead  of  RH  then  the  slope  of 
RJ /  correlates  with  the  superconducting  transition  temperature  Tc. 


16.0 

—  14.0 

(0 

E 

u  12.0 

—  • 

a 

=L 

• 

6  100 

_ 

• 

’Y-* 

• 

• 

,_7  8.0 

• 

—  • 

cr 

•  • 

6.0 

• 

•  •* 

4.0 

;•  .  .  ■ 

100  200  300  400  500 

Temperature  (K) 

Figure  7.  Experimental  data  for  electrical  resistivity  R  and  nuclear  spin-lattice  relaxation 
time  T\  at  copper  sites  for  normal  state  of  high  Tc  Y  Ba2  Cu4  Og.  Plot  shown  is  motivated 
by  two-dimensional  Fermi  liquid  prediction  in  Eq.  (9).  Departure  from  Fermi  liquid  is 
evident  below  100  K  (see  Egorov  and  March  [14]). 
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Figure  8.  Behavior  of  Hall  coefficient  RH  for  high  Tc  copper  oxides  (compare  Egorov 
and  March  [14])  vs.  temperature. 


The  turning  points  of  the  Hall  coefficient  and  the  thermopower,  plotted  in  Figures 
8  and  9,  respectively,  are  in  the  same  temperature  regime  as  the  minimum  in  Figure 
7.  It  is  tempting  then  to  ascribe  all  the  turning  points  to  a  common  feature.  While 
various  possible  explanations  come  to  mind,  e.g.,  the  transition  from  Fermi  liquid 
to  Luttinger  liquid  behavior,  evidence  will  be  presented  below  that  is  consistent 
with  the  assumption  that  one  is  observing,  in  Figure  7  and  possibly  also  in  Figures 
8  and  9,  “crossover”  behavior  from  a  strongly  correlated  two-dimensional  Fermi 
liquid  behavior  at  the  higher  temperatures  to  a  new  phase,  with  quite  different 


Figure  9.  Similar  to  Figure  8  but  now  for  thermopower. 
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behavior  at  the  lower  values  of  T  in  Figure  7.  It  is  natural  then  to  look  for  evidence 
for  the  importance  of  interlayer  coupling. 

Crossover  Behavior  and  Interlayer  Coupling 

While  Harshman  and  Mills  [17]  have  given  arguments  for  the  relevance  of  in¬ 
terlayer  coupling  in  high  Tc  materials,  the  work  of  a  Japanese  group  [  1 8]  has  dem¬ 
onstrated  more  recently  a  correlation  between  Tc  itself  and  the  strength  of  the 
interlayer  coupling.  It  seems  clear,  therefore,  that  it  will  be  interesting  for  the  future 
if  measurements  can  be  made  to  reproduce  Figures  7-9  for  high  Tc  materials  with 
different  interlayer  coupling  strengths.  The  test  of  the  “crossover”  behavior  above 
is  whether  then  the  “turning  point”  temperatures  Tt  prove  to  move  in  a  way  cor¬ 
related  with  the  strength  of  the  interlayer  coupling. 

Discussion  and  Summary 

It  seems  remarkable  that,  at  least  in  the  expanded  alkalis  and  in  high  Tc  materials 
in  the  normal  state,  there  is  an  intimate  relation  between  nonequilibrium  transport 
properties  and  a  thermodynamic  quantity,  the  magnetic  susceptibility.  The  origin 
in  each  case  can  be  traced  directly  to  Fermi  liquid  behavior:  In  high  Tc  to  two- 
dimensional  behavior  and  its  antiferromagnetic  consequences. 

But  it  is  important  to  understand  the  “crossover”  behavior  in  Figures  7-9  from 
a  fundamental  point  of  view.  This  behavior  may  perhaps  be  explained  in  terms  of 
interlayer  coupling,  and  the  consequent  change  from  two-dimensional  Fermi  liquid 
behavior  to  a  new  phase  with  quite  different  behavior  as  the  temperature  is  lowered. 
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Abstract 

A  methodology  to  characterize  large-scale  shape  of  macromolecular  conformations  is  generalized  and 
applied  to  a  simple  model  of  branched  polymers.  The  standard  global  analysis  of  polymer  configurations 
uses  geometric  shape  descriptors,  such  as  the  mean  radius  of  gyration.  However,  geometric  descriptors 
alone  do  not  depict  well  the  intrincate  folding  features  found  in  many  macromolecules.  For  a  more 
complete  characterization,  we  employ  here  a  new  class  of  shape  descriptors  which  convey  the  degree  and 
complexity  of  the  “entanglements”  in  a  branched  chain.  Recently,  the  methodology  has  been  applied 
to  study  statistical  properties  of  linear  polymer  chains.  In  the  present  work,  we  extend  the  technique  to 
branched  polymers  and  analyze  the  configurational  and  long-chain  behavior  of  their  entanglement  de¬ 
scriptors.  A  family  of  “stars”  (polymers  with  a  single  branchpoint)  with  excluded  volume  interaction  is 
studied,  and  the  results  are  contrasted  with  those  for  linear  chains.  The  relation  between  changes  in 
geometric  and  shape  descriptors  within  the  polymer  conformational  space  is  presented.  The  complexity 
of  entanglements  for  large  chains  and  large  stars  are  compared.  The  results  suggest  a  similar  scaling 
behavior  of  the  entanglement  descriptors  for  the  two  polymer  architectures.  ©  1994  John  Wiley  & 
Sons,  Inc. 


Introduction 

Assessing  global  features  of  three-dimensional  (3D)  macromolecular  shape  is 
essential  for  an  understanding  of  polymer  structure  and  function.  Physical  properties 
such  as  elasticity,  viscosity,  and  melting  points  can  be  related,  to  some  extent,  to 
the  nature  of  large-scale  entanglements  of  single  polymer  configurations,  in  addition 
to  local  and  interinolecular  interactions  [1,2].  Molecular  recognition  and  catalytic 
functions  of  enzymes  are  also  strongly  related  to  the  global  folding  features  of  the 
macromolecular  backbone  [3,4]. 

Standard  tools  for  the  global  analysis  of  polymer  configurations  include  geometry- 
derived  parameters,  such  as  the  mean  radius  of  gyration  and  the  mean  end-to-end 
distance  in  the  chain  [1,2].  These  geometric  descriptors  convey  mostly  size  and,  to 
some  degree,  the  compactness  of  the  chain’s  configuration.  Most  of  these  descriptors 
can  be  measured  experimentally,  and  they  allow  one  to  follow  the  polymer’s  response 
to  exterior  variables.  For  instance,  several  phase  transitions  in  polymers  (e.g.,  swell¬ 
ing)  involve  configurational  transitions  in  single  molecules  [5,6].  These  transitions 
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are  marked  by  a  rapid  change  in  geometric  descriptors  as  a  function  of  an  external 
parameter,  such  as  the  temperature  of  a  thermostat  exchanging  heat  with  the 
molecule. 

A  more  informative  analysis  of  polymer  shape  is  achieved  by  monitoring  both 
geometric  and  topological  properties.  Topological  properties  include  features  such 
as  knotting  and  entanglements  in  closed-loop  polymers  [6-8].  Recently,  a  char¬ 
acterization  of  global  folding  features  of  macromolecules  has  been  proposed,  based 
on  the  occurrence  of  crossings  in  the  two-dimensional  (2D)  projections  of  a  rigid 
backbone  [9].  From  this  information,  one  derives  the  probability  of  overcrossings 
of  a  rigid  polymer  configuration  [10],  which  provides  a  description  of  the  complexity 
of  self  entanglements  (or  degree  of folding)  in  molecular  chains.  A  related  approach 
has  been  presented  for  self-avoiding  walks  [11].  This  methodology  presents  some 
advantages  over  purely  topological  approaches:  the  analysis  uses  (and  preserves) 
the  geometry  of  a  rigid  configuration,  whereas  the  features  described  go  beyond 
size  and  compactness.  The  term  geometric  descriptor  will  be  reserved  to  parameters 
which  depend  only  on  the  polymer  geometry.  The  term  shape  descriptor  will  be 
used  for  the  parameters  which  depend  on  both  molecular  geometry  and  backbone 
connectivity.  The  descriptors  of  entanglement  complexity  we  study  in  this  work 
belong  to  the  latter  class  and  therefore  combine  aspects  of  the  geometry  and  the 
topology  of  the  polymer  chain. 

In  Ref.  [12],  we  have  shown  evidence  that  a  marked  change  in  entanglement 
descriptors  also  accompanies  a  configurational  transition  in  a  linear  polymer,  caused 
by  either  a  temperature  change  or  a  change  in  excluded  volume.  For  this  reason, 
the  fluctuations  in  the  shape  descriptors  along  dynamic  trajectories  have  been  used 
to  monitor  molecular  flexibility  and  detect  configurational  transitions  [12,13]. 

Similarly,  averaging  shape  descriptors  over  molecular  conformations  provides 
insights  on  the  configurational  state  of  the  polymer  [12,14].  The  change  in  config¬ 
urational  averages  over  various  parameters  (e.g.,  temperature,  chain  length)  is  a 
valuable  piece  of  information.  The  configurationally  averaged  geometric  descriptors, 
such  as  the  radius  of  gyration,  satisfy  simple  power  laws  in  terms  of  the  number  of 
monomers  [5,15].  Similar  scaling  behavior  has  been  found  for  shape  descriptors  of 
linear  chains,  although  with  different  exponents  and  small  dependence  on  excluded 
volume  [14,16]. 

Statistical  properties  of  geometric  descriptors  of  polymer  architectures  other  than 
the  linear  one  have  been  studied  in  the  past  few  years  [17-19,  and  refs,  therein]. 
Special  attention  has  been  given  to  branched  structures  because  these  are  often 
found  experimentally  in  liquid  crystal  polymers.  These  polymer  architectures  (also 
referred  to  as  “topologies”)  include  [17]:  stars  (polymers  with  several  branches 
stemming  from  a  single  branchpoint),  combs  (two  or  more  single-branch  branch¬ 
points,  with  either  regular  or  random  positions  for  branches  of  various  lengths), 
and  brushes  (several  multibranch  branchpoints,  with  either  free  branches  or  branches 
grafted  onto  a  surface).  In  cases,  the  scaling  behavior  of  geometric  descriptors  is 
the  same  as  for  linear  polymers  [18,19].  Differences  are  found  for  some  polymer 
architectures  with  special  constraints  in  the  number  and  length  of  the  branches 
[17].  No  information  is  available  so  far  for  descriptors  other  than  geometric  ones. 
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In  this  work,  we  begin  the  study  of  branched  polymers  from  the  viewpoint  of 
entanglement  descriptors.  In  the  next  section  we  discuss  the  extension  of  the  meth¬ 
odology  to  the  analysis  of  nonlinear  polymers  and  test  the  accuracy  of  the  config¬ 
urational  averages  of  their  geometric  and  shape  descriptors.  Another  section 
compares  the  conformations  of  linear  polymers  and  “regular”  stars  with  excluded- 
volume  interaction.  The  relation  between  geometric  and  shape  descriptors  of  these 
configurations  is  discussed  and  some  approximate  scaling  is  established.  A  further 
section  compares  shape  descriptors  of  stars  and  chains  with  the  same  (large)  number 
of  monomers  and  discusses  their  scaling  behavior.  Conclusions  and  aspects  for 
further  research  are  found  in  the  closing  section. 

Global  Descriptors  of  Entanglements  in  Linear  and  Branched  Polymers 

In  this  work,  degree  and  complexity  of  entanglements  in  a  single  polymer  is  used 
as  equivalent  to  a  “degree  of  folding”  of  a  rigid  configuration.  Here,  a  molecular 
backbone  is  more  “entangled”  the  more  complex  and  numerous  are  the  turns, 
twists,  or  folds  in  the  chain.  In  this  context,  we  describe  the  macromolecular  shape 
by  using  an  intuitive  notion.  The  more  entangled  a  rigid  backbone,  the  larger  the 
mean  number  of  “bond-bond  crossings  ”  that  it  would  exhibit  when  projected  onto 
two  dimensions.  This  mean  number  can  be  derived  from  the  probability  of  observing 
a  given  number  of  branches  crossing  over  each  other  when  the  rigid  polymer  chain 
is  “photographed”  from  an  arbitrary  viewing  direction  [10,12]. 

The  quantitative  evaluation  of  the  probabilities  of  observing  projected  crossings 
has  been  discussed  in  detail  elsewhere  for  linear  polymers  [10,13].  The  extension 
for  branched  polymers  is  straightforward.  When  projected  onto  a  plane,  a  polymer 
with  an  arbitrary  monomer  connectivity  will  exhibit  a  number  of  “double  points” 
[7]  produced  by  two  or  more  bonds  crossing  over  each  other.  For  this  reason,  these 
double  points  are  here  referred  to  as  overcrossings.  Overcrossings  can  be  due  to  two 
bonds  belonging  to  the  same  branch  of  the  polymer  or  to  two  different  branches. 

From  our  viewpoint,  the  number  of  observed  overcrossings  characterizes  a  given 
2D  projection.  The  average  of  this  number  over  all  possible  projections  characterizes 
the  polymer  at  the  given  configuration.  Finally,  the  average  of  the  mean  num¬ 
ber  of  overcrossings  over  all  possible  polymer  configurations  is  a  characteristic 
of  the  topology,  number  of  monomers,  and  physical  parameters  defining  the 
macromolecule. 

The  actual  descriptors  of  entanglement  complexity  are  as  follows.  Let  n  be  the 
number  of  monomers  (or  main  chain  “atoms”)  for  a  generic  polymer.  The  prob¬ 
ability  of  observing  N  overcrossings  ( N  >  0)  at  a  given  polymer  configuration  Kt 
will  be  indicated  as  A^n).  [If  required,  the  dependence  with  the  configuration  will 
be  denoted  by  Ajfnj).]  The  distribution  of  overcrossing  probabilities  satisfies: 

maxAr 

2  Mn)  =1,  V«  a  3,  (1) 

N=  0 

where  max  N  depends  on  the  polymer  architecture  (i.e.,  monomer  connectivity). 
For  linear  chains,  max  N={n-  2)(n  -  3)/2.  As  discussed  elsewhere,  the  distribution 
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{Aj^ri)}  is  a  global  shape  descriptor  [10],  which  conveys  aspects  of  the  polymer’s 
folding.  A  configuration  exhibiting  little  degree  of  entanglement  is  characterized 
by  high  Ajfn)  values  for  low  N  (i.e.,  it  will  appear  usually  in  placements  with  few 
overcrossings).  In  this  work,  we  shall  use  only  two  shape  descriptors  derived  from 
{Ajfri)},  namely,  the  probability  (A*)  of  the  most  probable  number  of  overcrossings 
(N*):  ' 


A*'=  Ajy*(n )  =  max  A?fri) 

{N} 


(2) 


and  the  mean  number  of  overcrossings  N: 

ma  xN 

N=  2  NAdn)  (3) 

A^O 

The  conformational  dependence  of  these  two  shape  descriptors  will  be  indicated 
by  (^4*)/  and  (TV),-  for  a  generic  configuration  Kit  whenever  needed. 

The  actual  computation  of  overcrossing  probabilities  for  branched  polymer  con¬ 
figurations  can  be  carried  out  as  discussed  in  Refs.  [9]  and  [10],  with  only  minor 
changes  to  the  algorithm.  For  each  polymer  configuration,  the  probabilities  {Ajfn)} 
and  the  mean  number  N  are  evaluated  in  this  work  up  to  three  significant  figures. 

We  have  computed  the  shape  descriptors  A*  and  N  for  a  series  of  branched 
polymer  configurations.  Architectures  for  branched  polymers  up  to  six  monomers 


Labels  for  some  simple  branched  polymers 


(4,0)  AA  (5,0) 

(6,0) 

(4,1  [2])  X'kx’  (5,1  [2]) 

(6,1  [2]) 

(5,2[2,2]) 

"Y' 

(6,1 [3]) 

>0 

(6,2[2,2]) 

"V 

(6,2[2,3)) 

(6,3[2,2,2]) 

Figure  1.  List  of  polymer  architectures  with  number  of  main  chain  atoms  n  <1 .  [See 
text  for  the  labels  ( n ,  b[zx ,  z2, . . .  ]).  These  simple  branched  polymers  are  used  to  test  the 
accuracy  of  the  conformational  searches.] 
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are  listed  in  Figure  1.  For  simplicity,  we  shall  identify  these  polymer  “topologies” 
with  the  label  ( n ,  b[zu  z2,  ■  •  ♦]),  where  n  is  total  number  of  monomers*  b  is  the 
number  of  branches  attached  to  the  main  chain,  and  zlf  z2, . . .  label  the  location 
of  the  main  chain  atoms  where  the  branches  occur.  This  simple  nomenclature  is 
enough  for  the  present  purposes.  In  Figure  1,  the  polymers  ( 4 ,  1[ 2]),  (5,  2[ 2,2]),  (6, 

2,2,2])  identify  stars,  since  they  have  a  single  branchpoint  (“atom”  2).  Moreover, 
these  are  regular  stars  [16],  because  the  length  of  each  branch  is  identical.  In  later 
sections,  we  shall  use  the  term  “star”  with  the  restricted  meaning  of  a  regular  3- 
star ,  a  branched  polymer  with  three  segments  of  the  same  length  stemming  from 
the  single  branchpoint.  [Note  that  b  -  1  for  3-stars.] 

For  a  given  polymer  architecture,  the  actual  configurations  are  generated  by 
constrained  random  walks,  with  a  purely  repulsive  interaction  between  monomers 
due  to  excluded  volume.  When  generating  these  configurations,  the  constraints  are 
as  follows:  (1)  The  distance  between  any  two  bonded  atoms  is  a  constant  £  (i.e.,  a 
constant  Kuhn’s  length  [5]);  (2)  The  distance  between  any  two  nonbonded  atoms 
must  be  larger  than  or  equal  to  a  constant  rex,  the  radius  of  the  excluded  volume. 
Otherwise,  the  positions  of  the  atoms  are  arbitrary.  The  actual  algorithm  is  discussed 
in  more  detail  in  Refs.  [12]  and  [14]. 

Due  to  the  simple  form  of  the  potential  energy  for  the  monomer  interaction,  all 
configurations  which  satisfy  the  above  two  constraints  have  the  same  statistical 
weight.  Therefore,  the  configurational  averages  correspond  to  the  microcanonical 
ensemble,  and  the  averaged  shape  descriptors  can  be  expressed  as  follows: 


M 


M 

M 


2  (A*), 

i=1 

(4a) 

M 

2  (AO,- 
/=1 

(4b) 

where  M  (M  >  1 )  is  the  number  of  (random)  polymer  configurations  generated. 
[In  practice,  we  have  added  configurations  until  reaching  a  desired  stability  in  the 
averages  (A*)  and  (N).]  The  fluctuations  in  the  molecular  shape  can  be  represented 
by  standard  deviations  in  the  shape  descriptors,  such  as  o>  =  ((A*2)  —  (A*)2)1/2. 

The  instantaneous  and  configurationally  averaged  values  of  the  geometric  de¬ 
scriptors  are  evaluated  in  the  same  manner.  In  what  follows,  we  shall  compare  the 
behavior  of  (A*)  and  (N)  with  that  of  the  radius  of  gyration.  The  instantaneous 
value  of  the  radius  of  gyration  for  a  given  configuration  is  indicated  as  RG ,  and  its 
configurational  average  as  <RC).  Another  useful  geometric  descriptor  is  R,  the  radius 
of  the  smallest  sphere  (centered  at  the  center  of  mass  of  the  chain)  that  encloses 
the  polymer  completely  [12].  Its  corresponding  average  will  be  indicated  by  (. R ). 

Each  averaged  geometric  and  shape  descriptor  is  obtained  for  a  given  topology, 
number  of  monomers  n,  and  given  values  for  the  parameters  £  and  rex.  [Note  that 
the  actual  results  depend  only  on  the  ratio  rex/£.]  A  useful  parameter  combining 
the  latter  is  y  =  rex/rc,  where  rc  is  the  critical  (or  maximum  possible)  radius  of 
excluded  volume.  For  radius  of  excluded  volume  rex  >  rc,  no  polymer  configuration 
can  be  found  where  all  atoms  satisfy  the  constraints  mentioned  above.  This  critical 
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Table  I.  Convergence  of  configurational  averages  of  the  radius  of  gyration 
(( Rg )),  its  fluctuation  (o-RG),  and  the  minimum  spherical  radius  ((/?))  for 
the  branched  ’’polymer”  (4,1  [ 2])  (n  =  4  with  one  branch,  b  =  1).  (All  data 
are  given  in  A.  A  constant  interatomic  distance  t  =  1.53  A  is  used.  The 
exact  limit  corresponds  to  the  single  plane-trigonal  configuration  found  for 
maximum  excluded  volume.) 


rex 

(Rc) 

0RG 

(R) 

0.001 

1.14 

0.14 

1.48 

0.500 

1.16 

0.12 

1.51 

1.500 

1.226 

0.069 

1.55 

2.300 

1.301 

0.021 

1.567 

2.500 

1.315 

0.009 

1.548 

2.65004 

1.3250 

0.0000 

1.530  (Exact  values) 

value  rc  depends  on  the  chosen  architecture.  A  simple  analysis  gives  the  following 
values  in  some  particular  cases:  rc  =  It  sin(01/2)  for  linear  chains,  rc  =  It  sin(02/ 
2)  for  3-stars,  rc  =  2 1  sin(03/2)  for  4-stars,  where  </>,-  =  arcos(— 1//),  for  i  =  1,  2,  3. 
With  this  notation,  the  completely  random  chains  (no  excluded  volume)  correspond 
to  y  =  0;  in  this  case,  the  whole  configurational  space  is  accessible.  On  the  other 
hand,  the  polymer  spans  the  smallest  possible  configurational  space  at  y  =  1 . 

Note  that  y  =  1  in  a  linear  chain  corresponds  to  precisely  one  configuration,  the 
rod-like  chain.  For  branched  polymers  this  may  not  need  the  case,  and  one  can 
find  infinitely  many  configurations  at  the  critical  limit  y  -*■  1 .  In  some  special  cases, 
the  values  of  the  geometric  and  shape  descriptors  can  be  known  exactly  at  this 
limit,  thus  providing  a  test  for  the  accuracy  of  their  averages  and  the  configurational 
search. 

Consider  the  simplest  regular  3-star,  corresponding  to  (4,  1[ 2])  in  Figure  1.  In 
this  case,  no  bond  can  overcross  another  since  any  two  bonds  lie  always  on  one 
plane.  Therefore;  the  probability  of  overcrossings  will  be  AN  =  d0N,  where  d0N  is  the 
Kronecker  delta,  for  any  configuration  and  any  rex  value.  The  shape  descriptors  in 
this  case  will  be: 

(A*)  =  1,  (N)  =  0,  for  the  3-star  with  n  =  4,  all  y  (5) 

with  the  configurational  fluctuations  in  the  descriptors  (standard  deviations)  exactly 
zero.  In  contrast,  the  value  of  the  radius  of  gyration  depends  on  y.  In  the  limit 
y  — ►  1,  this  particular  3-star  is  forced  to  adopt  a  single  plane-trigonal  configuration 
with  bond  angles  of  exactly  120°.  In  this  case,  the  two  geometric  descriptors  (RG) 
and  (R)  are: 

lim  (Rg)  =  3I/2//2,  lim  (R)  =  t,  for  a  3-star  with  n  =  4  (6) 

y\\  Vt  1 

We  have  verified  how  the  averages  reach  these  limit  values  with  our  algorithm  of 
configurational  search.  For  actual  calculations,  we  use  the  value  t  =  1.53  A 
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throughout,  which  corresponds  to  a  typical  carbon-carbon  single-bond  distance  in 
polymers.  The  results  for  (Rg),  (R),  and  the  fluctuation  aRc  for  the  n  =  4  star  are 
given  in  Table  I.  The  averages  have  been  computed  by  including  enough  configu¬ 
rations  to  make  stable  the  number  of  significant  figures  shown  in  Table  I.  This 
typically  involves  some  1000  configurations.  Even  for  this  very  small  system,  the 
calculations  become  computer-intensive  near  the  critical  rc  value,  and  the  limit 
itself  cannot  be  reached  numerically.  For  instance,  the  computation  of  averages  for 
rex  =  2.3  A  requires  ca.  2  h  CPU  on  a  vax-4000  computer.  The  numerical  results 
in  Table  I  tend  to  the  limit  (6).  [They  also  verify  the  behavior  (5)  for  all  configu¬ 
rations.]  It  is  noted  that  (RG)  and  (R)  do  not  behave  equally  in  terms  of  the 
excluded  volume,  contrary  to  what  is  found  in  linear  polymers  [12,14].  The  behavior 
of  (R)  is  remarkable:  it  exhibits  a  maximum  as  a  function  of  rex. 

A  similar  analysis  can  be  made  for  the  special  class  of  ( n  -  l)-stars  with  only  n 
atoms  (i.e.,  b  -  n  —  3).  For  these  branched  polymers,  the  results  (5)  hold  for  all 
configurations.  Similarly,  one  can  compute  the  exact  limits  of  the  geometric  de¬ 
scriptors  at  the  critical  radius  of  excluded  volume,  since  a  single  conformation  is 
found.  For  the  polymer  (5,  2[ 2,2])  (see  Fig.  1)  this  configuration  is  a  tetrahedron, 
and  one  finds: 

lim  (Rg)  =  2^/5 1/2,  lim  (R)  =  £,  for  a  4-star  with  n  =  5  (7) 

vfi  rfi 

The  results  for  this  polymer  are  found  in  Table  II,  with  £  —  1.53  A.  The  numerical 
averages  slowly  reach  the  limit  (7).  The  behaviors  of  (Rg)  and  (R)  are  similar  to 
those  in  Table  I. 

For  all  other  polymer  architectures  in  Figure  1,  the  chains  give  rise  to  configu¬ 
rations  which  do  exhibit  overcrossings.  In  this  case,  one  finds  (T*)  <  1  and  (N) 
>  0.  As  an  illustration,  Figure  2  shows  the  results  for  the  configurational  average 

Table  II.  Convergence  of  configurational  averages  of  the  radius  of  gyration 
((/*c)),  its  fluctuation  ( <jRG ),  and  the  minimum  spherical  radius  ((/?))  for 
the  branched  ’’polymer”  (5,2[ 2,2])  (n  =  5  with  two  branches,  b  =  2).  (All 
data  are  given  in  A.  A  constant  interatomic  distance  £  =  1.53A  is  used.  The 
exact  limit  corresponds  to  the  single  tetrahedral  configuration  found  for 
maximum  excluded  volume.) 


rex 

(Kc) 

<*RG 

(R) 

0.001 

1.230 

0.108 

1.647 

0.500 

1.241 

0.104 

1.647 

1.000 

1.271 

0.076 

1.648 

1.500 

1.311 

0.045 

1.652 

2.000 

1.349 

0.016 

1.632 

2.300 

1.364 

0.0034 

1.593 

2.400 

1.3672 

0.0010 

1.566 

2.450 

1.3682 

0.0003 

1.549 

2.4985 

1.3685 

0.0000 

1.530  (Exact  values) 
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—  Linear  polymers  (no  branches) 

■  Polymers  with  one  branch 

n  Polymers  with  two  branches 

©  Polymers  with  three  branches 
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Figure  2.  Configurational  average  of  the  mean  number  of  overcrossings  ((#))  for  simple 
branched  polymers.  [The  full  line  joins  the  results  for  the  linear  polymers  and  provides 
an  upper  bound  for  the  branched  polymers,  which  are  therefore  less  entangled.] 


of  the  mean  number  of  overcrossings  for  all  polymers  in  Figure  1 ,  calculated  with 
rex  =  0.00 1  A  (virtually  no-excluded  volume  interaction).  The  continuous  line  in 
Figure  2  joins  the  results  for  linear  chains  (nonbranched  random  polymers).  This 
line  provides  an  upper  bound  to  the  { N )  values  of  all  branched  polymers  with  the 
same  number  n  of  monomers.  The  results  indicate  that  the  larger  the  number  of 
branches  for  a  given  n ,  the  smaller  the  number  of  overcrossings.  In  other  words, 
for  n  fixed  and  small  excluded  volume,  an  increase  in  the  number  of  atoms  forming 
part  of  the  branches  reduces  the  complexity  of  the  entanglements  which  can  be 
formed  by  the  polymer.  This  is  reasonable  since  in  this  case  the  main  chain  is 
shortened,  thereby  restricting  the  possibilities  of  foldings  which  can  give  rise  to 
complex  entanglements. 

The  above  results  illustrate  the  computation  and  some  elementary  properties  of 
the  averaged  shape  descriptors  of  short  branched  polymers.  In  the  next  sections  we 
deal  with  a  more  detailed  analysis  for  a  family  of  branched  polymers  with  larger  n. 


Behavior  of  Geometric  and  Shape  Descriptors  in  the  Configurational  Spaces  of 
Linear  Polymers  and  Regular  Stars 

In  the  limit  of  large  number  of  atoms,  the  shape  descriptors  of  branched  polymers 
with  few  and  short  chains  behave  similarly  to  those  found  in  linear  chains.  In  order 
to  test  possible  differences  in  the  behaviors  of  branched  and  linear  polymers,  one 
must  choose  a  family  of  nontrivial  branched  polymers.  In  what  follows,  we  discuss 
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some  properties  of  the  simplest  of  such  polymers.  The  chosen  family  are  the  regular 
3 -stars  (in  short,  “stars”),  which  exhibit  three  single-chains  of  the  same  length  stem¬ 
ming  from  a  single  branchpoint.  [Note  that  the  values  of  n  in  this  class  of  stars  are 
restricted  to  n  =  3n’  +  1,  where  n'  >  1  is  the  number  of  atoms  in  each  one  of  the 
single-chains.] 

The  configurationally  averaged  geometric  properties  of  star  polymers  have  been 
discussed  in  the  literature  as  a  function  of  the  number  of  atoms  [17,18].  In  this 
section,  we  analyze  the  detailed  interrelation  between  geometric  descriptors  and 
entanglement  descriptors  within  the  configurational  space  of  linear  and  star  polymers 
with  a  given  n  value.  The  larger-^  behavior  of  averaged  descriptors  is  discussed  in 
another  section. 

We  have  surveyed  the  configurational  space  of  linear  and  star  polymers  with 
various  n  values  and  various  values  of  excluded  volume.  As  a  representative  example, 
we  consider  here  the  case  with  n  =  16.  This  corresponds  to  a  star  with  three  5- 
atom  single-chains  stemming  from  the  central  atom.  As  illustration,  the  2D  pro¬ 
jections  of  two  random  configurations  are  displayed  in  Figure  3.  The  left-hand  side 
configuration  (I)  has  been  generated  with  virtually  no-excluded  volume,  whereas 
the  one  on  the  right-hand  side  (II)  corresponds  to  large  excluded  volume.  The  effect 
of  excluded  volume  is  clearly  seen  in  the  local  geometry  about  the  central  atom 
(atom  “6”).  In  the  configuration  II,  atoms  5,  6,  7,  and  12  lie  closer  to  plane-trigonal 
positions.  [Bond  angles  involving  atom  6  are  102°,  112°,  and  131°,  and  the  asso¬ 
ciated  dihedral  angle  is  146°.]  The  geometric  descriptors  reflect  the  general  trend 
whereby,  on  average,  one  expects  the  stars  with  smaller  excluded  volume  to  be 
smaller  in  size  (more  compact).  [Configuration  I  has  the  instantaneous  values  RG 
=  2.69  A  and  R  =  4.41  A,  whereas  configuration  II  presents  RG  =  4.13  A  and  R 
=  6.62  A.]  Similarly,  the  role  of  excluded  volume  in  modifying  the  3D  shape  of  a 
polymer  is  well  reflected  in  the  distribution  of  overcrossings.  Figures  4  and  5  compare 
the  histograms  of  overcrossings  probabilities  (or  overcrossing  spectra)  for  the  two 
star  configurations.  In  Figure  4  there  is  a  predominance  of  no-overcrossings  (ap¬ 
proximately  30%  of  the  projections  will  show  no  overcrossings),  but  also  a  significant 


Branched  (star)  polymer  Branched  (star)  polymer 

with  rex=0.001A  (y~0)  with  rex=2.3A  (y=0.87) 

Figure  3.  Two  configurations  for  the  regular  3-star  with  n  =  16  and  different  excluded- 
volume  parameter  y  =  rex/rc.  [For  these  stars,  t  =  1.53  A  and  rc  »  2.65  A.] 
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N,  number  of  overcrossings 

Figure  4.  Overcrossing  spectrum  for  one-star  polymer  configuration  with  n  =  16  and 
small  excluded  volume  (left-hand  side  in  Fig.  3). 


0  5  10  15  20  25 

N,  number  of  overcrossings 

Figure  5.  Overcrossing  spectrum  for  one-star  polymer  configuration  with  n  -  16  and 
large  excluded  volume  (right-hand  side  in  Fig.  3). 
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frequence  of  placements  where  the  configuration  exhibits  2  and  4  overcrossings. 
Several  other  overcrossing  numbers  have  probabilities  of  1%  or  more.  Finally,  the 
global  shape  of  this  configuration  is  characterized  by  N  =  2.59  and  A*  =  0.305.  In 
contrast,  Figure  5  shows  the  complete  predominance  of  no-overcrossings  in  the  star 
with  large  excluded  volume.  Configuration  II  is  characterized  by  N  -  0.495  and 
A*  =  0.733.  Therefore,  it  can  be  classified  as  not  only  larger  in  size  than  configuration 
I  but  also  less  entangled  (i.e.,  with  entanglements  of  less  complexity). 

A  large  number  of  randomly  generated  configurations  for  linear  and  star  polymers 
with  n  =  16  have  been  analyzed  in  the  same  manner.  For  a  given  excluded  volume 
radius  vex ,  the  geometric  descriptor  RG  and  the  shape  descriptors  N  and  A*  have 
been  computed  and  monitored  within  the  configurational  space.  For  a  given  n,  one 
would  expect  that  the  two  families  of  descriptors,  though  independent  from  each 
other,  should  exhibit  some  correlation.  Intuitively,  for  n  fixed,  one  would  expect 
that  configurations  with  small  size  should  be  more  entangled  than  configurations 
with  large  size.  This  notion  can  now  be  put  in  quantitative  terms. 

Figures  6  and  7  display  the  configurations  of  linear  chains  and  stars,  respectively, 
in  2D  diagrams  of  instantaneous  values  of  RG  and  N  (for  n  =  16).  This  2D  plot 
represents  the  correlations  between  the  size  and  compactness  of  the  polymer,  on 
the  one  hand,  and  the  complexity  of  its  entanglements,  on  the  other.  [We  shall 
refer  to  this  plot  as  a  compactness-and-entanglement  map.]  Figures  6  and  7  include 
the  first  333  configurations  generated  for  each  one  of  the  values  of  excluded  volume, 
rex  =  o.OOl  A,  1.5  A,  and  2.3  A,  for  a  total  of  999  configurations.  The  diagrams 
indicate  the  most  representative  regions  of  compactness-and-entanglement  values 


□  rex=2.3  A 
!rcx=1.5A 

■  re  =0.001  X 
V  ex  J 

0.8 
0.7 
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log  RG  0.4 
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Figure  6.  Compactness-and-entanglement  map  for  configurations  of  linear  chains  ( n  = 
16)  with  various  values  of  excluded  volume.  [Each  point  gives  the  radius  of  gyration  and 
mean  overcrossing  number  for  one  configuration.  The  same  number  of  configurations 
(333)  are  plotted  for  each  rex  value.] 
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Figure  7.  Compactness-and-entanglement  map  for  configurations  of  regular  3-stars  (n  - 
16)  with  various  values  of  excluded  volume.  [Each  point  gives  the  radius  of  gyration  and 
mean  overcrossing  number  for  one  configuration.  The  same  number  of  configurations 
(333)  are  plotted  for  each  rex  value.] 


associated  with  each  choice  of  excluded  volume.  [Note  that  the  configurations  for 
rex  =  2.3  A  (white  squares)  lie  above  some  configurations  found  with  rex  =  1.5  A. 
In  turn,  some  of  the  latter  coincide  with  configurations  found  when  no-excluded 
volume  is  considered.] 

The  results  indicate  that,  even  though  linear  chains  are  larger  in  size  than  stars, 
both  exhibit  the  same  range  of  entanglement  complexity  at  small  and  intermediate 
values  of  excluded  volume.  At  larger  excluded  volume,  the  stars  appear  less  entangled 
than  chains.  The  result  is  reasonable  since  the  critical  excluded  volume  rc  for  stars 
( rc  «  2.65  A)  is  reached  before  the  rc  value  for  linear  chains  (rc  «  3.06  A),  thus 
reducing  the  folding  possibilities  for  stars.  [Note  that  the  limit  rc  2.65  A  for  stars 
is  satisfied  by  infinitely  many  configurations  with  N  >  0.] 

Despite  the  scattering  of  RG  —  N  data,  which  suggests  the  independent  nature  of 
the  two  descriptors,  some  correlation  is  evident.  A  regression  over  the  999  config¬ 
urations  of  various  excluded  volumes  rex  and  n  -  16  displayed  in  Figures  6  and  7 
provides  the  following  results: 

logRG  «  (-0.34  ±  0.0 1)  log  N  +  (0.539  ±  0.004),  <B  =  0.930,  linear  chains  (8a) 

log  Rg  »  (-0.31  ±  0.01)  log  N  +  (0.51 1  ±  0.002),  (B  =  0.957,  stars  (8b) 

where  the  95%-confidence  errors  are  given  and  6  is  the  correlation  coefficient.  The 
results  express  the  qualitative  correlation  between  geometric  descriptors  of  size  and 
compactness  and  the  shape  descriptors  of  entanglements.  Equations  (8)  do  not 
indicate  a  major  difference  between  the  scaling  of  RG  and  N  in  the  conformational 
spaces  of  stars  and  linear  chains.  The  results  for  various  n  values  ( n  <  100)  support 
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by  the  following  qualitative  relation  between  the  instantaneous  values  of  descriptors: 

(AO,  ~  ( Rg)T for  stars  and  linear  chains,  constant  n  (9) 

where  the  exponent,  rounded-off  to  the  first  significant  figure,  appears  to  be  f  « 
3.  The  subindex  “f  ’  in  Eq.  (9)  indicates  a  generic  configurations  K,  with  constant 
n.  The  scaling  (9)  must  be  interpreted  as  a  relation  between  the  ranges  of  values  of 
both  descriptors.  Actual  configurations  show  important  fluctuations  about  Eq.  (9). 
A  precise  correlation  between  RG  and  N  does  not  exist,  thus  suggesting  that  the 
entanglement  nature  of  a  polymer  fold  cannot  be  established  on  the  sole  basis  of 
size  and  compactness. 

The  small  dependence  of  Eq.  (9)  on  the  polymer  architecture  is  probably  not  a 
general  feature.  If  the  shape  descriptor  A*  is  chosen  to  monitor  the  entanglements, 
the  correlation  with  the  radius  of  gyration  shows  some  dependence  on  the  connec¬ 
tivity.  The  results  with  n  =  16  and  all  rex  values  give: 

(RG),  ~  (A*) f,  constant  n  (10) 

with  £  »  0.55  ±  0.05  for  stars  and  £  «  0.63  ±  0.05  for  linear  chains.  The  difference 
in  £  exponents  appears  to  be  beyond  what  can  be  accounted  for  by  the  configurational 
dispersion  about  the  mean. 

In  summary,  the  interrelation  of  geometric  and  shape  descriptors  expresses  quan¬ 
titatively  the  notion  that,  for  a  given  n,  compact  polymers  are  more  entangled  than 
swollen  polymers.  Yet,  the  relation  between  descriptors  is  nonlinear  and  exhibits 
large  fluctuations.  In  the  case  of  N  and  RG,  their  proportionality  [Eq.  (9)]  depends 
little  on  the  polymer  topology.  In  the  next  section,  the  analysis  is  extended  to  the 
case  of  variable  n. 

Configurationally  Averaged  Shape  Descriptors  for  Longer  Regular  3-Stars 

The  scaling  behavior  of  the  configurationally  averaged  radius  of  gyration  (or 
other  “distance”  descriptor  such  as  R)  with  the  number  of  monomers  n  is  well 
known  for  a  linear  chain  [2,5]: 

<. RG)~kn f  (11) 

in  the  case  of  long  linear  chains.  With  no  excluded-volume  interaction,  these  chains 
are  said  to  be  “ideal”  or  in  their  “0-conditions”.  In  this  case,  they  can  be  represented 
by  gaussian  statistics  with  an  exponent  v  =  0.5  [2,5].  In  the  asymptotic  limit  of 
large  excluded  volume,  the  best  estimation  for  the  exponent  is  currently  v  «  0.588 
[15].  In  this  latter  case,  corrections  to  scaling  can  play  an  important  role  for  short 
chains  [15].  Equation  (1 1)  is  known  to  be  valid  also  for  regular  stars  [18].  Modi¬ 
fications  in  the  scaling  exponents  appear  to  be  needed  for  some  special  topologies 
[17].  These  “exact”  results  provide  a  reference  to  test  the  completeness  of  the  con¬ 
formational  search  in  the  computation  of  averaged  shape  descriptors  for  longer 
chains  and  stars. 

Figure  8  shows  the  present  results  for  (Rci)  in  linear  chains  and  stars  with  mono¬ 
mer  numbers  n  <  100.  The  stars  represented  correspond  to  n  =  7,  16,  31,  61,  and 
100,  in  the  case  of  no  excluded  volume  ( rex  =  0.001  A).  The  computations  for  rex 
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=  1.5  A  are  much  more  demanding  since  fewer  conformations  pass  the  excluded- 
volume  criterion.  The  largest  star  handled  in  this  case  had  n  =  40.  Nevertheless, 
the  number  of  monomers  appears  to  be  large  enough  to  reach  the  correct  large-« 
behavior  in  the  radius  of  gyration  (cf.  Fig.  8).  A  linear  regression  fitting  of  the  results 
in  Figure  8  for  n  <  16  gives  the  exponents  v  «  0.500  ±  0.024  for  stars  with  rex  = 
0.001  A  and  v  «  0.59  ±  0.02  for  stars  with  rex  =  1.5  A.  The  good  agreement  with 
the  exact  results  indicates  a  satisfactory  sampling  of  the  configurational  space. 
Therefore,  the  configurational  averages  (A*y  and  (A)  should  be  as  reliable  as  the 
average  (RG). 

A  previous  analysis  of  shape  descriptors  for  linear  chains  with  excluded- volume 
interaction  and  number  of  monomers  20  <  n  <  500  showed  that  the  averages  of 
(A*y  and  (A)  follow  similar  power  laws  to  that  for  the  radius  of  gyration  [14]: 

(A*}  ~  anx  (12a) 

<A>  -  art 8  (12b) 

The  estimated  exponents  in  Ref.  [14]  are  A  »  -1.00  ±  0.03  and  (3  «  1.4  ±  0.1, 
for  the  medium-size  polymers.  Recent  results  by  Whittington  and  co-workers  [16] 
with  longer  polymer  chains  (400  <  n  <  1 500)  indicate  a  smaller  asymptotic  exponent, 
/?  1. 122  ±  0.005.  The  difference  is  certainly  due  to  the  fact  that  (A)  reaches  the 

asymptotic  limit  slowly;  the  exponent  «  1 .4  represents  accurately  the  results  in 


n,  number  of  atoms 


•  Stars,  rex =1.5  A 
□  Stars,  rex=O.00lA 
■  Linear  chains,  rex=0.001  A 


Figure  8.  Scaling  behavior  for  the  configurational  average  of  the  radius  of  gyration  (. Rc ) 
(in  A)  for  linear  chains  and  regular  3-stars  with  n  <  100.  [The  slopes  marked  0.5  and  0.588 
correspond  to  the  expected  exact  results  of  polymers  with  and  without  excluded-volume 
interaction,  respectively.  The  agreement  with  the  exact  results  ensures  a  thorough  confor¬ 
mational  search.] 
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the  “cross  over”  regime  of  polymer  chains  of  intermediate  size  ( n  <  500).  As  we 
show  below,  the  results  in  this  work  support  the  view  that  power  laws  such  as  (12) 
also  describe  the  molecular  shape  of  regular  3-star  polymers,  at  least  for  intermediate 
values  of  n.  Our  findings  are  summarized  in  Figures  9  and  10. 

Figure  9  shows  the  results  for  the  shape  descriptor  (A*)  as  a  function  of  the 
number  of  monomers  n  for  stars  with  and  without  excluded  volume.  The  results 
for  the  linear  chains  (full  line)  are  included  for  comparison.  The  stars  with  no- 
excluded  volume  follow  closely  the  behavior  of  linear  chains.  The  estimated  ex¬ 
ponent  in  the  scaling  (12a)  for  these  stars  is  X  »  -1.06  ±  0.12  (6>  =  0.9993),  which 
lies  within  the  uncertainty  of  the  exponent  for  chains  [14].  In  the  case  of  larger 
excluded  volume  (rex  =  1.5  A),  the  largest  stars  we  can  handle  are  still  too  short  to 
make  an  accurate  estimation  of  scaling.  Nevertheless,  Figure  9  suggests  the  same 
qualitative  behavior  as  the  stars  with  no-excluded  volume. 

Figure  10  compares  the  averaged  mean  number  of  overcrossings  (N)  in  stars 
and  linear  chains.  The  results  indicate  a  similar  behavior  to  that  found  in  the  shape 
descriptor  (A*).  The  stars  with  no-excluded  volume  follow  closely  the  behavior  of 
linear  chains,  leading  to  an  estimated  exponent  jff  «  1.4  ±  0.2  ((?  =  0.9987)  for  a 
power  law  as  in  Eq.  (12b).  For  the  regular  3-stars  with  rex  =  1.5  A,  although  the 
values  of  (A)  are  smaller  (i.e.,  lesser  degree  of  folding),  the  behavior  of  (JSt)  as  a 
function  of  n  appears  to  follow  the  same  trend  observed  with  no-excluded  volume. 
As  commented  before,  this  latter  shape  descriptor  reaches  its  asymptotic  behavior 


n,  number  of  atoms 

1  10  100 


•  Stars,  rcx=1.5A 
□  Stars,  rex=0.001A 

—  Linear  chains,  rcx=0.00lX 

V _ _ _ _ _ s 


Figure  9.  Scaling  behavior  for  the  configurational  average  of  the  maximum  probability 
of  overcrossings  (A*)  for  linear  chains  and  regular  3-stars  with  n  <  100.  [The  behavior 
for  the  stars  with  large  and  small  excluded  volumes  agrees  well  with  the  scaling  exponent 
X  *=«  -1  obtained  for  linear  chains  [14].] 
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n,  number  of  atoms 


•  Stars,  rex=1.5A 
□  Stars,  rex=0.001A 
—  Linear  chains,  rex=0.001A 


Figure  10.  Scaling  behavior  for  the  configurational  average  of  the  mean  number  of  ov¬ 
ercrossings  (A)  for  linear  chains  and  regular  3-stars  with  n  <  100.  [The  behavior  for  the 
stars  with  large  and  small  excluded  volumes  follows  the  behavior  of  linear  chains.  The 
results  for  intermediate  values  of  the  number  of  monomers  suggest  an  effective  scaling 
exponent  »»  1.4  [14].  The  true  exponent  in  the  asymptotic  limit  must  be  smaller  than 
1.4  [16].  Reaching  the  asymptotic  behavior  requires  probably  n  >  500.] 


more  slowly,  and  therefore  our  results  should  be  considered  as  corresponding  only 
to  a  regime  of  intermediate  polymer  lengths. 

In  conclusion,  averaged  shape  descriptors  of  regular  stars  follow  power  laws  in 
terms  of  n  that  are  similar  to  those  found  for  linear  polymers.  As  far  as  the  present 
results  indicate,  the  exponents  X  and  appear  to  depend  little  on  the  different 
polymer  topology  and  the  change  in  excluded  volume.  Note,  however,  that  the 
multiplying  coefficients  a  and  a  in  Eqs.  (12)  do  depend  on  excluded  volume.  The 
coefficient  a  increases  with  rex ,  whereas  the  a  decreases.  Both  behaviors  establish 
that  stars  with  large  rex  exhibit,  on  average,  fewer  overcrossings  than  the  stars  with 
smaller  excluded  volume,  i.e.,  they  former  are  less  entangled  than  the  latter. 


Further  Comments  and  Conclusions 

In  this  work,  we  have  extended  the  analysis  of  entanglement  complexity  and 
degree  of  folding  in  macromolecules  from  linear  to  branched  polymers.  Approximate 
relations  have  been  found  between  geometric  descriptors  and  shape  descriptors  [cf. 
Eqs.  (9)  and  (10)]  within  the  configurational  space  of  polymers  with  a  constant 
number  of  monomers.  Nevertheless,  the  results  show  that  the  two  families  of  prop- 
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erties  are  independent:  the  features  conveyed  by  shape  descriptors  TV  and  A*  cannot 
be  fully  retrieved  from  the  geometric  ones. 

The  results  contribute  to  a  knowledge  of  the  general  properties  of  regular  stars 
as  a  subclass  of  branched  polymers.  Regarding  the  geometric  characterization  of 
size  and  compactness,  the  scaling  law  (1 1)  is  well  established  for  stars  [18].  We  have 
shown  here  how  the  analysis  of  these  polymers  can  be  enriched  by  studying  features 
other  than  size.  In  our  case,  we  characterize  the  self-entanglements  in  the  chain  by 
means  of  shape  descriptors  which  also  follow  simple  scaling  laws  [14,16].  In  a 
similar  fashion  to  what  is  known  for  geometric  descriptors,  it  is  found  that  the 
descriptors  of  entanglement  complexity  scale  similarly  whether  the  polymer  is  a 
linear  chain  or  a  star. 

The  architectures  of  regular  3-stars  and  linear  polymers  are  sufficiently  different 
so  as  to  make  not  self-evident  a  coincidence  in  the  behavior  of  their  shape  descriptors. 
This  finding  will  probably  not  be  true  for  all  other  topologies.  For  instance,  polymers 
with  large  branching  and  complicated  connectivities  (e.g.,  where  branches  form 
complex  loops  with  the  main  chain)  may  show  an  averaged  mean  number  of  ov¬ 
ercrossings  growing  with  n  even  faster  than  a  power  law.  It  remains  to  be  seen  for 
which  families  of  simpler  branched  polymers,  such  as  combs  and  brushes,  the  be¬ 
havior  of  the  shape  descriptors  is  similar  to  the  one  found  here  for  regular  stars. 

The  present  methodology  does  not  describe  completely  all  the  essential  features 
of  macromolecular  shape,  but  it  complements  the  analyses  based  on  purely  geo¬ 
metric  properties  of  polymers.  All  distinct  aspects  of  polymer  shape  (e.g.,  size, 
compactness,  degree  of  folding,  self-entanglements,  writhing,  etc.)  must  be  taken 
into  account  for  a  full  understanding  of  the  relation  between  3D  molecular  structure 
and  various  physico-chemical  properties.  We  believe  that  the  use  of  overcrossing 
numbers  (and  their  related  shape  descriptors)  can  be  a  valuable  tool  in  the  difficult 
task  of  characterizing  the  folding  state  of  a  macromolecule  with  a  given  architecture. 
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Abstract 

In  this  paper  we  investigate  the  electronic  structure  and  the  static  longitudinal  polarizability  of  regular 
finite  and  infinite  chains  of  polythiophene  at  the  ab  initio  Hartree-Fock  level.  The  effects  of  the  inclusion 
of  polarization  functions  in  the  basis  set  and  the  influence  of  introducing  a  torsional  twist  between 
adjacent  rings  are  considered.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Conjugated  polymers  have  attracted  a  great  deal  of  theoretical  and  experimental 
interest  in  recent  years.  These  systems  possess  potential  with  respect  to  their  inter¬ 
esting  conductivity  properties  and  their  large  nonlinear  optical  responses  [1-3].  In 
this  area,  much  attention  is  directed  towards  polythiophene  and  its  derivatives 
(for  example  [4-7]).  This  is  an  attractive  polymer  due  to  its  stability  in  air  and  the 
possibilities  of  introducing  side-chain  substituents  to  enhance  its  solubility  in  com¬ 
mon  organic  solvents  [8,9].  Materials  based  upon  polythiophene  have  been 
synthesized  that  exhibit  high  conductivities  and  large  linear  and  nonlinear  re¬ 
sponses  [10,1 1]. 

Large  response  properties  are  critically  dependent  upon  the  degree  of  electron 
delocalization  along  the  polymer  backbone  [12].  X-ray  crystal  structures  indicate 
that  2,2-bithiophene  and  terthiophene  adopt  a  planar  conformation,  with  the  sulfurs 
in  adjacent  rings  anti  to  each  other  [13].  In  part  of  this  paper  we  study  the  effect 
of  introducing  a  twist  between  successive  rings  in  finite  and  infinite  polythiophene 
chains  in  order  to  observe  the  effect  this  disruption  of  conjugation  has  upon  the 
computed  properties.  Such  distortion  from  the  planar  conformation  is  observed  on 
the  introduction  of  side-chain  substituents  [14].  Previous  studies  have  studied  the 
variation  in  the  ionization  energy,  valence  band  width,  and  band  gap  with  such 
twists  using  parameterized  ab  initio  (valence  effective  Hamiltonian)  [15]  and  mndo 
calculations  [16]. 
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In  this  study,  in  addition  to  the  energies,  band  energies  and  band  gap,  we  present 
ab  initio  uncoupled  and  coupled  Hartree-Fock  predictions  of  the  static  longi¬ 
tudinal  polarizabilities.  Both  the  uncoupled  and  coupled  schemes  were  applied  to 
the  finite  chains,  while  the  treatment  of  the  infinite  chains  was  restricted  to  the 
uncoupled  Hartree-Fock  level.  Recently,  we  have  published  [17]  the  results  of  a 
comprehensive  study  of  the  dipole  polarizabilities  of  oligothiophenes  using  several 
basis  sets  (3-2 1G,  6-3 1G**,  6-3 1 1G**,  and  the  Sadlej  medium-sized  polarized  basis 
sets  [18])  and  the  extrapolated  values  of  the  longitudinal  polarizability  per  unit  cell 
of  polythiophene.  This  paper  is  a  companion  to  this  in  studying  the  effects  of  changes 
in  the  conformation  on  the  electronic  properties  of  polythiophene. 


Methodology 

The  calculations  presented  in  this  paper  fall  into  two  categories.  Initially,  geometry 
optimizations  were  performed  on  ter-thiophene  at  various  conformations,  using 
the  Gaussian-92  package  [19].  The  geometries  of  the  central  rings  were  then  used 
as  the  repeating  unit  in  calculations  of  the  polarizabilities  of  extended  thiophene 
oligomers,  again  using  Gaussian-92,  and  of  infinite  stereoregular  polythiophene, 
using  a  program  developed  in  our  laboratory,  PLH-93  [20,21]. 

The  polarizabilities  of  the  oligomers  were  calculated  at  the  uncoupled  and  coupled 
Hartree-Fock  levels  (from  hereon  referred  to  as  UCHF  and  CHF,  respectively).  The 
uchf  method,  sometimes  refered  to  as  the  sum  over  states  or  SOS  method  [22],  is 
based  upon  a  perturbation  treatment  of  the  Hartree-Fock  wavefunction,  or  the 
corresponding  scf-mos.  The  components  of  the  polarizability  can  be  expressed  as 

_  a  '\r\  ($i  I  I  I  Mi?  I  <i>i)  , .  >. 

OLUv  ~~  4  2j  (1) 

i  a  ei 

where  </>,  and  represent  doubly  occupied  and  virtual  molecular  orbitals,  in  the 
absence  of  an  electric  field,  respectively,  and  ea  and  et  their  corresponding  orbital 
energies.  The  scheme  neglects  field-induced  reorganizational  effects,  and  is  thus 
described  as  an  uncoupled  scheme.  In  the  CHF  method  [23],  the  field-induced  elec¬ 
tron  reorganization  is  included  in  a  way  which  is  fully  consistent  in  terms  of  ad¬ 
justments  in  the  average  two-electron  interactions  via  analytical  differentiation  of 
the  field  dependent  Hartree-Fock  equation. 

The  standard  quantum  chemical  treatment  of  polymers  is  based  on  the  periodic 
model  of  the  polymer  chain.  In  restricted  Hartree-Fock  theory,  the  many-electron 
wave  functions  of  the  closed  shell  periodic  systems  are  approximated  by  Slater 
determinants  constructed  from  doubly  occupied  crystalline  orbitals.  The  lcao  form 
of  such  a  crystalline  orbital  is  given  by 

Mk,  r)  =  A'cl/2  2  exp [ikja]  2  cp„(k)Xp(x  -  P  -  jae-),  (2) 

j  P 

where  n  is  the  band  index,  k  the  reciprocal  lattice  vector  in  the  first  Brillouin  zone, 
j  represents  the  cell  index,  Ncl/2  the  normalization  factor,  cpn(k)  the  /c-dependent 
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LCAO  coefficients,  and  Xp(r  -  P  —  jaez ),  the  pth  atomic  orbital  centered  in  the  yth 
cell.  The  standard  theory  of  band  structure  is  covered  in  detail  in  several  reviews 
(for  example,  see  [24,25]). 

In  methods  for  the  calculation  of  the  polarizability  of  polymers  serious  problems 
have  to  be  overcome.  The  most  serious  of  these  is  that  the  electric  field  operator 
acting  on  a  polymer  is  unbounded  and  thus  results  in  destruction  of  the  translational 
symmetry  of  the  system.  Means  of  overcoming  this  problem  have  been  proposed 
and  discussed  in  detail  in  the  literature  [26,27]. 

In  this  paper,  we  apply  a  scheme  for  the  computation  of  the  uncoupled  Hartree- 
Fock  asymptotic  longitudinal  static  polarizability  per  unit  cell  of  infinite  systems 
developed  recently  [28,29]  based  upon  the  perturbational  approach  of  Genkin  and 
Mednis  [30].  The  longitudinal  polarizability  can  be  expressed  as 


«zz  =  4  2  2  2 

i  a  k 


|fl/a(fc)|2 
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where  the  sums  over  i  and  a  represent  sums  over  occupied  and  unoccupied  bands 
respectively,  Uia(k)  are  the  dipole  transition  strengths  [28,29,31,32].  In  order  to 
calculate  the  polarizability  per  unit  cell,  the  sum  over  A:  is  replaced  by  an  integration 
over  the  first  Brillouin  zone,  to  give 


aZZ  _  y  y  f  1  QjgiK)  l  77 
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=  2  J  <*r z(i,  k)  dk 
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(4) 


This  final  result  illustrates  how  the  polarizability  can  be  decomposed  into  the  con¬ 
tributions  made  from  each  of  the  occupied  bands.  This  can  be  used  to  establish 
relationships  between  the  topology  of  the  bands  and  the  polarizability  to  provide 
insight  into  the  nature  of  bonding  and  conjugation  in  the  polymer  [17,29,33,34]. 
Coupled  schemes  to  calculate  the  polarizability  have  been  developed  recently  [35— 
37],  but  their  application  has  been  restricted  to  model  systems  due  to  the  substantial 
computational  cost  involved. 

All  calculations  were  performed  on  IBM  RS6000  Model  560  machines  of  the 
Namur  Scientific  Computing  Facility. 


Results 


Geometry  Optimizations 

All- Anti,  Planar  Poly  thiophene.  The  geometry  of  the  trimer  in  the  all  anti  con¬ 
formation  was  optimized  at  the  Hartree-Fock  level  with  the  3-2 1G  and  3-2 1G* 
basis  sets.  In  these  geometry  optimizations  all  atoms  in  the  three  rings  were  con¬ 
strained  to  be  coplanar.  The  molecule  is  represented  in  Figure  1  and  the  results 
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yC6  C7, 

h/  h5 


Figure  I .  Ter-thiophene  with  atom  labels  as  referred  to  in  Tables  I  and  II. 


listed  in  Table  I.  Parameters  obtained  from  an  mndo  geometry  optimization  [4] 
are  also  included  for  comparison.  The  values  of  5  given  in  the  final  two  rows  of 
Table  I  represent  the  bond-length  alternation,  which  is  the  difference  between  the 
two  adjacent  single  and  double  carbon-carbon  bond  lengths  indicated.  This  alter¬ 
nation  gives  an  indication  of  the  amount  electron  delocalization  along  the  carbon 
backbone.  The  geometrical  parameters  obtained  for  the  central  ring  were  used  in 
the  subsequent  polymer  calculations.  Two  such  rings  constitute  the  translational 
unit  cell  used  for  the  calculations  on  the  infinite  chain.  It  has  been  noted  [7]  that 
using  the  trimer  is  sufficient  to  obtain  reliable  geometrical  parameters  for  the  central 
ring,  with  little  further  change  being  observed  in  continuing  to  higher  oligomers. 

Twisted  Poly  thiophene.  In  the  idealized  conformations  considered  for  twisted 
polythiophene,  equivalence  of  alternate  thiophene  rings  was  imposed.  Partial  ge¬ 
ometry  optimizations  using  the  3-2 1G  basis  set  were  carried  out  on  the  three-ring 
chains  for  inter-ring  twists  between  0°  and  180°,  at  15°  intervals.  In  order  to  retain 
the  translational  periodicity  for  the  polymer  calculations,  the  chains  were  constrained 
to  be  linear  (or  rodlike)  [15],  that  is,  with  the  interring  carbon-carbon  bonds 
lying  along  the  axis  of  periodicity.  In  practice,  the  carbon-sulfur-carbon  angle 
(C5 — S2 — C8)  in  the  central  ring  and  the  sulfur-carbon-carbon  angle  (S2-C5-C4) 
along  the  backbone  were  fixed  to  be  89.77°  (the  value  obtained  in  the  optimization 
of  the  planar,  anti  conformation)  and  134.89°,  respectively.  Additionally,  the  central 
ring  carbon-sulfur  interatomic  distances  were  fixed  at  1.809  A.  All  atoms  in  indi¬ 
vidual  rings  were  constrained  to  be  coplanar,  but  all  other  bond  lengths  and  bond 
angles  were  free  to  vary.  The  results  of  the  optimizations  for  the  geometrical  pa¬ 
rameters  of  the  central  ring  are  presented  in  Table  II.  An  inter-ring  twist  angle  of 
1 80°  corresponds  to  the  all-anti  conformation.  The  carbon-hydrogen  bond  lengths 
remained  nearly  constant  for  all  twist  angles  (1.069  ±  0.001  A)  and,  therefore,  are 
not  included  in  the  table.  The  values  of  b  given  in  the  fifth  and  sixth  columns  of 
Table  II  give  the  degree  of  alternation  between  adjacent  carbon-carbon  single  and 
double  bonds. 

Energy  and  Polarizability  Calculations 

All-Anti,  Planar  Conformation.  Using  the  geometrical  parameters  given  in  Table 
I,  molecular  calculations  were  performed  for  chains  containing  from  one  to  six 
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Table  I.  Optimized  geometrical  parameters  of  all-anti,  planar  ter-thiophene.a 


Polythiophene 

3-2 1G 

3-2 1G* 

MNDO 

S,-C, 

1.795 

1.721 

S,— C4 

1.810 

1.738 

Cj — C2 

1.335 

1.347 

c2 — C3 

1.442 

1.433 

C3 — C4 

1.343 

1.355 

C,-H, 

1.065 

1.067 

C2-H2 

1.070 

1.069 

C3-H3 

1.069 

1.069 

<(Sj — C] — C2) 

111.16 

111.97 

<(S,-C4-C3) 

110.59 

110.72 

<(C,-C2~C3) 

113.91 

112.50 

<(C4 — C3 — C2) 

114.40 

113.16 

<(C,-S,-C4) 

89.45 

91.65 

<(Si — C4 — C5) 

120.54 

121.40 

<(S, — Cj — HO 

120.19 

120.76 

<(C,-C2-H2) 

123.47 

124.02 

<(C4 — C3 — H3) 

122.84 

123.59 

s2-c5 

1.809 

1.737 

1.704 

c5-c6 

1.342 

1.354 

1.380 

c6-c7 

1.435 

1.427 

1.447 

C4 — C5 

1.439 

1.454 

1.444 

c6— h4 

1.069 

1.069 

1.085 

<(Si _ Cc _ C  a) 

120.45 

121.29 

119.6 

^*1^2  ^5  ^4/ 

<<r\ _ Si _ 

89.77 

92.01 

94.6 

02  ^8/ 

<(S2 — C5 — Q) 

110.49 

110.66 

110.2 

<(C5 — C6 — C7) 

114.62 

113.34 

112.5 

<(C5 — Cg — H4) 

122.83 

123.63 

125.7 

^Cj— QA—C7 

0.093 

0.073 

0.067 

5c4-c5,c5-c6 

0.097 

0.100 

0.064 

a  Geometry  of  central  rings  used  in  calculations  on  the  extended  oligomers  and  infinite  chains.  The 
mndo  parameters  are  listed  for  comparison.  All  bond  lengths  are  given  in  angstroms.  The  atom  labels 
correspond  to  Figure  1 . 


thiophene  rings  with  the  3-2 1G  and  3-2 1G*  basis  sets.  The  larger  chains  were 
constructed  using  the  geometries  obtained  for  the  terminal  rings  in  the  trimer  op¬ 
timizations,  and  repeating  the  central  unit.  The  evolution  of  the  UCHF  and  CHF 
static  longitudinal  polarizability  per  thiophene  ring  [calculated  by  azz(N)  - 
azz(N  -  1)]  with  an  increasing  number  of  thiophene  rings  for  both  basis  sets  at  the 
two  optimized  geometries  is  represented  graphically  in  Figures  2  and  3. 

Calculations  on  infinite  chains  with  two  thiophene  rings  per  unit  cell  were  also 
carried  out  using  the  3-2 1G  and  3-2 1G*  basis  sets  using  the  optimized  geometries 
in  both  cases.  The  short  and  intermediate  ranges  of  interaction  were  set  to  ±3  and 
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Table  II.  3-2 1G  geometrical  parameters  of  the  central  ring  resulting  from  the  restricted  optimization 
of  ter-thiophene  at  various  twist  angles. 


G— G 

Q — Q 

G— C7 

5cS-6,c6_7 

<(S,  G  G1 

<(G  G  C7I 

<(G  G 

180.0 

1.454 

1.347 

1.401 

0.107 

0.054 

109.81 

115.31 

121.83 

165.0 

1.455 

1.347 

1.402 

0.108 

0.055 

109.82 

115.30 

121.78 

150.0 

1.458 

1.347 

1.404 

0.111 

0.057 

109.86 

115.26 

121.67 

135.0 

1.463 

1.345 

1.407 

0.118 

0.062 

mam 

115.22 

121.59 

120.0 

1.467 

1.343 

1.410 

0.124 

0.067 

. iif 

115.18 

121.56 

105.0 

1.471 

1.342 

■HU 

0.129 

0.071 

109.97 

115.14 

121.59 

90.0 

1.472 

1.341 

1.415 

0.131 

0.074 

115.12 

121.64 

75.0 

1.463 

1.344 

1.408 

0.119 

0.064 

109.91 

115.21 

121.74 

60.0 

1.464 

1.345 

1.408 

0.119 

0.063 

mam 

115.21 

121.77 

45.0 

1.467 

1.345 

1.406 

0.122 

0.061 

■EH 

115.25 

122.00 

30.0 

1.474 

1.343 

1.402 

0.131 

0.059 

109.75 

115.36 

122.64 

15.0 

1.482 

1.341 

0.141 

0.059 

115.51 

122.51 

0.0 

1.487 

1.340 

1.396 

0.147 

0.056 

109.54 

115.57 

123.94 

±6  unit  cells  about  the  reference  cell  respectively.  The  total  energies,  energy  of  the 
highest  occupied  crystalline  orbital  (HOCO),  valence  band-gap  (iigap),  and  uchf 
polarizabilities  per  unit  cell  and  length  are  presented  in  Table  III. 

Twisted  Conformations.  The  3-2 1G  basis  set  was  used  for  all  oligomeric  and 
polymeric  calculations  on  the  twisted  conformations  of  thiophene  chains.  This 
choice  is  justified  due  to  the  increased  computational  cost  and  relatively  small  effect 
of  the  inclusion  polarization  functions  on  the  sulfur  atom.  Using  the  optimized 
geometries  described  in  the  previous  subsection  and  listed  in  Table  II  for  each  twist 
angle,  the  uchf  and  chf  static  polarizabilities  were  calculated  for  the  three-, 
five-,  and  seven-ring  oligomer  chains.  The  evolution  of  the  CHF  polarizability  per 
pair  of  thiophene  rings  with  an  increasing  number  of  rings  for  each  twist  angle  is 
illustrated  in  Figure  4. 

Polymeric  calculations  for  the  infinite  chains  were  performed  for  each  twist  angle 
at  the  optimized  geometry  obtained  in  each  case.  The  short  and  intermediate  ranges 
of  interaction  were  again  set  to  ±3  and  ±6  translational  cells.  The  effect  of  the  twist 
angle  on  the  electronic  properties  is  detailed  in  Table  IV  and  the  behavior  of  the 
band  gap  and  uchf  static  polarizability  per  unit  cell  are  illustrated  in  Figures  5 
and  6,  respectively. 


Discussion 

Effect  of  Polarization  Functions  on  the  Sulfur  Atom 

The  inclusion  of  polarization  functions  on  the  sulfur  atom  in  the  3-2 1G*  basis 
set  generally  has  a  more  marked  effect  on  the  results  of  the  geometry  optimizations 
of  the  all-anti  trimer  than  on  the  subsequent  energy  and  polarizability  calculations. 
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Figure  2.  Evolution  of  the  CPHF  longitudinal  polarizability  per  thiophene  ring  with  an 
increasing  number  of  thiophene  rings  with  the  3-2 1G  and  3-2 1G*  basis  sets  at  the  geometries 

indicated. 


The  calculated  carbon-sulfur  bond  length  using  the  3-2 1G*  basis  set  is  substantially 
shorter  (0.072  A)  compared  to  that  obtained  using  the  basis  set  without  the  polar¬ 
ization  functions.  The  carbon-sulfur-carbon  angle  is  correspondingly  wider  (2.2°) 
in  the  former  case.  Further  investigation  [38]  indicates  that  any  role  for  the  d- 
orbitals  in  the  bonding  orbitals  is  small,  and  that  this  observation  can  be  attributed, 
at  least  in  part,  to  the  paucity  of  the  3-2 1G  basis  set  in  the  core  region. 

For  the  carbon-carbon  bonds,  the  polarized  basis  set  results  in  a  lengthening  of 
the  carbon-carbon  double  bond  (0.012  A)  and  a  lengthening  and  shortening  of  the 
interring  and  intraring  carbon  bonds  (+0.015  and  -0.008  A)  respectively,  compared 
to  the  3-2 1G  basis  set.  Due  to  these  contrary  effects,  it  is  difficult  to  make  any 
overall  remarks  regarding  the  bond  length  alternation  with  the  two  basis  sets  used. 
The  overestimation  of  bond  length  altemancy  using  ab  initio  Hartree-Fock  methods 
is  well  known  and  can  be  attributed  largely  to  the  absence  of  electron  correlation 
effects.  The  mndo  method  is  regarded  as  quite  capable  of  yielding  acceptable  geo¬ 
metrical  parameters  for  conjugated  systems  at  low  computational  cost. 

Comparison  of  the  electronic  properties  of  the  valence  region  of  infinite  planar 
polythiophene  obtained  using  the  two  basis  sets  listed  in  Table  III  indicate  a  minor 
role  for  the  polarization  functions  located  on  the  sulfur  atom.  At  the  two  fixed 
geometries,  the  difference  between  the  energy  gaps  obtained  using  the  3-2 1G  and 
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Figure  3.  Evolution  of  the  UCHF  longitudinal  polarizability  per  thiophene  ring  with  an 
increasing  number  of  thiophene  rings  with  the  3-2 1G  and  3-2 1G*  basis  sets  at  the  geometries 

indicated. 


3-2 1G*  basis  sets  is  small  (<0.02  eV).  The  difference  between  the  gaps  predicted 
at  the  two  optimized  geometries  is  more  marked,  with  the  predicted  gap  being  in 
the  region  of  0.1  eV  larger  with  the  3-2 1G*  basis  set. 

The  CHF  and  uchf  polarizabilities  per  thiophene  ring  in  the  extended  oligomeric 
calculations  illustrated  in  Figures  2  and  3  show  similar  behavior.  The  difference 


Table  III.  Electron  properties  of  infinite  polythiophene  using  3-2 1G  and  3-2 1G*  basis  sets  at  two 
different  geometries.  All  values  in  atomic  units  (except  where  marked). 


3-2 1G  optimized  geometry  3-2 1G*  optimized  geometry 

3-2 1G  3-2 1G*  3-2 1G  3-21G* 


Total  energy 

HOCO 

Egap 

azz/unit  cell 
azz/thiophene  ring 
azz/unit  length 


-1094.662095 
-0.254888 
0.2668 
(7.26  eV) 
169.58 
84.79 
11.369 


-1094.87989241 
-0.251336 
0.2660 
(7.24  eV) 
170.53 
85.26 
11.432 


-1094.65346482 
-0.250778 
0.2704 
(7.36  eV) 
163.77 
81.89 
11.095 


-1094.88779487 
-0.247438 
0.2702 
(7.35  eV) 
164.16 
82.08 
11.121 
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Figure  4.  Evolution  of  the  3-2 1G  Coupled  Hartree-Fock  longitudinal  polarizability  per 
pair  of  thiophene  rings  with  increasing  chain  length  at  different  torsion  angles  between 

adjacent  rings. 


between  the  polarizabilities  obtained  at  the  two  optimized  geometries  using  the  two 
basis  sets  is  small.  Taking  the  example  of  the  six-ring  chain,  at  the  3-2 1 G  optimized 
geometry,  the  coupled  value  per  thiophene  ring  using  the  3-2 1G  basis  set  (170.6 
a.u.)  slightly  exceeds  that  obtained  using  the  3-2 1G*  basis  (169.4  a.u.).  At  the  same 
geometry,  the  ordering  of  the  uncoupled  polarizabilities  is  reversed,  with  that  ob¬ 
tained  using  the  3-21G*  basis  (85.18  a.u.)  greater  than  that  found  with  the  unpo¬ 
larized  basis  (84.75  a.u.).  These  features  are  replicated  at  the  3-21G*  optimized 
geometry.  The  coupled  and  uncoupled  polarizability  per  ring  for  the  hexamer  ob¬ 
tained,  with  the  3-2 1G  and  3-2 1G*  basis  sets,  respectively,  are  162.5  and  160.6  a.u. 
and  82.05  and  82. 1 3  a.u.  For  chains  containing  four  and  more  thiophene  rings, 
the  uncoupled  polarizability  per  ring  is  very  close  to  saturation.  However,  the  coupled 
value  has  still  not  reached  saturation  for  the  six-ring  chain,  the  longest  chain  con- 
sidered  in  this  part  of  the  work. 

The  uncoupled  polarizabilities  per  thiophene  ring  obtained  on  the  infinite 
polythiophene  chains  are  detailed  in  Table  III.  At  the  3-2 1G  optimized  geometry, 
the  computed  polarizabilities  per  thiophene  ring  are  84.79  and  85.26  a.u.  with  the 
3-2 1G  and  3-2 1G*  basis  sets,  respectively,  while,  at  the  3-2 1G*  optimized  geometry, 
the  values  are  81.89  and  82.08  a.u.  These  are  all  in  very  close  concordance  with 
the  values  obtained  from  the  extended  oligomer  calculations.  This  confirms  that 
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Table  IV.  3-2 1 G  electronic  properties  of  infinite  polythiophene  for  various  twist  angles  between 

successive  rings.3 


Total  energy  HOCO  Em p  aJN  aJL 


180° 

-1094.61007260 

-0.2384 

165° 

-1094.61001614 

-0.2416 

O 

O 

-1094.60995122 

-0.2514 

135° 

-1094.60999025 

-0.2682 

120° 

-1094.61039159 

-0.2894 

0 

to 

0 

-1094.61170400 

-0.3137 

O 

0 

-1094.61397675 

-0.3398 

75° 

-1094.61647400 

-0.3148 

60° 

-1094.61803623 

-0.2923 

45° 

-1094.61559726 

-0.2724 

OJ 

0 

0 

-1094.60669924 

-0.2570 

15° 

-1094.59456895 

-0.2469 

0° 

-1094.58876100 

-0.2435 

3  All  values  in  atomic  units  (except  where  marked). 


0.2508  (6.82  eV) 

190.47 

12.58 

0.2578  (7.01  eV) 

184.01 

12.15 

0.2780  (7.57  eV) 

168.39 

11.11 

0.3109  (8.46  eV) 

150.37 

9.91 

0.3504  (9.54  eV) 

137.16 

9.03 

0.3933  (10.70  eV) 

129.83 

8.54 

0.4171  (11.35  eV) 

127.85 

8.40 

0.3764  (10.24  eV) 

132.56 

8.73 

0.3393  (9.23  eV) 

141.06 

9.29 

0.3061  (8.33  eV) 

153.11 

10.08 

0.2803  (7.63  eV) 

165.61 

10.88 

0.2642  (7.19  eV) 

175.34 

11.50 

0.2591  (7.05  eV) 

178.68 

11.71 

the  uncoupled  static  polarizability  per  ring  in  the  larger  oligomers  is  very  close  to 
its  saturation  value,  as  discussed  above.  The  smaller  polarizability  at  the  3-2 1G* 
optimized  geometry  indicates  less  delocalization  along  the  polymer  backbone  at 
this  geometry.  This  is  supported  by  the  slight  increase  in  interring  bond  length 
alternation  compared  to  the  3-2 1G  optimized  geometry  and  the  increase  in  band 
gap.  The  HOCO,  which  makes  the  largest  contribution  to  the  polarizability,  is  of  7 r- 
symmetry  [17],  composed  of  contributions  from  the  carbon  atoms  along  the  back¬ 
bone,  thus  resembling  segments  of  all -trans  and  trans -cisoid  polyacetylene.  For  this 
reason,  it  is  not  surprising  that  the  addition  of  d-orbitals  on  the  sulfur  atom  in  the 
3-2 1G*  basis  set  has  only  a  small  elfect  on  the  computed  polarizability.  The  influence 
of  the  bond  length  alternation  on  the  computed  polarizability  is  critical.  Using  the 
less  alternating  mndo  optimized  geometry,  the  longitudinal  polarizability  per  unit 
cell  of  infinite  polythiophene  has  been  calculated  to  be  86.02  a.u.  using  the  3-2 1G 
basis  set  [17],  slightly  larger  than  the  values  obtained  here. 

Influence  of  Torsional  Twist  between  Adjacent  Rings 

Generally  speaking,  the  effect  of  introducing  a  torsional  twist  between  adjacent 
thiophene  rings,  thus  disrupting  the  conjugation,  is  seen  to  be  large.  From  the 
values  listed  in  Table  II,  it  can  be  seen  that  the  intraring  bond  length  alternation 
is  at  a  minimum  at  the  all-anti  conformation,  indicating  a  larger  degree  of  electron 
delocalization.  The  alternation  then  increases  to  a  maximum  for  the  case  in  which 
the  planes  containing  adjacent  rings  are  perpendicular  to  each  other,  pointing  to  a 
reduction  in  conjugation,  before  decreasing  for  twist  angles  between  90°  and  0°. 
The  interring  carbon-carbon  bond  length  alternation  shows  similar  behavior  for 
twist  angles  between  180°  (all  anti-)  and  60°.  For  smaller  angles,  the  length  of  the 
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Figure  5.  Evolution  of  the  3-2 1G  uncoupled  longitudinal  polarizability  per  unit  cell  of 
infinite  polythiophene  with  the  torsion  angle  between  adjacent  rings. 


interring  carbon-carbon  increases,  increasing  the  bond  length  alternation.  This 
effect  is  most  likely  due  to  steric  effects  and  inherent  strain  as  a  result  of  the  restricted 
optimization. 

For  the  polymer  calculations,  the  variation  in  the  total  energies  calculated  (see 
Table  IV)  are  generally  small.  As  the  twist  angles  decrease  towards  0°,  the  increase 
is  notable,  but  this  is  primarily  a  result  of  the  imposed,  idealized  geometry  of 
the  chain. 

For  the  oligomer  calculations,  the  values  of  the  CHF  polarizability  per  pair  of 
thiophene  rings  for  the  heptamer  in  the  most  conjugated  case  (all-anti,  180°  degree 
twist  between  adjacent  rings)  and  the  case  in  which  adjacent  rings  are  perpendicular 
to  each  other  are  407.7  and  226.5  a.u.,  respectively,  representing  a  decrease  of  over 
40%  between  the  two  extremes.  The  difference  is  slightly  less  marked  for  the  UCHF 
polarizability  calculations,  with  the  values  at  the  all-anti  and  perpendicular  con¬ 
formations  190.3  and  127.9  a.u.,  respectively.  The  ratio  of  the  CHF  and  UCHF 
polarizabilities  for  the  most  conjugated  case  is  2.12,  while  for  the  least  conjugated 
case  it  is  somewhat  smaller  at  1.79. 

Two  important  features  of  Figure  4  showing  the  CHF  polarizability  per  unit  cell 
with  increasing  chain  length  for  twist  angles  between  180°  and  90°  should  be  em¬ 
phasized.  First,  it  is  evident  that,  in  the  less  conjugated  cases,  the  saturated  value 
of  the  polarizability  is  attained  more  rapidly  than  for  the  more  conjugated  chains. 
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Figure  6.  Evolution  of  the  3-2 1G  band  gap  of  infinite  polythiophene  with  the  torsion 
angle  between  adjacent  rings. 


This  demonstrates  the  appeal  of  polymer  calculations  able  to  provide  the  asymptotic 
value  directly.  Second,  the  evolution  of  the  polarizability  per  unit  cell  with  twist 
angle  does  not  evolve  linearly.  Small  deviations  in  the  twist  angles,  at  either  the 
more  or  less  conjugated  extremes,  provoke  only  small  changes  in  the  computed 
values.  More  substantial  twists  induce  more  substantial  changes. 

This  behavior  is  seen  again  in  the  smooth  cosinelike  curve  for  the  variation  of 
the  uncoupled  polarizability  per  unit  cell  with  twist  angle  for  infinite  polythiophene 
chains  in  Figure  5.  In  Figure  6,  in  which  the  variation  of  the  band  gap  with  twist 
angle  is  illustrated,  the  overall  form  of  the  curve  is  repeated  (although  inverted,  as 
would  be  expected,  with  the  band  gap  increasing  as  the  conjugation  is  reduced), 
but  with  a  less  smooth  variation  for  twist  angles  around  90°.  The  ionization  energies 
given  in  Table  IV  (-£hoco)  also  show  this  behavior,  with  the  ionization  energy 
being  highest  when  two  adjacent  rings  are  perpendicular  to  each  other.  The  over¬ 
estimation  of  the  band  gap  by  Hartree-Fock  calculations  due  to  the  neglect  of 
electron  correlation  is  well  known.  As  can  be  seen  from  Eq.  (3),  the  prediction  of 
the  uchf  polarizability  is  critically  dependent  on  the  size  of  the  band  gap.  The 
excessively  large  band  gap  will  result  in  an  underestimation  of  the  polarizability. 
UCHF  polarizabilities  can  only,  at  best,  give  a  qualitative  indication,  but  are  sufficient 
to  reproduce  trends.  Accurate  predictions  of  polarizabilities  require  coupled  schemes 
and  consideration  of  electron  correlation. 
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This  cosinelike  form  has  been  observed  previously  in  studies  of  the  electronic 
properties  of  polythiophene  using  the  valence  effective  Hamiltonian  method  [15]. 
However,  there  the  authors  suggest  torsions  of  up  to  40°  fail  to  induce  any  significant 
modifications  of  the  electronic  properties.  While  this  may  be  the  case  for  the  com¬ 
puted  ionization  energies  and  band  gaps,  such  a  twist  has  substantial  impact  on 
the  electron  delocalization  along  the  chain,  evidenced  by  the  substantial  reduction 
of  polarizability.  Our  observation  is  that  small  distortions  from  planarity  (0-15°) 
will  have  only  a  small  detrimental  effect  on  the  conjugation  and  computed  properties, 
while  larger  twists  result  in  considerable  changes. 

Figure  7  shows  the  valence  band  structures  at  two  adjacent-ring  twist  angles 
(180°  and  90°).  There  are  strong  similarities  between  the  band  structures  for  the 
anti  (180°)  and  syn  (0°)  conformations  (not  shown),  while  that  for  the  perpendicular 
case  shows  some  strikingly  different  features.  Perhaps  the  most  obvious  is  the  nar¬ 
rowness  of  the  upper-valence  bands.  This  supports  the  idea  that  the  7r-electronic 
distribution  is  significantly  more  localized  in  this  case. 

The  bands  making  significant  contributions  to  the  UCHF  longitudinal  polariza¬ 
bility  per  unit  cell  for  adjacent  ring  twist  angles  of  180°,  90°,  and  0°  are  given  in 
Table  V.  Distinction  between  bands  of  x-  and  ^-symmetry  enables  some  conclusions 
to  be  drawn  regarding  the  nature  of  the  conjugation  and  its  disruption  as  adjacent 
rings  are  twisted  with  respect  to  each  other.  For  the  anti  and  syn  conformers,  in 
which  all  atoms  are  coplanar,  the  7r-bands  are  composed  entirely  of  2py  (C,S)  and 
3p y  (S  only)  atomic  orbitals  perpendicular  to  the  plane  of  the  chain,  and  the  x- 
electrons  are  delocalized  over  the  whole  system.  For  the  conformer  with  a  90° 
torsion  angle  the  situation  is  different,  with  the  x-structure  being  defined  by  pA  and 
p y  orbitals  alternately.  This  leads  to  the  observed  localization  of  the  states  (the  top 
four  bands,  39-42).  However,  it  is  also  important  to  note  that  there  is  also  a  weak 
interaction  of  the  sulfur  3pz  orbital  with  the  ^-subsystem  on  the  adjacent  ring.  The 
overall  picture  is  more  complicated  and  assignment  to  x  and  a  contributions  more 
difficult. 

The  approximate  summed  x  and  a  contributions  to  the  polarizability  for  each 
conformer  are  given  at  the  foot  of  Table  V.  A  twist  of  90°  results  in  a  sharp  reduction 
of  the  t r  and  slight  reduction  of  the  <r  contributions.  Additionally,  the  different  x 
contributions  distinguish  between  the  syn  and  anti  forms.  The  bands  which  con¬ 
tribute  most  to  the  polarizability  of  the  anti  conformer  have  the  same  form  as  the 
HOCO  of  alternate  all  tram  and  trans-cisoid  segments  of  polyacetylene,  while  for 
the  syn  conformer,  they  correspond  to  the  HOCO  of  trans-cisoid  polyacetylene.  The 
sulfur  p y  orbitals  do  not  contribute  in  either  case.  The  larger  polarizability  of  the 
anti  conformer  can  be  explained  bearing  in  mind  that  trans-transoid  (all  trans ) 
polyacetylene  is  more  polarizable  than  the  trans-cisoid  form  [34]. 

Conclusions 

In  this  paper  we  have  shown  the  attractiveness  of  polymeric  methods  to  compute 
the  asymptotic  polarizability  per  unit  cell  directly  and  their  ability  to  yield  useful 
scientific  insight  into  an  interesting  problem.  Unfortunately,  the  polymeric  calcu- 
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Figure  7.  3-2 1 G  valence  band  structures  of  polythiophene  at  two  different  interring  twist 
angles:  A:  180°,  and  B:  90°. 
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Table  Y.  3-2 1 G  valence  band  contributions  to  the  longitudinal  polarizability  per  unit  cell  at  inter¬ 
ring  twist  angles  of  180°,  90°,  and  0°.a 


Band 

OO 

o 

o 

90° 

0° 

19 

0.07 

0.07 

0.06 

20 

0.08 

0.08 

0.08 

21 

0.18 

0.18 

0.16 

22 

0.12 

0.20 

0.14 

23 

0.22 

0.17 

0.19 

24 

0.31 

0.29 

0.31 

25 

0.57 

0.67 

0.60 

26 

0.54 

0.69 

0.46 

27 

0.73 

0.57 

0.70 

28 

2.64 

0.85 

0.54 

29 

1.12 

2.61 

2.33 

30 

5.09 

4.59 

3.30 

31 

0.03 

3.43 

5.38 

32 

10.29 

8.00 

0.02 

33 

4.42 

5.87 

5.54 

34 

11.08 

5.79 

16.13 

35 

8.83 

6.28 

12.86 

36 

9.21 

3.70 

0.08 

37 

0.09 

7.06 

8.03 

38 

14.05 

7.74 

11.58 

39 

1.59 

3.03 

1.68 

40 

1.82 

5.57 

3.64 

41 

5.86 

31.60 

4.59 

42 

111.47 

28.76 

100.24 

aJN 

190.47 

127.85 

178.68 

Approx,  total  ir 

120.8 

69.0 

110.1 

Approx,  total  a 

69.7 

58.8 

68.6 

a  The  final  four  bands  are  of  7r-character. 


lations  of  the  polarizability  are  limited  to  an  uncoupled  scheme.  A  coupled  scheme 
for  polymers  has  been  developed,  but  due  to  the  substantial  computational  cost 
involved  its  implementation  has  been  restricted  to  model  cases  [35,36].  For  accurate 
polarizabilities,  the  corrections  brought  about  by  including  electron  correlation, 
vibrational  contributions,  and  the  frequency  dependent  character  of  the  polariza¬ 
bility  should  also  be  considered  [37]. 

There  is  much  scope  for  further  work,  considering  the  effects  of  ring  substituents 
on  the  polarizability,  and  also  the  ab  initio  study  of  the  electronic  properties  and 
polarizabilities  of  polythiophene  chains  in  helical  coil  conformations.  The  energetics 
of  such  systems  have  already  been  studied  at  the  mndo  level  [16]. 

The  potential  of  polythiophene  and  its  derivatives  offers  an  incentive  to  under¬ 
stand  fully  the  effect  of  substituents  on  the  structure,  electronic  properties  and 
linear  and  nonlinear  response. 
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Abstract 

Ab  initio  RHF/STO-3G  band  structure  calculations  are  carried  out  on  extended  regular  polypropylene 
chains  (isotactic  helical,  syndiotactic  helical,  and  zigzag  planar)  to  assess  qualitatively  the  dependence  of 
the  energy  band  structure  and  density  of  states  on  the  molecular  geometry.  It  is  found  that  discrimination 
between  the  two  syndiotactic  forms  should  be  possible  from  spectral  features  in  the  valence  region  of 
corresponding  PS  spectra.  ©  1 994  John  Wiley  &  Sons,  Inc. 


Introduction 

In  spite  of  the  appealing  properties  of  advanced  thermoplastics  which  perform 
in  175°C  plus  environments  under  harsh  and  corrosive  conditions  [1],  polyolefins 
like  polypropylene  still  have  a  bright  future.  The  reasons  are  not  only  the  low 
production  costs  of  polyolefins  relative  to  advanced  thermoplastics,  but  also  constant 
progress  in  olefin  polymerisation  leading  to  more  versatile  polymers  of  better  con¬ 
trolled  properties,  e.g.,  the  primary  structure.  Polyolefins  are  mostly  used  as  bulk 
materials  and  free-standing  films  [2],  yet  they  occur  in  an  increasing  number  of 
applications  as  films  deposited  or  grafted  on  substrates.  In  the  most  advanced  ap¬ 
plications,  information  on  the  chemical  composition  and  molecular  structure  of 
the  top  layers  of  such  films  is  needed  because  they  determine  the  nature  and  strength 
of  the  interactions  taking  place  between  the  polymer  surface  and  the  medium 
brought  in  contact  with  it  (ambient  atmosphere,  liquids,  deposited  metals,  other 
polymer  films,  cell  cultures,  etc.).  A  major  problem  with  probing  the  surface  mo¬ 
lecular  structure  is  that  it  has  to  be  carried  out  on  the  films  as  received,  a  condition 
that  disqualifies  many  of  existing  spectroscopies. 

Theoretical  predictions  [3]  first,  then  experimental  studies  have  pointed  out  that 
changes  in  the  secondary  structure  of  polyethylene  chains  [4]  at  the  sample  surface 
can  be  followed  in  the  valence  region  of  X-ray  photoelectron  spectra.  Similarly 
inspired  by  theoretical  predictions  [5],  very  recent  experiments  [6]  carried  out  in 
gas  phase  on  normal  and  cyclic  pentane  and  hexane  have  further  proved  that  the 
XPS  valence  region  is  very  sensitive  to  the  primary  and  secondary  structure  and 
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thus  should  be  another  useful  source  of  information  of  the  molecular  structure. 
With  these  premises  and  because  of  the  favorable  escape  depth  of  the  photoelectrons 
(1—10  nm).  X-ray  photoelectron  spectroscopy  (XPS)  could  then  be  particularly  well 
suited  to  obtain  information  of  the  molecular  structure  of  polymer  chains  at  the 
sample  surface. 

To  our  knowledge,  only  three  xps  measurements  have  been  reported  on  the 
valence  region  of  polypropylene  (PP)  [7-9],  but  with  very  little  attempt  if  any  to 
relate  the  data  to  the  chain  molecular  structure  of  PP  chains  at  the  surface.  The 
theoretical  situation  on  polypropylene  is  not  better  since  reported  calculations  have 
only  been  carried  out  at  the  extended  Huckel  level  [7,10,1 1],  which  is  not  really 
suitable  for  describing  the  interactions  responsible  for  the  changes  in  the  molecular 
structure  (primary  and  secondary).  The  purpose  of  this  work  is  to  carry  out  ex¬ 
ploratory  ab  initio  rhf/sto-3G  calculations  on  extended  regular  polypropylene 
chains  in  order  to  assess  qualitatively  the  dependence  of  the  energy  band  structure 
on  the  primary  and  secondary  structures  and  its  possible  future  exploitation  in 
more  applied  situations. 


Methodology 

In  this  section  we  introduce  the  model  structures  and  provide  justification  for 
the  level  of  the  theory  used  in  this  preliminary  contribution. 

Work  Motivation  and  Model  Structures 

Polypropylene,  [ — CH2 — CH(CH3) — ]x,  is  polymerized  stereoselectively  using 
Ziegler-Natta  catalysts.  Isotactic  PP  is  obtained  by  heterogeneous  catalysis  with 
TiCl3/Al(C2H5)3  and  its  chain  structure  is  found  to  be  a  (3/1)  helix  [12],  hereafter 
denoted  /7-(TG)3.  Syndiotactic  PP  can  be  prepared  by  homogeneous  catalysis  with 
VC14/A1(C2H5)2C1.  In  the  syndiotactic  configuration  two  conformations  have  been 
reported:  a  (4/1)  helix  [13]  and  a  planar  zigzag  form  [14],  respectively  denoted  st- 
(T2G2)2  and  st-( PZZ).  The  st-{ pzz)  conformation  can  be  prepared  by  rapid  cooling 
in  ice  water  of  the  helical  form  sf-(T2G2)2  previously  melted  at  140°C,  and  subse¬ 
quently  stretching.  Infrared  and  X-ray  diffraction  (drx)  measurements  indicate 
that  under  such  conditions  the  st-{ T2G2)2  sample  is  completely  converted  to  the 
zigzag  planar  st-( pzz)  form.  The  original  helical  s/-(T2G2)2  structure  can  be  reversibly 
recovered  by  heating  the  st~( pzz)  form  at  100°C  for  a  few  hours. 

When  it  comes  to  molecular  structure  at  the  surface,  however,  the  situation  is 
not  as  clearcut  as  in  bulk  because  infrared,  which  has  a  sampling  depth  of  1  /im 
and  more,  and  drx  are  poor  detectors  of  the  surface  characteristics.  It  is  known, 
for  example,  that  in  bulk  the  high  molecular  weight  polyethylene  oxide)  chains, 

[ — CH2 — CH2 — O — ]x ,  adopt  a  zigzag  planar  conformation,  while  those  of  low 
molecular  weight  are  (7/2)  helices  [15].  Thanks  to  combined  theoretical  and  XPS 
studies  [16],  it  has  been  pointed  out  that  polyethylene  oxide)  chains  at  the  surface 
sample  are  dominantly  in  the  helical  form,  irrespective  of  their  molecular  weight. 
Accordingly,  it  cannot  be  concluded  without  a  further  check  that  at  the  surface 
sample  the  polypropylene  syndiotactic  chains  undergo  similar  changes  as  in  the 
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bulk  during  the  above-mentioned  heat  treatments.  The  purpose  of  the  present  work 
is  therefore  to  establish  a  preliminary  theoretical  basis  for  forthcoming  xps  mea¬ 
surements  on  PP  samples  treated  in  the  way  just  described. 

In  this  exploratory  study  we  confine  our  interest  to  a  comparison  of  the  electronic 
band  structure  of  the  regular  *7-(TG)3,  ,rt-(T2G2)2,  and  st-( pzz)  extended  PP  chains. 
In  constructing  the  model  structures  shown  in  Figure  1,  standard  bond  distances 
have  been  used  (0.154  and  0.108  nm  for  C — C  and  C — H,  respectively)  and 
tetrahedral  valence  angles  have  been  assumed  for  C — C — C,  C — C — H,  and 
H— C—H. 


Figure  1 .  Top  and  side  views  of  the  molecular  structure  of  polypropylene  chains  in  (a) 
isotactic  helical  z'/-(TG)3,  (b)  syndiotactic  helical  ^-(T2G2)2,  and  (c)  syndiotactic  planar 

zigzag  st-( pzz)  forms. 
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Calculations 

The  analysis  in  this  work  is  based  on  band  structures  [17,18]  computed  at  the 
rhf  level  with  the  PLH  program  [19],  using  the  STO-3G  basis  set  [20].  The  plh 
program  includes  long-range  coulomb  interactions  which  are  accounted  for  via  a 
multipole  expansion  technique  including  all  monopole-quadrupole  and  dipole- 
dipole  interactions.  For  the  three  forms,  the  number  of  unit  cells  in  the  short  and 
intermediate  region  are  7  (—3  to  +3)  and  13  (—6  to  +6),  respectively,  which  is 
found  sufficient  to  ensure  proper  convergence  of  the  results  with  respect  to  the 
lattice  summations.  The  threshold  for  the  two-electron  integrals  and  the  criterion 
for  convergence  on  the  density  matrix  elements  are  set  to  10-7  a.u.  and  10~5,  re¬ 
spectively. 

The  choice  of  this  theoretical  level  stems  from  previously  successful  comparisons 
[3-6]  of  similar  calculations  with  corresponding  experimental  data  on  the  closely 
related  linear  and  cyclic  alkanes  as  well  as  polyethylene.  First,  they  show  that  the 
electronic  structure  signature  of  polyethylene  chain  conformations  depends  on  the 
strength  of  hyperconjugative  interactions  between  the  — CH2 —  moieties  along  the 
polymer  spine.  Therefore,  it  is  important  to  rely  on  scf  approaches  to  allow  the 
electron  charge  distribution  to  relax  according  to  the  conformational  modifications 
imposed.  Second,  the  main  features  of  the  one-electron  picture  are  conserved  in 
the  case  of  saturated  hydrocarbons,  except  possibly  in  the  high  energy  region  of  the 
inner  valence  band  [3,5,6].  Since  most  of  the  conformational  changes  take  place 
in  the  outer  part  of  the  inner  valence  band  and  in  the  outer  valence  band,  band 
structure  calculations  can  be  fruitfully  and  safely  used  for  qualitative  searches  of 
structural  information  in  the  xps  valence  band  of  saturated  hydrocarbons.  It  was 
stressed,  however,  that  a  similar  approach  would  be  totally  inappropriate  in  the 
case  of  conjugated  polymers  as  recently  shown  experimentally  [21]  and  theoretically 
[22].  Finally,  in  all  these  studies  the  STO-3G  minimal  basis  has  been  found  to 
perform  quite  satisfactorily  compared  to  the  other  bases,  an  observation  that  has 
motivated  our  choice  for  this  basis  set  in  the  context  of  this  exploratory  contribution. 

Results  and  Discussion 

In  this  section  we  briefly  compare  the  relative  stability  of  the  three  PP  chain 
structures  as  calculated  in  this  work  before  considering  salient  differences  in  their 
electronic  structure. 

Relative  Stability  of  the  Chains 

As  intuitively  expected,  calculations  predict  very  comparable  total  energies  ET 
for  the  three  structures.  It  must  be  kept  in  mind  that  these  calculations  have  been 
carried  out  on  model  structures  built  with  standard  bond  lengths  and  angles.  Nev¬ 
ertheless,  it  is  interesting  to  find  that  the  isotactic  and  syndiotactic  helical  structures, 
respectively  /7-(TG)3  and  tf-(T2G2)2,  are  very  comparable  in  energy,  differing  by 
less  than  1.6  kJ  mol-1.  This  difference  is  nearly  five  times  larger,  amounting  to  7.3 
kJ  mol-1  for  the  two  syndiotactic  structures,  i.e.,  ^-(T2G2)2  and  st-( pzz).  In  the 
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case  of  the  isotactic  structure,  it  is  easy  to  understand  that  the  formation  of  a  helix 
is  favored  because  it  reduces  the  steric  interactions  between  methyl  groups,  which 
would  be  much  too  close  in  a  straight  zigzag  planar  form.  In  the  syndiotactic  case 
the  situation  is  less  clearcut  because,  due  to  their  alternate  locations  with  respect 
to  the  carbon-carbon  backbone,  the  methyl  groups  are  much  further  apart  than 
they  would  be  in  a  hypothetical  zigzag  planar  isotactic  form.  Thus,  it  appears  that 
the  interactions  between  the  methyl  groups  and  the  neighboring  hydrogens  are  still 
too  large  and  impose  to  the  chain  its  helical  sr-(T2G2)2  structure.  This  point,  however, 
would  require  further  work  with  a  more  sophisticated  theoretical  description  to  be 
properly  analysed. 

Despite  the  present  level  of  the  theory  and  the  arbitrariness  of  geometrical  pa¬ 
rameters  used  in  this  exploratory  study  to  build  the  chain  structures,  if  7.3  kJ  mol  1 
is  a  realistic  energy  difference,  then  packing  interactions  should  be  the  factors  ul¬ 
timately  controlling  the  molecular  structure  adopted  by  the  syndiotactic  form  as  a 
function  of  temperature.  This  alone  is  important  for  the  main  purpose  of  this 
contribution  since  chain  interactions  at  the  surface,  and  therefore  molecular  struc¬ 
ture,  could  be  somewhat  different  than  in  the  bulk.  It  also  raises  an  interesting 
question  as  to  the  long-term  stability  of  the  planar  zigzag  form  at  low  temperatures 
and  ultimately  the  role  of  chain  interactions  in  this  situation. 

Electronic  Structure 

In  this  subsection  we  examine  the  difference  in  the  shape  of  band  structures  and 
their  corresponding  density  of  states  and  make  some  comments  on  the  possible 
observation  of  these  changes  by  actual  xps  experiments.  The  band  structures  of 
the  three  selected  structures,  //-(TG)3,  5/-(T2G2)2,  and  st-( pzz),  shown  in  Figures 
2a,  2b,  and  2c,  look  apparently  different,  but  this  is  only  due  to  the  difference  in 
the  number  of  — CH2— CH(CH3)—  moieties,  respectively  three,  four,  and  two, 
needed  to  constitute  the  translational  repeat. 

Considering  first  the  two  helical  structures,  z7-(TG)3  and  s/-(T2G2)2,  the  overall 
one-electron  energy  dispersion  curves  with  respect  to  k  are  rather  similar  and  lead 
to  closely  related  density  of  states  graphs,  D(E),  in  Figures  3a  and  3b.  This  obser¬ 
vation  fits  with  the  one-electron  energies  which  delineate  the  inner-  and  outer- 
valence  regions  (Table  1).  There  are  differences,  however,  in  the  D{E)  graphs  that 
are  worth  stressing  in  view  of  XPS  detection.  We  will  limit  our  attention  to  three 
regions,  respectively  denoted  I,  II,  and  III  in  Figure  3. 

Owing  to  the  relatively  high  photoionization  cross  sections  of  C2s  atomic  functions 
prevailing  in  the  description  of  the  one-electron  states  in  the  inner  valence  region, 
the  apparently  minor  inverted  relative  intensities  of  the  two  outermost  peaks,  I  in 
Figure  3,  should  be  detectable  even  if  it  is  less  marked  than  in  the  case  of  the  folds 
in  polyethylene  chains  [3],  In  region  II  (Fig.  3),  characteristic  of  the  methyl  groups, 
two  lines  are  predicted  in  the  case  of  the  syndiotactic  helix,  .s7-(T2G2)2,  while  only 
one  peak  is  obtained  in  the  case  of  the  isotactic  case,  it-( TG)3.  However,  it  will 
require  high  resolution  instruments  such  as  the  scienta  300  to  reveal  such  a  dif¬ 
ference  which  simply  could  amount  to  variable  peak  widths. 


Energy  (a.u.)  Energy  (a.u.) 
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k-point 

(C) 

Figure  2.  (c)  Syndiotactic  planar  zigzag  st-( PZZ)  form.  Energies  are  expressed  in  atomic 

units. 

The  outer  valence  region,  III  in  Figure  3,  which  appears  to  be  quite  different  for 
the  it-( TG)3  and  s?-(T2G2)2  forms,  could  in  principle  be  used  as  a  source  to  discrim¬ 
inate  the  primary  structure  of  polypropylene  chains.  Caution  should  be  exercised 
in  this  case,  however,  because  in  the  outer  valence  region  the  atomic  functions 
representing  the  one-electron  states  are  dominantly  C2p  and  their  relative  photoion¬ 
isation  cross  sections  are  at  least  10  times  smaller  than  the  C2s  ones.  In  addition  to 
a  resulting  lower  intensity,  the  individual  levels  will  also  be  more  differently  affected 
by  these  cross-section  effects  than  the  inner  valence  states  [3b-d,5,6].  Despite  the 
neglect  of  cross-section  and  many-body  effects  in  the  present  calculations,  we  can 
only  acknowledge  the  fact  that  this  region  is  potentially  rich  in  information  on  the 
molecular  structure;  but  more  advanced  treatments  using  a  Green’s  function  ap¬ 
proach  should  be  considered,  e.g.,  as  in  Refs.  [3b-d,5,6].  For  instance,  with  such  a 
treatment  it  was  theoretically  pointed  out  [5]  and  experimentally  verified  [6]  that 
this  region  contains  information  that  allows  the  differentiation  between  two  types 
of  folds  at  the  surface  of  polyethylene  lamellae.  Thus,  with  this  proviso  of  cross- 
section  and  electron  correlation  effects,  the  XPS  outer  valence  region  is  likely  to  be 
different  in  both  isotactic  and  syndiotactic  helices,  as  suggested  by  the  differences 
in  the  shape  of  region  III  in  Figures  3a  and  3b;  but,  as  in  the  case  of  the  levels 
signing  the  methyl  groups,  high  resolution  experiments  will  be  needed  to  assess 
these  predictions. 

In  our  problem,  detecting  conformations  by  xps  is  a  more  interesting  goal  than 
discriminating  the  primary  structure.  Indeed,  before  deposition  as  thin  films  on 
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Figure  3.  (c)  Syndiotactic  planar  zigzag  sf-(PZZ)  form.  Energies  are  expressed  in  atomic 

units. 


substrates,  the  primary  structure  of  the  polymer  chains  can  easily  be  assessed  by 
nmr,  but  it  is  less  obvious  to  probe  the  conformation  of  the  chains  at  the  surface 
of  the  polymer  film  once  deposited.  It  is  precisely  this  kind  of  difference  that  exist 
between  the  syndiotactic  s/-(T2G2)2  and  st-( pzz)  forms  that  we  are  addressing  now. 

In  the  case  of  the  rt-(T2G2)2  and  st-( pzz)  forms,  one  notices  that  the  relative 
intensity  ratio  of  the  two  peaks  in  the  region  I  of  the  density  of  states  is  significantly 
more  different  than  found  previously  when  comparing  the  two  helical  structures 


Table  I.  Total  energy  ET  per  —  CH2— CH(CH3)—  fragment  of  the  polymer  chain  and  selected 
one-electron  energies  cXtV  of  the  band  structure  (i  =  o  or  i,  respectively  for  outer  and  inner  valence, 
and  y  =  h  or  /,  respectively  for  highest  or  lowest  energy). 


Et 

it< TG)3 
-115.732323 

^-(T2G2)2 

-115.732916 

tf-(PZZ) 

-115.730098 

Inner  valence 

Hi 

C/  .h 

-1.084 

-1.088 

-1.086 

-0.689 

-0.687 

-0.705 

Outer  valence 

C0j 

-0.623 

-0.632 

-0.627 

Coh 

-0.357 

-0.365 

-0.354 
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z7-(TG)3  and  ^-(T2G2)2.  More  important  from  an  experimental  viewpoint  is  that 
the  ratio  of  the  low  energy  peak  of  region  I  compared  to  peak  II  is  also  very  different 
in  structures  ^-(T2G2)2  and  st-(PZZ).  This  behavior  has  been  understood  in  recent 
work  on  polyethylene  folds  [3a-b].  The  intensity  rise  of  the  low  energy  peak  in 
region  I  is  related  to  the  weak  dispersion  of  the  topmost  band  of  the  inner  valence 
region  around  —0.70  a.u.  as  can  be  seen  in  Figure  2c  (see  also  Table  1).  With  respect 
to  the  corresponding  band  in  s/-(T2G2)2,  a  significant  energy  stabilisation  of  the 
order  of  0.5  eV  is  noted  for  st-( PZZ).  This  originates  in  a  sizeable  through-space 
interaction  of  hyperconjugative  nature  building  up  in  the  zigzag  planar  form  between 
the  — CH2 —  moieties  oriented  in  a  parallel  direction.  This  particular  orbital  mixing, 
also  referred  to  as  sigma  conjugation  along  the  polymer  backbone,  has  been  pre¬ 
viously  invoked  [23]  to  explain  the  extremely  weak  absorption  intensity  in  the 
infrared  spectra  of  the  — CH2 —  wagging  motions  of  the  all-staggered  conformation 
of  polyethylene.  The  net  result  of  this  stabilization  and  weak  energy  dispersion  with 
respect  to  k  is  a  higher  density  of  state  at  —0.70  a.u.  and  a  higher  intensity  peak  in 
the  region  I.  As  in  the  case  of  polyethylene  folds,  the  shift  and  intensity  variation 
should  be  detected  by  xps  provided,  of  course,  that  the  chains  are  dominantly  in 
a  zigzag  planar  conformation  at  the  surface  of  the  cooled  and  stretched  syndiotactic 
polypropylene.  These  features  should  be  useful  signatures  of  the  conformational 
changes  that  can  take  place  in  the  chains  at  the  surface  of  polymer  films. 

Regarding  the  outer  valence  region  III,  we  will  limit  our  comments  to  those 
already  made  in  the  case  of  the  two  helical  structures.  They  definitely  show  differ¬ 
ences  that  should  be  traced  in  actual  spectra  provided  the  intensity  is  not  too  low. 
In  any  event,  more  advanced  treatments  including  effects  of  the  photoionization 
cross  section  and  electron  correlation  will  be  needed  to  provide  a  better  basis  for 
interpretation. 


Concluding  Remarks 

The  goal  of  this  work  was  to  assess  the  dependence  of  the  electronic  structure  of 
polypropylene  chains  on  their  primary  and  secondary  structures  and  the  extent  to 
which  the  changes  can  constitute  fingerprints  for  xps  experiments. 

The  main  result  of  this  exploratory  and  rather  simple  study  is  that  the  two  reported 
conformations  of  syndiotactic  polypropylene,  s/-(T2G2)2  and  st-(pzz),  could  be  de¬ 
tected  from  features  of  their  XPS  valence  bands,  especially  the  relative  intensities 
of  two  main  peaks  in  the  low  energy  regions  I  and  II  of  the  inner  valence  band.  In 
that  respect,  it  worth  stressing  the  consistency  between  vibrational  and  electronic 
structure  information:  They  both  point  to  the  importance  in  extended  systems  of 
hyperconjugative  interaction  between  the  — CH2 —  moieties  of  the  polymer  and 
their  role  in  “transmitting”  the  conformational  message.  It  should  be  added  that 
the  majority  of  vinyl  polymers  are  based  on  repeat  units  of  the  general  type, 
— CH2 — CRjR2 — ,  and  thus  are  likely  to  have  such  sigma  conjugation  operating 
along  their  backbone  depending  on  the  chain  structure.  More  advanced  descriptions 
of  the  electronic  structure  of  polypropylene  are  now  in  progress  [24];  they  will  be 
paralleled  with  xps  measurements  on  carefully  designed  heat  treatments  of  poly¬ 
propylene  samples  [25]. 
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It  should  be  stressed  at  this  point  that  effects  of  primary  and  secondary  structures 
on  the  electronic  structures  cannot  really  be  taken  into  account  separately  in  model 
calculations.  For  example  and  along  the  same  line,  theoretical  calculations  [26] 
have  recently  pointed  out  the  danger  in  relying  on  straightforward  correlations 
between  NMR  and  infrared  results  to  afterwards  assess  the  isotacticity  percentage 
in  polyacrylonitrile  from  infrared  data  alone. 

Regarding  the  conformation  of  polyolefin  chains  in  the  bulk,  it  is  interesting  to 
note  that  the  polarizability  per  — C2H4—  unit  in  polyethylene  is  roughly  2%  higher 
in  zigzag  planar  form  than  in  other  conformations  [27].  Owing  to  the  role  played 
by  the  electric  polarizability  on  the  strength  of  intermolecular  interactions,  this 
observation  might  provide  another  clue  as  to  why  alkane  chains  favor  rather  sys¬ 
tematically  the  straight  zigzag  planar  form,  even  when  quickly  cooled  down  on 
cold  substrates  from  the  gas  to  solid  phase.  In  relation  with  syndiotactic  polypro¬ 
pylene,  it  is  conceivable  that,  when  thermal  agitation  is  reduced,  the  zigzag  planar 
form  is  actually  preferred,  most  likely  because  of  synergetic  interactions  between 
the  chains  in  which  polarizability  is  enhanced  by  hyperconjugation  mechanisms. 
Polarizability,  being  an  extensive  property,  has  ample  opportunity  to  become  de¬ 
cisive  with  respect  to  the  preferred  conformation  adopted  by  polyolefin  chains. 
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Abstract 

We  report  some  of  our  recent  results  from  theoretical  modeling  of  the  interaction  between  metals  and 
^-conjugated  molecules.  We  apply  the  semiempirical  Austin  Model  1  method  for  the  investigation  of 
two  fundamentally  different  systems:  sodium  interacting  with  diphenylpolyenes  and  aluminum  interacting 
with  poly(p-phenylenevinylene)  and  derivatives.  In  the  former  case,  electronic-structure  calculations 
are  also  performed  using  the  nonempirical  pseudopotential  Valence  Effective  Hamiltonian  (  veh)  tech¬ 
nique.  For  sodium  interacting  with  diphenylpolyenes,  we  investigate  the  geometric  and  electronic  structure 
modifications  that  are  induced  upon  charge  transfer  in  a  series  of  diphenylpolyenes  with  an  even  number 
of  carbons  (from  stilbene  to  a,w-diphenyltetradecaheptaene,  i.e.,  one  to  seven  double  bonds  in  the  polyene 
part  of  the  molecule).  Densities  of  valence  states  generated  from  the  veh  calculations  are  directly  compared 
to  experimental  ultraviolet  photoelectron  spectroscopy  valence  band  spectra;  these  are  recorded  during 
successive  sodium  exposure  of  the  molecular  solids.  The  charge-storage  states  in  the  series  are  discussed 
in  terms  of  soliton-antisoliton-pairs  and  polaron-like  states  induced  upon  doping  (reduction).  Introducing 
aluminum  atoms  onto  poly  (p -phenylene  vinylene )  systems  allows  us  to  study  the  initial  stages  of  interface 
formation.  We  find  that  aluminum  atoms  preferentially  react  with  the  vinylene  linkages  in  both  poly(p- 
phenylenevinylene)  and  poly(2,5-dimethoxy-p-phenylenevinylene).  When  carbonyl  groups  appear  on 
the  side  of  the  chains,  as  in  poly(2,5-dialdehyde-/?-phenylenevinylene),  new  reactive  sites  are  induced, 
leading  to  structures  with  stabilities  comparable  to  those  in  the  most  stable  configurations  involving  a 
single  vinylene  group.  In  all  three  systems  investigated,  the  interaction  with  aluminum  induces  major 
modifications  of  the  polymer  chains  with  interruptions  of  the  ir-system  caused  by  formation  of  spMike 
defects.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Conjugated  polymers  constitute  a  class  of  materials  that  has  attracted  much  at¬ 
tention  during  the  past  decade.  Initially,  studies  mainly  focused  on  the  ability  of 
doping  conjugated  polymers  to  high  electrical  conductivities;  considerable  work 
has  been  devoted  to  the  understanding  of  the  charge  storage  mechanisms  in  these 
materials  [1,2].  Nowadays,  it  is  well  established  that  excess  electronic  charges,  in¬ 
troduced  via  redox  chemistry  either  by  electron  acceptors  or  donors,  are  accom¬ 
modated  as  charged  solitons  in  trans-polyacetylene,  due  to  the  degeneracy  of  the 
ground  state  [  3-5  ] .  The  extension  of  these  soliton  wave  functions  is  about  1 5-20 
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carbons  in  long  polyene  chains  [  5  ] .  However,  theoretical  studies  have  suggested 
that  the  width  of  the  solitons  shrinks  as  the  length  of  the  polyene  chain  decreases, 
down  to  an  extension  of  about  3-5  carbon  sites  in  tetradecaheptaene  [6] .  A  recent 
ultraviolet  photoelectron  spectroscopy  study  of  the  sodium  doping  of  a,co-diphen- 
yltetradecaheptaene  supports  this  prediction;  the  charge-storage  states  appear  to  be 
in  the  form  of  soliton-antisoliton  pairs,  confined  within  the  14-carbon  long  polyene 
chain  [7]. 

These  quasi-one-dimensional  systems  also  possess  many  other  interesting  features 
such  as  nonlinear  excitations,  very  large  nonlinear  optical  responses,  as  well  as 
remarkable  semiconducting  properties.  The  latter  have  made  it  possible  to  use 
some  of  the  7r-conjugated  materials  as  active  components  in  electronic  or  optoelec¬ 
tronic  devices;  in  particular  devices  such  as  Schottky  diodes  [  8  ] ,  field-effect  tran¬ 
sistors  [8],  and  light-emitting  diodes  (LED’s)  [9-14]  have  been  demonstrated. 

A  number  of  LED’s  based  on  different  conjugated  polymers  have  been  reported. 
The  color  of  the  emitted  light  can  be  tuned,  from  blue  [13]  to  green  /yellow  [9]  or 
to  red  [10] ,  by  either  choosing  the  appropriate  polymer,  modifying  a  specific  polymer 
by  means  of  attaching  side  groups,  or  using  copolymers.  The  polymer  led’s  show 
promising  characteristics  and  unique  properties  such  as  flexibility  [12]  and  the 
possibility  of  emission  of  polarized  light  (a  first  step  towards  polarized  light-emitting 
diodes  has  been  the  observation  of  polarized  photoluminescence  from  highly  ori¬ 
ented  polymer  blends  [15]).  The  most  common  polymers  in  led’s  nowadays  are 
poly(/?-phenylenevinylene),  ppv,  and  its  derivatives  [9-12].  In  polymer-based 
LED’s,  the  polymer  is  sandwiched  between  two  metal  electrodes.  Electrons  and 
holes  are  injected  from  the  metal  contacts  into  the  polymer  and  negatively  and 
positively  charged  polarons  are  formed.  These  charged  polarons  migrate  under  the 
influence  of  the  applied  field  and  when  oppositely  charged  species  find  each  other, 
they  form  polaron-excitons  which  can  then  undergo  radiative  recombination. 

The  interfaces  between  the  electroactive  polymer  and  the  metal  electrodes  thus 
represent  important  characteristics  of  the  devices.  Low  workfunction  metals  such 
as  aluminum  and  calcium  are  used  for  the  electron  injecting  contact,  while  the  hole 
injecting  contact  usually  consists  of  indium-tin-oxide  or  in  some  cases,  the  highly 
conducting  form  of  polyaniline  [12] .  X-ray  photoelectron  spectroscopy  (XPS)  studies 
of  A1  on  ppv  [16]  and  A1  on  poly(/?-phenylene)  [17]  have  indicated  the  presence 
of  significant  amounts  of  oxygen  at  the  metal  /polymer  interface.  The  exact  influence, 
on  the  device  function,  of  the  oxygen  present  at  the  interface  is  not  fully  understood 
at  the  moment;  interfacial  oxygen  can  build  up  during  the  metal  evaporation  [17] 
or  be  introduced  during  film  preparation.  Examples  of  the  latter  case  can  be  found 
in  the  literature;  results  from  XPS  studies  of  ppv  surfaces  show  that  oxygen  is  present 
in  significant  amounts  [18-20],  although  the  precise  form  of  the  oxygen  in  these 
samples  is  not  clear.  In  contaminated  polyphenylquinoxaline,  both  C  —  O  and 
C  =  0  species  were  identified  by  xps  measurements  [21]. 

The  use  of  conjugated  molecules  or  oligomers  of  conjugated  polymers  as  models 
for  the  conjugated  polymers  themselves  has  been  an  approach  extensively  used 
during  the  last  few  years  [  22-29  ] .  For  example,  many  studies  have  been  devoted 
to  the  investigation  of  the  interaction  between  metals  and  conjugated  molecules, 


METALS  AND  tt-CONJUGATED  POLYMERS 


483 


including  oligomers  of  conjugated  polymers  [25,30-34] .  These  studies  can  be  helpful 
for  the  understanding  of  the  interface  formation  between  metals  and  conjugated 
systems  as  well  as  the  possible  charge-storage  configurations  induced  by  charge 
transfer.  In  addition,  information  from  this  type  of  model  studies  can  contribute 
significantly  to  the  understanding  of  the  real  conjugated  polymer  systems;  it  might 
be  possible  to  predict,  on  the  basis  of  the  oligomers,  the  behavior  in  the  polymeric 
case.  The  study  of  oligomers  of  conjugated  polymers  and  of  conjugated  molecules 
is  attracting  increasing  interest  also  due  to  the  fact  that  these  molecular  materials 
can  themselves  be  exploited  in  device  applications,  as  for  example  in  the  all-organic 
transistor  developed  by  Gamier  et  al.  where  a-sexithienyl  has  been  used  as  the 
active  component  [  35  ] . 

In  this  contribution,  we  review  the  results  of  some  of  our  recent  theoretical  studies 
on  the  interaction  between  metals  and  conjugated  molecules.  We  have  chosen  to 
present  two  fundamentally  different  systems;  the  interaction  between  sodium  and 
diphenylpolyenes  and  that  between  aluminum  and  poly (^-phenylenevinylene )  and 
derivatives;  this  allows  us  to  study  the  initial  stages  of  metal /conjugated  polymer 
interface  formation. 

In  the  sodium/ polyene  study,  we  have  considered  a,co-diphenylpolyenes  with  1 
to  7  double  bonds  in  the  polyene  part  of  the  molecules.  The  molecules  are  hereafter 
denoted  DPx  (x  =  1-7),  where  DP  stands  for  diphenyl  and  x  is  the  number  of 
double  bonds  in  the  polyene  segment.  The  molecular  structures  are  shown  in  Figure 
1(a).  The  theoretical  results  are  compared  to  XPS  and  ultraviolet  photoelectron 


a) 


b) 


Figure  1.  Molecular  structures  of:  (a)  the  diphenylpolyene  series,  where  x  denotes  the 
number  of  double  bonds  in  the  polyene  part;  and  (b)  the  oligomers  of  poly(/?-phenyle- 
nevinylene)  where  the  substituents  refer  to:  Rl,  poly(/?-phenylenevinylene)  itself;  R2, 
poly(2,5-dimethoxy-/?-phenylenevinylene);  and  R3,  poly(2,5-dialdehyde-Jp-phenylene- 

vinylene). 
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spectroscopy  (ups)  data  recorded  during  successive  sodium  exposure  of  the  di- 
phenylpolyene  molecular  solids  [  36  ] . 

We  then  discuss  the  results  of  calculations  on  aluminum  interacting  with  ppv 
and  poly(2,5-dimethoxy-;?-phenylenevinylene),  DMeOPPV.  Due  to  the  lack  of 
current  information  about  the  exact  configuration  of  oxygen  in  contaminated  sur¬ 
faces  of  PPV,  we  have  also  chosen  to  investigate  some  possible  effects  of  oxygen  by 
incorporating  carbonyl  groups  on  the  side  of  the  polymer  chains.  In  this  case,  we 
actually  replaced  the  MeO  groups  along  the  DMeOPPV  chains  by  carbonyl  groups 
to  form  poly  (2, 5-dialdehyde-;? -phenylenevinylene),  dappv.  The  molecular  struc¬ 
tures  are  shown  in  Figure  1(b). 

Theoretical  Methodology 

Geometry  optimizations  have  been  performed  on  the  pristine  and  metal /molecule 
systems,  using  the  semiempirical  Austin  Model  1  (amI)  method  [37],  which  is 
known  to  yield  reliable  geometries  for  large  organic  molecules.  The  experimental 
XPS  data  on  the  sodium/ diphenylpolyene  interaction  indicate  the  appearance  of 
ionic  sodium  atoms,  i.e.,  there  occurs  complete  charge  transfer  between  the  sodiums 
and  the  DPx  molecules  [  36  ] .  The  interaction  can  thus  be  considered  as  corre¬ 
sponding  to  a  doping  (reduction)  of  the  DPx  molecules  by  sodium.  The  doping  is 
simply  simulated  in  the  calculations  by  adding  one  or  two  electrons  to  the  molecules 
in  the  case  of  singly  and  doubly  doped  systems  (i.e.,  effectively  taking  into  account 
one  or  two  sodiums  per  molecule,  respectively).  In  the  case  of  doubly  charged 
systems,  calculations  including  explicitly  the  counter-ions  have  also  been  car¬ 
ried  out. 

From  experimental  data,  it  is  deduced  that  the  diphenylpolyenes  are  almost  totally 
planar  in  the  molecular  solids.  The  tilt  angles  of  the  phenyl  groups  with  respect  to 
the  polyene  plane  have  been  measured  to  be  on  the  order  of  5.4°  in  the  case  of 
DP4  [38]  and  about  3-7°  for  stilbene  (DPI )  [39-42].  Such  small  torsion  angles 
have  negligible  effects  on  the  electronic  structure;  the  molecules  have  therefore  been 
taken  as  fully  planar  in  all  the  calculations.  The  am  1 -optimized  geometries  of  the 
neutral  and  doubly  charged  diphenylpolyenes  have  served  as  input  for  electronic- 
structure  calculations  performed  using  the  Valence  Effective  Hamiltonian  (veh) 
pseudopotential  method  [43,44];  the  applicability  of  the  veh  approach  in  inter¬ 
preting  photoelectron  valence  band  spectra  of  conjugated  systems  is  well  established 
[45-48].  In  addition,  VEH  provides  accurate  estimates  of  the  location  of  the  gap 
states  in  doped  conjugated  systems  [49]. 

In  order  to  make  a  direct  comparison  between  the  VEH-calculated  electronic 
densities  of  valence  states  (dovs)  and  the  experimental  solid-state  UPS  valence 
band  spectra,  a  broadening  by  convolution  with  a  gaussian,  whose  full  width  at 
half  maximum  is  taken  to  be  0.7  eV,  is  applied  to  the  calculated  electronic  levels. 
This  is  in  order  to  simulate  the  experimental  resolution  as  well  as  peak  broadening 
due  to  solid-state  effects,  e.g.,  intermolecular  interactions. 

The  appearance  of  charges  in  conjugated  molecules  usually  leads  to  strong  ge¬ 
ometry  relaxations  [5,50];  a  single  charge,  i.e.,  a  radical-ion,  is  then  referred  to  as 
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a  polaron  in  condensed  matter  physics  terminology;  two  charges  on  the  molecule, 
i.e.,  a  di-ion,  correspond  to  a  so-called  bipolaron  or  confined  soliton-antisoliton 
pair  [5,50].  We  will  be  using  this  terminology  throughout  this  article;  also,  the 
lowest  electronic  transition  will  be  referred  to  as  the  bandgap  since  we  are  dealing 
with  molecular  solids. 

In  order  to  model  the  reaction  between  aluminum  and  ppv  compounds,  we  have 
considered  oligomers  of  ppv,  DMeOPPV,  and  dappv  as  representatives  for  the  poly¬ 
mers.  The  results  of  a  preliminary  study  of  the  interaction  between  A1  and  ppv 
have  been  reported  recently  [51];  the  ppv  chain  was  in  that  case  simply  represented 
by  a  trans- stilbene  molecule;  here,  we  have  considered  oligomers  consisting  of  three 
monomer  units  capped  by  the  appropriate  phenyl  group  for  each  system  investigated. 
Recently,  a  systematic  study  of  the  interface  between  A1  and  trans- polyenes  indicated 
that  the  Modified  Neglect  of  Diatomic  Overlap  (mndo)  and  AMl  methods  are  able 
to  describe  adequately  the  metal /polymer  interactions,  in  comparison  to  ab  initio 
Hartree-Fock  methods  including  polarization  functions  [31].  Torsions  between 
different  units  along  the  chains  can  occur  in  the  case  of  ppv  and  its  derivatives; 
such  torsions  are  treated  more  accurately  by  the  AMl  method  than  the  mndo 
method,  which  explains  our  choice  of  the  former. 

Full  geometry  optimizations  for  pairs  of  A1  atoms  reacting  with  the  three  ppv  - 
based  polymers  have  been  carried  out  for  a  wide  number  of  possible  configurations, 
leading  to  different  local  energy  minima.  The  A1  atoms  have  been  introduced  on 
the  central  units  of  the  trimers,  i.e.,  within  the  region  indicated  by  the  dashed 
parentheses  in  Figure  1(b).  Estimates  of  the  charge  distributions  have  been  obtained 
following  a  simple  Mulliken  population  analysis;  since  this  approach  is  strongly 
method  dependent,  we  are  interested  only  in  the  qualitative  trends  in  the  charge- 
distribution  evolutions. 


Sodium  Interacting  with  Diphenylpolyenes 


Neutral  Systems 

The  AMl  optimized  geometries  of  the  neutral  DP7  to  DPI  molecules  are  shown 
in  Table  I.  The  optimizations  results  in  very  similar  bond-length  differences  between 
the  central  single  and  double  bonds  throughout  the  whole  series;  the  bond-length 
alternation  ranges  from  0.097  A  in  DP7  to  DP4,  up  to  0.109  A  in  DPI.  The  cor¬ 
responding  value  for  the  central  part  in  an  am  1 -optimized  30-carbon  long  trans - 
polyacetylene  segment  is  0.096  A  (1.347  and  1.443  A  for  the  double  and  single 
bonds,  respectively),  i.e.,  very  close  to  what  is  obtained  in  the  diphenylpolyenes. 
The  experimental  values  for  the  carbon-carbon  bonds  in  Jra^s-polyacetylene  are 
1.36  and  1.44  A,  i.e.,  a  bond-length  alternation  of  0.08  A  [52]. 

The  wave  functions  of  the  highest  occupied  and  lowest  unoccupied  molecular 
orbitals  (homo  and  LUMO,  respectively)  show  dominant  contributions  from  the 
polyene  part  of  the  molecules  in  the  case  of  neutral  DP3-DP7.  The  polyene-localized 
nature  of  these  orbitals  is  the  reason  why  diphenylpolyenes,  at  least  the  longer 
molecules,  are  suitable  as  models  for  /rarcs-polyenes  [7,53].  However,  in  DP2  and 
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Table  I.  amI  optimized  geometries  for  the  neutral  DP7-DP1  molecules;  the  bond  lengths  are  in  A 

(the  labels  are  those  from  Fig.  2). 


rCx — Cy 

DP7 

DP6 

DP5 

DP4 

DP3 

DP2 

DPI 

Cl—  C2 

1.396 

1.395 

1.396 

1.396 

1.396 

1.396 

1.396 

C2 — C3 

1.391 

1.392 

1.391 

1.391 

1.392 

1.392 

1.396 

C3 — C4 

1.406 

1.406 

1.406 

1.406 

1.406 

1.407 

1.407 

C4 — C5 

1.451 

1.451 

1.451 

1.451 

1.451 

1.451 

1.452 

C5 — C6 

1.346 

1.346 

1.346 

1.346 

1.346 

1.345 

1.343 

C6 — C7 

1.445 

1.445 

1.445 

1.445 

1.445 

1.447 

C7 — C8 

1.347 

1.347 

1.347 

1.347 

1.347 

C8 — C9 

1.444 

1.444 

1.444 

1.444 

C9— CIO 

1.347 

1.347 

1.347 

CIO— Cl  1 

1.444 

1.444 

ci  i —  or 

1.347 

DPI,  there  occur  significant  contributions  from  the  phenyl  groups  to  the  HOMO 
and  lumo  levels. 

Singly  Charged  Systems 

The  evolution  of  the  experimental  UPS  valence  band  spectra  upon  sodium  ex¬ 
posure  of  DP5-DP1  are  such  that  at  the  lowest  sodium  exposure  levels,  a  single 
broad  feature  grows  up  in  the  originally  forbidden  energy  gap;  upon  further  doping, 
i.e.,  beyond  about  one  sodium  atom  per  molecule,  a  second  feature  appears  [36,54] . 
Such  an  evolution  is  not  observed  in  DP7  and  DP6,  where  two  in-gap  features  are 
observed  even  for  the  lowest  doping  levels  [  7  ] .  These  data  are  interpreted  to  indicate: 
(i)  the  formation  of  singly  doped  species  at  low  doping  levels  in  the  cases  of  DP5- 
DP1,  the  low  doping  regime  being  therefore  simulated  in  the  calculations  by  adding 
a  single  electron  to  the  DP5-DP1  molecules;  (ii)  in  DP6  and  DP7,  even  low  sodium 
exposure  leads  to  two  alkali  atoms  per  molecule. 

In  Table  II,  we  display  the  AMI -optimized  structures  for  the  singly  charged 
DP5-DP1  molecules.  The  polyene  part  of  the  molecules  evolves  towards  an  almost 
undimerized  situation;  the  bond-length  difference  between  the  central  single-  and 
double-bonds  is  0.006,  0.008,  0.010,  0.017,  0.029  A  in  going  from  DP5  to  DPI. 
This  kind  of  geometry  evolution  in  the  singly  charged  DPx  molecules  can  be  referred 
to  as  corresponding  to  the  formation  of  a  polaron;  the  situation  is  actually  very 
reminiscent  of  that  occurring  in  Jra/is’-polyacetylene  [5,50]. 

The  HOMO  (singly  occupied)  and  HOMO-1  wave  functions  in  the  case  of  the 
singly  charged  DP5-DP1  are  mainly  localized  on  the  central  part  of  the  polyene. 
Upon  charging  the  molecules  with  one  electron,  the  lumo  of  the  neutral  molecules 
becomes  singly  occupied  and  moves  into  the  originally  forbidden  energy  gap,  form¬ 
ing  the  singly  occupied  homo  of  the  charged  molecules.  The  former  HOMO  of  the 
neutral  molecules  moves  upward  in  energy  forming  the  homo-1  level  in  the  charged 
molecules;  this  upward  motion  is  small  (for  instance,  0.29  eV  in  DP5),  which 
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Table  II.  am  I  optimized  geometries  for  the  singly  charged  DP5-DP 1  molecules;  the  bond  lengths  are 

in  A  (the  labels  are  those  from  Fig.  2). 


rCx — Cy 

DP5- 

DP4~ 

DP3“ 

DP2" 

DPU 

Cl— C2 

1.395 

1.395 

1.396 

1.396 

1.398 

C2 — C3 

1.390 

1.388 

1.387 

1.386 

1.383 

C3 — C4 

1.412 

1.413 

1.416 

1.420 

1.424 

C4 — C5 

1.436 

1.431 

1.424 

1.417 

1.412 

C5 — C6 

1.362 

1.369 

1.376 

1.382 

1.383 

C6 — C7 

1.412 

1.405 

1.399 

1.399 

C7 — C8 

1.381 

1.387 

1.389 

C8 — C9 

1.396 

1.395 

C9 — C9' 

1.390 

explains  that  in  the  ups  data,  a  single  in-gap  state  appears  in  the  polaron-like  sit¬ 
uations  [36,54]. 

It  is  important  to  note  at  this  stage  that  the  UPS  data  for  DP1-DP5  show  a 
continuous  motion  of  the  Fermi  level  away  from  the  valence  band  edge,  until  the 
saturated  doping  level  of  about  two  sodiums  per  molecule  is  reached.  This  is  in 
agreement  with  a  transformation  upon  doping  from  polaron-like  charged  species 
towards  soliton-antisoliton  pair  defects.  In  contrast,  in  the  case  of  DP7  and  DP6, 
the  Fermi  level  shifts  only  in  the  very  first  sodium  exposure  step  and  then  remains 
at  the  same  position  throughout  the  whole  doping  regime. 

Doubly  Charged  Systems 

The  doping  levels  in  the  saturation  regime  are  estimated  through  the  relative 
intensities  between  the  Na(  Is)  and  C(  Is)  XPS  core-level  spectra;  they  are  found 
to  be  about  two  sodiums  per  molecule  throughout  the  whole  series  of  diphenyl- 
polyenes  included  in  this  study.  The  experiments  also  indicate  the  sodiums  to  be 
ionic;  this  thus  corresponds  to  the  formation  of  doubly  charged  DP*  molecules 
(dianions).  The  amI  optimizations  for  the  doubly  charged  molecules  are  performed 
with  the  sodium  counter-ions  explicitly  included.  The  calculations  are  restricted  to 
conformations  where  the  DPx  molecules  are  taken  to  be  planar  and  the  counter¬ 
ions  are  introduced  above  the  molecular  plane;  these  simplifications  are  consistent 
with  the  results  of  x-ray  scattering  data  [55]  and  recent  high-level  ab  initio  calcu¬ 
lations  including  correlation  effects  [  56  ] . 

The  optimized  amI  geometries  are  shown  in  Table  III.  The  sodium  atoms  are 
located  symmetrically  around  the  center  of  the  molecules,  see  Figure  2.  A  total 
reversal  of  the  bond-length  alternation  pattern  is  observed  in  the  middle  of  the 
molecules  for  all  of  the  doubly  charged  systems;  these  can  therefore  be  described 
as  containing  a  confined  soliton-antisoliton  pair.  The  sodiums  are  centered  on  top 
of  the  third  carbons  from  the  phenyl  groups  (i.e.,  carbons  7  and  7'  in  Fig.  2)  in  the 
cases  of  DP7  and  DP6;  for  DP5-DP3,  the  sodiums  are  above  the  polyene  carbons 
labeled  5  and  5',  which  are  directly  connected  to  the  phenyl  groups;  finally,  for 
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Table  III.  AMl  optimized  geometries  (bond  lengths  in  A)  for  the  doubly  charged  DPx  molecules  with 
counter-ions  included.  The  bonds  are  labeled  following  the  structures  in  Fig.  2. 


rCx — Cy 


Cl — C2 
C2 — C3 
C3 — C4 
C4 — C5 
C5 — C6 
C6 — C7 
C7 — C8 
C8 — C9 
C9— CIO 
CIO— Cl  1 

cii— cir 

Na— Cx 


Na2DP7 


1.396 

1.389 

1.411 
1.435 
1.367 
1.400 

1.412 
1.360 
1.434 
1.350 
1.440 

2.663  (C7) 


Na2DP6 


1.396 

1.388 

1.413 

1.431 

1.375 

1.390 

1.419 

1.356 

1.438 

1.350 

2.728  (C7) 


Na2DP5 


1.396 

1.387 

1.419 

1.416 

1.402 

1.366 

1.432 

1.351 

1.439 


2.665  (C5) 


Na2DP4 


1.396 

1.386 

1.421 

1.412 

1.409 

1.362 

1.435 

1.351 


Na2DP3 


1.394 

1.386 

1.424 

1.404 

1.419 

1.357 

1.438 


Na2DP2 


1.398 

1.380 

1.441 

1.377 

1.430 

1.356 


2.770  (C4) 


Na2DPl 


1.404 

1.373 

1.450 

1.369 

1.429 


3.087  (C4) 


2.617  (C5)  2.596  (C5) 


DP2  and  DPI,  the  sodiums  are  situated  above  the  phenyl  groups  (above  carbons 
4  and  4'  and  on  top  of  the  center  of  the  phenyl  groups,  respectively).  The  sodium 
to  molecular  plane  distances  are  close  to  2.6  A  for  DP7  to  DP3  and  increases 
somewhat,  to  about  2.7  A,  when  the  sodiums  are  located  over  the  phenyl  groups 
as  in  DP2  and  DPI.  Ab  initio  calculations  including  electron  correlation  effects 
result  in  a  sodium  to  polyene  distance  of  2.4  A  [  56] ,  in  agreement  with  the  exper¬ 
imental  results  [  55  ] ;  thus,  the  sodium  to  polyene  distance  is  somewhat  overestimated 
with  the  semiempirical  am  1  method. 

The  homo  and  homo-1  wave  functions  (which  correspond  to  the  soliton- 
antisoliton  levels  in  the  doubly  charged  systems)  have  large  contributions  from  the 
polyene  part  of  the  molecules  and  are  centered  at  the  carbons  closest  to  the  sodium 
atoms  in  the  cases  of  DP7  to  DP3;  for  these  molecules,  there  are  only  very  small 
contributions  from  the  phenyl  groups.  However,  as  the  polyene  segment  shrinks 
in  size  and  the  defects  are  forced  towards  the  phenyl  groups,  the  contributions  from 
these  groups  increase  and  lead  to  significant  lcao  coefficients  on  the  phenyls  for 
DPI  (although  the  largest  coefficients  are  still  found  on  the  vinylene  moiety). 


Comparison  with  Experiment 

In  the  comparison  with  the  ups  experimental  data,  we  focus  on  the  low  binding 
energy  part  of  the  spectra,  i.e.,  the  region  covering  the  topmost  x-levels.  These  are 
the  7r-levels  which  are  mainly  involved  in  the  doping  process. 

In  Figure  3,  we  compare  the  densities  of  states  (DOS)  obtained  from  VEH  cal¬ 
culations  with  the  experimental  ups  He  I  (21.2  eV)  valence  bands  for  the  neutral 
DPx  molecules.  The  energy  scales  are  relative  to  the  vacuum  level.  The  energy 
levels  as  obtained  from  veh  are  indicated  by  bars  in  the  figure.  The  experimental 
valence  bands  and  the  calculated  DOS  are  in  excellent  agreement.  The  peaks  labeled 
A,  at  about  -8  eV  in  the  UPS  spectra,  have  their  most  significant  contributions 


METALS  AND  tt-CONJUGATED  POLYMERS 


489 


DPI 


DP2 


Figure  2.  Atom  labeling  for  the  diphenylpolyenes  and  optimized  positions  of  the  sodium 
counter-ions  included  in  the  case  of  doubly  charged  molecules. 


coming  from  the  phenyl  groups.  We  recall  that  the  degenerate  HOMO  orbitals  of 
benzene  split  upon  connection  to  the  polyene  segment.  One  of  these  orbitals  has  a 
node  at  the  carbon  site  attached  to  the  polyene,  resulting  in  orbitals  localized  within 
the  phenyl  groups  of  the  diphenylpolyenes.  Since  peak  A  is  dominated  by  such 
contributions,  this  peak  is  unaffected  by  the  length  of  the  polyene  segment.  The 
peaks  at  lower  binding  energies,  i.e.,  the  two  peaks  labeled  B  and  C  in  the  cases  of 
DP7-DP4  and  the  peak  labeled  C  for  DP3-DP1,  strongly  depend  on  the  polyene 
length  since  large  coefficients  are  present  on  the  polyene  part  of  the  molecules.  To 
point  out  the  quality  of  the  veh  simulation  of  the  UPS  data,  it  is  particularly  illus¬ 
trative  to  compare  the  evolutions  of  the  peak  originally  labeled  B  in  DP7 ,  see  Figure 
3.  As  the  polyene  segment  shortens,  peak  B  goes  up  in  binding  energy  to  form  a 
low-energy  shoulder  to  peak  A  in  DP4.  In  DP3,  it  appears  as  a  high-energy  side 
shoulder  and  in  DP2,  as  a  separate  peak  at  higher  binding  energy.  The  evolutions 
are  identical  on  both  experimental  and  theoretical  spectra. 

In  Figure  4,  we  compare  the  experimental  ups  valence  bands,  as  recorded  at 
saturation  doping  (i.e.,  at  about  two  sodiums  per  molecule)  with  the  VEH-calculated 
DOS.  Again,  the  general  features  in  the  experimental  and  theoretical  spectra  agree 
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Figure  3.  Comparison  between  the  low  binding  energy  part  of  the  VEH-calculated  densities 
of  states  and  the  experimental  UPS  valence  band  spectra  in  the  cases  of  the  neutral  DP7- 

DP1  molecules. 


very  well  throughout  the  series.  The  most  important  result  is  the  appearance  of  two 
new  states,  labeled  D  and  E,  within  the  originally  forbidden  bandgap.  The  splitting 
between  the  two  gap  states  depends  on  the  polyene  length;  as  the  polyene  segment 
shortens,  the  splitting  increases  due  to  the  increased  overlap  between  the  soliton- 
antisoliton  wave  functions.  The  theoretical  VEH  and  experimental  splittings  between 
the  D  and  E  peaks  are  summarized  in  Table  IV.  The  experimental  data  and  the 
results  from  the  veh  calculations  agree  extremely  well  (within  the  experimental 
error  of  about  0.1  eV)  for  DP7  to  DP3.  In  the  cases  of  DP2  and  DPI,  however, 
the  veh  splitting  is  overestimated  by  about  0.3  eV. 

We  note  that  the  results  for  the  longer  diphenylpolyenes  are  fully  consistent  with 
those  of  a  recent  study  on  the  p  -doping  of  /3-carotene,  a  molecule  which  contains 
11  carbon-carbon  double  bonds  along  the  conjugated  backbone  [24].  ESR  and 
optical  absorption  data  indicate  the  formation  of  spinless  dications,  which  are  de¬ 
scribed  as  containing  soliton-antisoliton  pairs. 
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Figure  4.  Comparison  between  the  low  binding  energy  part  of  the  VEH-calculated  densities 
of  states  and  the  experimental  UPS  valence  band  spectra  in  the  cases  of  the  doubly  charged 

DP7-DP1  molecules. 


Aluminum  Interacting  with  Poly(/>-phenylenevinylene)  and  Derivatives 

We  are  able  to  find  many  different  stable  configurations  at  the  AMl  level  for  the 
aluminum/ PPV  systems  investigated,  depending  on  the  initial  positions  of  the  alu¬ 
minum  atoms.  We  have  considered  pairs  of  aluminum  atoms  introduced  either  on 
the  same  side  or  on  opposite  sides  of  the  molecular  plane  of  the  oligomers  and 

Table  IV.  Experimental  ups  and  veh  values  of  the  energy  difference  (in  eV)  between  the  two  highest 
occupied  levels  in  doubly  charged  DP7-DP1. 


DP7  DP6  DPS  DP4  DP3  DP2  DPI 
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located  either  on  the  same  unit  or  on  separate  units.  A  general  result  is  that  the 
most  energetically  stable  structures  are  found  when  the  two  aluminum  atoms  interact 
with  the  same  chemical  moiety,  i.e.,  the  same  vinylene  group  or  the  same  phenylene 
group.  We  now  discuss  the  most  representative  (most  stable)  configurations  for 
each  system. 


The  Al2/poly( p-phenylenevinylene)  System 

In  Figure  5,  we  display  the  most  energetically  favorable  structures  of  the  Al2/ 
poly  (p-phenylenevinylene)  complex  in  the  cases  where:  (a)  two  aluminum  atoms 
interact  with  a  vinylene  group;  and  (b)  two  aluminum  atoms  interact  with  a  phen¬ 
ylene  ring.  Only  the  part  mostly  affected  by  the  interaction  with  the  aluminum 
atoms  is  shown,  i.e.,  the  central  part  of  the  molecules  [as  indicated  by  the  dashed 
parentheses  in  Fig.  1(b)].  In  the  same  manner  as  what  is  found  for  polyacetylene 
[31]  or  polythiophene  [57],  each  aluminum  atom  tends  to  form  a  covalent  bond 
with  a  carbon  atom. 

The  most  stable  configuration  for  the  A12/ppv  complex  is  when  the  two  aluminum 
atoms  are  bonded  to  a  single  vinylene  group.  The  energy  difference  between  struc¬ 
tures  (a)  and  (b)  is  about  16.4  kcal/mol  in  favor  of  structure  (a);  this  can  be 
explained  by  the  loss  of  aromaticity  of  the  phenylene  ring  in  case  (b).  This  result 
is  in  qualitative  agreement  with  a  recent  ab  initio  modeling  of  the  Al2/poly(p- 
phenylenevinylene)  system,  using  a  stilbene  molecule  as  a  model  for  ppv  [51].  In 
addition,  recent  xps  and  ups  studies  of  the  initial  stages  of  interface  formation 
between  aluminum  and  ppv  indicate  [through  the  changes  in  the  UPS  valence  band 


Figure  5.  Illustration  of  two  of  the  most  energetically  favorable  am  1 -optimized  structures 
of  the  Al2/poIy(/7-phenylenevinylene)  complex.  Only  the  central  part  of  the  molecule 
[indicated  by  the  dashed  parenthesis  in  Fig.  1(b)]  is  shown. 
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and  in  the  shake-up  structure  of  the  XPS  C(ls)  peak]  that  the  aluminum  atoms 
preferentially  interact  with  the  vinylene  moieties  [58]. 

The  modifications  induced  by  the  aluminum  atoms  are  mainly  localized  to  the 
units  to  which  the  aluminum  atoms  are  bonded.  When  the  aluminum  atoms  are 
attached  to  a  vinylene  group,  all  the  bonds  associated  with  this  moiety  elongate 
(the  double  bond,  r6  —  7,  increases  from  about  1 .34  A  to  1 .54  A  (pure  single  bond) 
and  the  single-like  bonds,  r5  — 6  and  r7~  8,  elongate  to  about  1.48  A  and  1.49 
A,  respectively)  and  the  7r-conjugation  is  interrupted.  In  the  case  when  the  two 
aluminum  atoms  interacts  with  a  phenylene  ring  [structure  (b)  in  Fig.  5],  strong 
modifications  occur  and  the  planarity  of  the  ring  is  lost;  torsion  angles  of  about 
30  °  appear  around  bonds  r9  —  1 0  and  r8  —  9 The  aromatic  character  of  the  phen¬ 
ylene  ring  disappears  and  a  dienic  structure  is  formed  (bonds  r8  9  and  rlO'  1 1 
decreasing  down  to  1 .36  A  from  about  1 .4 1  A  while  the  other  bonds  in  the  phenylene 
ring  elongate  to  1.48  A  or  more). 

The  changes  in  Mulliken  atomic  charges  as  a  result  of  the  interaction  with  alu¬ 
minum  show  a  charge  transfer  of  about  0.47  |  e  |  from  each  aluminum  atom  to  the 
conjugated  molecule  in  both  systems.  The  additional  charge  on  the  molecule  is 
mainly  accommodated  on  the  carbons  to  which  the  aluminum  atoms  are  attached; 
the  net  atomic  charge  on  carbons  6  and  7  in  structure  (a)  increases  by  about  0.42 
| e | ,  that  on  carbons  9'  and  10  in  structure  (b)  by  about  0.35  \e\.  The  highest 
occupied  molecular  orbitals  are  localized  in  character;  the  HOMO  and  HOMO-1 
levels  are  almost  totally  confined  to  the  aluminum  atoms  and  the  carbons  within 
the  moieties  the  aluminums  are  attached  to. 

The  Al2/poly(2,5-dimethoxy-p-phenylenevinylene)  System 

Some  of  the  stable  structures  of  the  Al2/DMeOPPV  complex  are  shown  in  Figure 
6.  When  two  aluminum  atoms  are  introduced  in  the  vicinity  of  a  phenylene  ring, 
many  different  stable  configurations  are  obtained,  of  which  several  are  comparable 
in  stability.  We  display  some  of  the  most  stable  cases  involving:  in  structure  (b), 
only  phenylene  ring  carbons;  in  (c),  a  multiple-bond  structure;  and  in  (d),  the 
formation  of  an  A1  —  O — C  complex.  The  most  stable  configuration  is,  as  in  the 
case  of  A12/ppv,  a  situation  where  the  two  aluminum  atoms  interact  with  a  single 
vinylene  group;  structure  (a)  is  more  stable  than  systems  (b) ,  (c),  and  (d)  by  about 
2 1 .4, 20.2,  and  55.5  kcal /mol,  respectively.  These  results  indicate  that  the  interaction 
with  oxygen  is  not  favored  in  this  system. 

The  aluminum  atoms  induces  modifications  in  structures  (a)  and  (b)  that  are 
very  similar  to  those  obtained  for  the  corresponding  situations  for  A12/ppv  .  In  the 
case  of  structure  (c),  one  of  the  aluminum  atoms  interacts  with  both  a  vinylene 
and  a  phenylene  moiety;  the  shortest  Al— C  distance,  about  1.88  A,  is  found  for 
the  bond  to  C9  of  the  phenylene  ring.  We  note  that  such  an  am  1 -calculated  Al  C 
bond  length  might  be  on  the  short  side,  since  high-level  ab  initio  calculations  in¬ 
cluding  the  effects  of  electron  correlation  indicate  Al  C  bond  lengths  on  the  order 
of  1.95  A  [59,60].  The  Al  — C  distances  involving  carbons  6,  7,  and  8  are  in  the 
range  2.32-2.55  A.  The  second  aluminum  atom  is  positioned  above  the  phenylene 
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ring,  with  A1 — C  distances  ranging  between  2.50  A  for  A12  CIO'  and  2.82  A  for 
A12  —  C9.  The  planarity  of  the  phenylene  ring  is  lost,  with  the  largest  torsion  angle, 
about  19.6°,  occurring  around  bond  C8  — C9'.  Finally,  in  structure  (d),  the  two 
aluminum  atoms  forms  A1  —  O — C  complexes.  As  in  structure  (b),  the  phenylene 
ring  acquires  a  dienic  structure,  with  bonds  C8  —  9  and  CIO'  11  shortening  to 
1.36  A.  The  planarity  of  the  phenylene  ring  is  also  lost  with  torsion  angles  around 
the  single  bonds  of  about  16-17°.  The  Al— O  bonds  have  lengths  of  about  1.80 
A  and  the  Al  —  C  bonds,  about  2.26  A. 

The  changes  in  Mulliken  atomic  charges  indicate  a  charge  transfer  from  each 
aluminum  atom  to  the  conjugated  molecule  of  about  0.4-0.5  |e|  in  systems  (a), 
(b),  and  (d).  As  in  the  case  of  the  A12/ppv  system,  the  additional  charge  is  mainly 
accommodated  on  the  carbons  to  which  the  aluminum  atoms  are  attached;  the  net 
atomic  charge  on  these  carbons  increases  by  about  0. 3-0.4  |e|  for  structures  (a) 
and  (b)  and  0.26  | e |  for  structure  (d).  In  structure  (c),  about  0.16  |e|  is  transferred 
from  All,  while  about  0.54  |e|  is  transferred  from  A12.  The  multiple-bonding 
character  of  structure  (c)  is  reflected  in  the  charge  transfer  which  involves  several 
carbons;  the  largest  increases  in  net  atomic  charges  are  seen  on  carbons  C6,  C8, 
C9,  and  Cl  1. 

The  presence  of  the  methoxy  substituents  in  DMeOPPV  allows  for  stable  config¬ 
urations  involving  Al  — O — C  complex  formation.  These  structures,  however,  are 
less  stable  than  those  where  aluminums  bind  to  carbons  on  the  phenylene  and/ or 
vinylene  moieties.  As  in  the  case  of  unsubstituted  PPV,  the  most  stable  structure  is 
found  when  the  two  aluminum  atoms  are  situated  on  a  single  vinylene  group,  i.e., 
structure  (a).  This  structure  is  about  20  kcal/mol  more  stable  than  any  configuration 
involving  the  phenylene  carbons.  We  can  thus  conclude  that  the  aluminum  atoms 
preferentially  interact  with  the  vinylene  groups  in  the  initial  stages  of  interface 
formation  with  DMeOPPV. 

The  Al2/poly(2,5-dialdehyde-p-phenylenevinylene)  System 

Stable  configurations  for  the  A12/dappv  complex  are  illustrated  in  Figure  7. 
Structures  (a)  and  (b)  are  again  nearly  identical  to  those  found  in  the  A12/PPV 
complexes.  The  major  difference  here,  however,  is  that  there  occur  configurations 
involving  interactions  between  the  aluminums  and  the  carbonyl  groups  which  are 
as  stable  as  structure  (a).  These  are  illustrated  in  Figure  7  as  structures  (c)  and 
(d),  which  are  1 .4  kcal/mol  more  stable  and  2.0  kcal/mol  less  stable  than  structure 
(a),  respectively.  Structure  (c)  is  actually  similar  to  structure  (a),  one  aluminum 
being  however  linked  to  both  the  vinylene  moiety  and  a  carbonyl  group.  Structure 
(d)  is  reminiscent  of  structure  (c)  in  the  Al2/MeOPPV  complex;  it  is  a  configuration 
where  one  aluminum  has  multiple  bonding  to  a  ring,  the  other  being  attached  to 
the  oxygen  of  a  carbonyl  and  the  nearly  vinylene  moiety.  We  note  that  the  high 
reactivity  between  aluminum  atoms  and  carbonyl  groups  indicated  here  is  consistent 
with  the  results  from  both  experimental  studies  (e.g.,  polycaprolactone  [61],  poly- 
ethyleneterephthalate  [62,63],  polyacrylic  acid  [64],  or  polyimide  [65-67])  and 
theoretical  studies  (e.g.,  polyimide  [68]  and  polymethylmethacrylate  [69])  of  alu¬ 
minum  interacting  with  C  =  0  containing  polymers. 
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The  modifications  induced  by  the  aluminum  atoms  in  structures  (a)  and  (b)  are 
very  similar  to  those  obtained  for  the  corresponding  situations  in  A12/PPV.  In  the 
case  of  structure  (c),  one  C=0  group  is  involved  in  the  complex.  The  shortest 
bond  lengths  involving  aluminum  atoms  can  be  found  for  A1 1  —  C6  and  A12  —  04 
(about  1.82  and  1.76  A,  respectively);  the  A12  —  Cl  distance  is  about  2.3  A.  This 
is  reflected  in  the  carbon-carbon  backbone  around  the  vinylene  group;  the  double 
bond  character  of  the  C6  —  Cl  bond  is  lost  (the  bond  increases  from  1.34-1.50 
A);  a  small  decrease  is  found  for  the  Cl  —  C8  bond  length,  from  1.45  to  1.43  A, 
while  the  C5  —  C6  bond  elongates  from  1 .45  to  1 .48  A.  In  structure  (d),  A12  forms 
a  complex  with  Cl  and  04;  the  A12  —  Cl  distance  is  long,  about  2.5  A,  which  leads 
to  only  small  modifications  in  the  vinylene  bridge;  the  A12  —  04  distance  is  ca. 
1.72  A.  The  04  —  C16  bond  elongates,  from  1.24  to  1.35  A,  while  the  C16  —  C9' 
bond  goes  from  1.48  A  down  to  1.37  A.  All  is  positioned  above  the  phenylene 
ring,  with  the  shortest  A1  —  C  bond,  All  —  CIO,  about  2.39  A  long;  the  other  A1 1  C 
bonds  are  about  2.5  A  long;  the  phenylene  acquires  a  quinoid  character.  The  A1 — O 
bond  lengths  in  these  structures  [1.76  A  in  structure  (c)  and  1.72  A  in  structure 
(d)]  are  close  to  those  obtained  in  other  theoretical  studies  of  the  interaction  between 
aluminum  and  carbonyl  containing  molecules:  the  ab  initio  local  spin  density,  LSD, 
modeling  of  the  Al-polyimide  interaction  results  in  an  A1  —  O  bond  length  of  1.80 
A  [68]  and  an  MNDO  study  of  aluminum  interacting  with  polymethylmethacrylate 
gives  a  bond  length  of  about  1.6  A  [69]. 

The  changes  in  Mulliken  atomic  charges  in  the  cases  where  there  are  bonds  only 
between  aluminum  and  carbons,  i.e.,  structures  (a)  and  (b),  show  that  about  0.4- 
0.5  |  e  |  is  transferred  from  each  aluminum  atom.  In  the  structures  involving  oxygen 
in  the  bonding,  structures  (c)  and  (d),  the  charge  transfer  from  the  aluminum 
atom  connected  to  an  oxygen  atom  is,  as  expected,  lower  than  that  for  the  aluminum 
atom  connected  to  a  carbon  atom.  The  increase  in  net  atomic  charge  on  the  oxygen 
atom  closest  to  an  aluminum  atom  is  about  0.034  |e|  in  structure  (c)  and  about 
0. 164  |  e  |  in  structure  (d).  Noteworthy  is  the  increase  in  net  atomic  charge  on  the 
carbonyl  carbon  C16  in  structure  (d),  by  ca.  0.20  |e| . 

The  replacement  of  the  methoxy  groups  along  the  DMeOPPV  oligomer  by  the 
carbonyl  (acceptor)  groups  allows  for  the  formation  of  new  stable  structures,  which 
involve  the  appearance  of  A1  —  O  —  C  bonds  and  are  comparable  in  stability  with 
the  otherwise  most  stable  structure,  i.e.,  that  where  the  two  aluminum  atoms  interact 
with  a  single  vinylene  moiety.  The  most  stable  of  these  new  structures  is  found 
when  the  two  aluminum  atoms  forms  a  complex  with  a  vinylene  moiety  and  an 
oxygen  atom  of  a  carbonyl  group,  as  in  structure  (c).  These  results  indicate  that  if 
oxygen  is  present  in  the  form  of  C  — O  groups  on  the  ppv  surface,  the  vinylene 
unit  no  longer  constitutes  the  single  most  favorable  site  of  bonding  for  aluminum 
atoms  at  the  initial  stage  of  interface  formation.  Since  the  oxygen  in  the  carbonyl 
groups  are  directly  bonded  to  an  aluminum  atom  in  the  A1  —  O  —  C  complexes, 
we  expect  this  to  be  true  even  for  other  positions  of  the  carbonyl  groups. 

Synopsis 

We  have  applied  the  semiempirical  Austin  Model  1  method  for  the  investigation 
of  the  interaction  between  metals  and  conjugated  molecules.  Two  systems  repre- 
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senting  two  different  situations  have  been  presented:  (i)  the  interaction  between 
sodium  and  a  series  of  diphenylpolyenes  where  charge  transfer  doping  (reduction) 
occurs;  and  (ii)  the  interaction  between  aluminum  and  ppv  and  derivatives  where 
covalent  bonding  takes  place. 

We  find  that  extra  electronic  charges,  induced  by  doping,  are  stored  in  polaron- 
like  charge  storage  states  for  DP5  to  DP  1  at  low  doping  levels,  with  a  transformation 
to  soliton-antisoliton-like  charge  storage  states  upon  increasing  the  doping  towards 
saturation,  i.e.,  a  doping  level  of  two  charges  per  molecule.  The  scenario  for  DP7 
and  DP6  is  such  that  the  doubly  charged  molecules  are  formed  even  at  very  low 
doping,  and  thus  the  extra  charges  are  accommodated  in  soliton-antisoliton-like 
charge  storage  states  throughout  the  whole  doping  range. 

In  the  case  of  the  A12/ppv  system,  the  most  stable  structure  is  found  to  correspond 
to  a  configuration  where  the  two  aluminum  atoms  bind  to  a  single  vinylene  moiety, 
in  agreement  with  recent  experimental  photoelectron  spectroscopy  studies  of  the 
initial  stages  of  interface  formation  between  aluminum  and  ppv  [  58  ] .  The  presence 
of  methoxy  substituents  in  the  Al2/DMeOPPV  system  allows  for  stable  structures 
involving  A1  —  O — C  bond  formation;  however,  these  structures  are  here  much 
less  stable  than  those  involving  carbons  on  the  phenylene  rings  and/or  vinylene 
moieties.  As  in  the  case  of  A12/ppv,  the  most  stable  structure  for  the  Al2/DMeOPPV 
system  is  calculated  to  be  that  where  the  two  aluminum  atoms  interact  with  a  single 
vinylene  unit. 

New  stable  structures,  containing  A1  —  O  —  C  complexes,  are  found  when  C=0 
groups  are  present  on  the  side  of  the  ppv  chains.  Some  of  these  new  stable  structures 
are  comparable  in  stability  with  the  case  where  the  two  aluminum  atoms  interact 
with  a  single  vinylene  moiety.  The  most  stable  structure  calculated  for  this  system 
involves  both  a  vinylene  unit  and  the  adjacent  carbonyl  group. 
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Abstract 

A  systematic  mathematically  based  approach  is  made  to  categorize  the  different  stereochemical  pos¬ 
sibilities  for  finite  carbon  clusters  comprised  of  fully  bonded  smooth  nets  of  carbon  atoms  all  of  which 
are  trigonally  hybridized.  An  exceptional  richness  of  ideas  and  theorems  from  topology,  differential 
geometry,  and  graph  theory  are  found  to  be  relevant  and  to  govern  not  only  the  conventional  chemical 
graph-theoretic  molecular  interconnection  patterns  but  also  topo-geometric  features  of  the  stereochemically 
conceivable  structures  and  their  associated  strain.  The  usually  considered  fullerenes  (with  or  without 
isolated  pentagonal  rings)  form  but  a  fraction  of  the  overall  possibilities,  some  of  which  e.g.,  do  not 
require  pentagonal  rings  (or  other  “small”  rings).  But  too  there  emerges  from  the  theory  certain  special 
structures  which  turn  out  often  to  correspond  to  experimentally  observed  fullerenes.  ©  1994  John  Wiley 
&  Sons,  Inc. 


Background 

The  theory  of  isomerism  has  long  been  an  issue  of  principal  interest  in  chemistry 
for  well  over  a  century.  In  the  last  few  decades  “quantum  chemistry”  has  developed 
as  an  important  field  to  compute  relative  stabilities  of  different  isomeric  structures — 
each  treated  as  an  individual  (well-pronounced)  minimum  in  a  many-dimensional 
potential-energy  hypersurface.  But  a  formalized  classical  approach  remains  of  crucial 
value  for  a  unified  view — to  indicate  possibilities  and  to  provide  a  framework  for 
thought  and  discussion.  The  elegant  classically  based  mathematical  ideas  developed 
(e.g.)  by  Polya  [1]  and  now  much  elaborated  [2]  focus  almost  entirely  on  “substi¬ 
tutional”  isomerism.  In  distinct  contrast  is  isomerism  involving  a  single  chemical 
element.  In  this  regard,  because  of  the  recognized  singular  ability  of  carbon  to  form 
numerous  well-characterized  structures  (and  thereby  give  rise  to  the  field  of  organic 
chemistry),  the  isomerism  of  elemental  carbon  clusters  should  be  of  key  interest. 
Indeed  this  is  an  area  where  traditional  stereochemistry  [  3  ]  also  turns  out  to  focus 
more  on  “locally”  defined  features,  e.g.,  substitutional  isomerism  or  chiral  centers. 

Here  then  a  mathematically  formalized  classical  approach  to  elemental  carbon 
isomers  is  indicated,  especially  for  the  case  where  every  atom  is  in  the  same  valence 
state.  In  fact  ( somewhat  “brief”)  suggestions  of  possibilities  for  such  isomers  dates 
back  some  time  [4,5],  though  focus  was  on  just  a  couple  possibilities:  C20  and  C60 
icosohedral-symmetry  cages.  Further,  interest  was  immensely  boosted:  first  in  1985 
with  the  laser-ablation  molecular-beam  experimental  realization  [6]  of  C6o  clusters 
assigned  to  the  “uniquely  elegant”  truncated-icosohedral  structure  of  Figure  1;  and 
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Figure  1 .  The  truncated-icosahedron  structure,  wherein  the  double  bonds  may  be  dis¬ 
tributed  in  numerous  ways  ( 12500  in  number),  one  of  which  makes  double  all  the  bonds 

not  in  a  pentagon. 


second  in  1990  with  the  experimental  isolation  [7]  and  characterization  of  mac¬ 
roscopic  quantities  of  this  novel  species.  Also  now  experimentally  studied  are  C70, 
C76,  C7g,  Cg4  species  [8-11]  (among  others),  all  evidently  polyhedral  cage  structures 
such  as  in  Figure  2  involving  similar  local  ring  structures  but  exhibiting  lower 
symmetries.  On  the  theoretical  side  electronic-structure  calculations  have  now  been 
made  at  widely  varying  levels  of  computational  sophistication,  usually  on  one  or 
two  (repeatedly  investigated )  structures.  Within  the  realm  of  purely  sp2-hybridized 
classical  polyhedral  structures  focus  has  been  on  those  involving  just  5-  and  6- 
membered  rings:  all  higher-symmetry  such  C6o  structures  were  identified  [12];  the 
class  of  icosohedral-symmetry  such  carbon  species  of  any  size  was  studied  [13,14] ; 
higher-symmetry  such  cages  of  different  sizes  were  systematically  considered  [15,16]; 
and  finally  attempts  were  made  [17,18]  (and  even  realized  [19])  to  generate  all 
possible  such  cages  of  arbitrary  symmetries  up  to  a  given  size. 

Besides  these  polyhedral  cages  with  carbon  atoms  of  a  single  (sp2)  valence  state 
there  remain  numerous  other  classical  possibilities.  Other  ring  sizes  may  be  con¬ 
sidered  [15,20]  and  also  there  are  toroidal  possibilities  [15,21,22]  for  structures. 
Beyond  these  possibilities  one  could  consider  the  general  problem  of  identifying 
all  possible  connected  (so-called  regular)  graphs  where  every  vertex  is  of  the  same 
appropriate  degree.  Indeed  Balaban  and  co workers  [23]  have  repeatedly  addressed 


Figure  2.  Examples  of  C70,  C76,  C78,  and  C84  polyhedral  cages,  wherein  in  each  case 
double  bonds  may  be  distributed  in  numerous  ways  so  that  no  unpaired  sites  remain. 
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such  a  graph-theoretic  problem  in  reference  to  the  valence  isomers  of  the  annulenes. 
But  for  our  present  problem  this  would  too  easily  overlook  the  preferences  dictated 
by  geometric  strain,  as  is  known  [24  ]  to  be  important — and  further  such  an  approach 
would  miss  some  interesting  stereochemical  possibilities.  Here  we  seek  a  global 
accounting  of  both  strain  effects  and  novel  stereoisomeric  possibilities  within  a 
classical  view  by  considering  embeddings  of  molecular  graphs  in  various  surfaces 
which  in  turn  are  to  be  embedded  in  Euclidean  3-space.  Such  an  approach  then 
entails  the  collection  of  some  graph-theoretic,  topological,  and  differential-geometric 
ideas  and  theorematic  results  so  that  they  can  be  merged  in  a  way  useful  for  the 
stereochemical  problems  at  hand.  This  then  sets  the  bulk  of  fullerene  research  in 
a  larger  context  with  many  interesting  little  explored  extensions. 

Structural  Focus 

Just  what  might  be  the  range  of  conceivable  carbon  clusters? 

First,  in  flames  and  molecular  beams  a  number  of  presumably  transient  species 
have  been  detected  and  studied,  even  theoretically.  See,  e.g.,  the  review  by  Weltner 
and  Van  Zee  [25  ] .  Such  species  of  fewer  than  about  10  carbon  atoms  are  however 
typically  radicals  or  polyradicals.  Further  open  chains,  as  well  as  fragments  cut 
from  the  graphite  or  diamond  lattice,  should  be  radicaloid,  since  classically  all 
entail  dangling  bonds  at  the  boundary.  Here  we  focus  on  other  possibilities  without 
“dangling  bonds.” 

Second,  there  are  a  great  number  of  possible  classical  chemical-bonding  structures, 
such  as  in  Figures  3  or  4.  The  species  in  Figure  4  are  viewed  to  be  based  on  sets  of 
cages  as  of  Figure  1 ,  but  now  with  pairs  or  quartets  of  carbon  atoms  taken  to  be 
tetrahedrally  hybridized,  each  such  atom  with  one  bond  external  to  the  cage  con¬ 
necting  to  a  similar  atom  in  another  cage.  But  all  these  species  entail  atoms  of 


Figure  3.  Examples  of  classically  allowed  (more-or-less  planar)  elemental-carbon  mo¬ 
lecular  “clusters”  wherein  the  tetravalence  of  each  carbon  atom  is  satisfied. 
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Figure  4.  Examples  of  conceivable  “multi-cage”  elemental  carbon  clusters.  The  structure 
within  the  cages  is  not  detailed,  and  in  the  second  and  more  so  in  the  third  example  cage 
sizes  have  been  artifactually  diminished  while  retaining  cage  separation  so  that  the  inter¬ 
cage  bonds  may  be  clearly  discerned. 


different  hybridizations  (while  also  there  is  considerable  strain  for  the  species  in 
Fig.  3 ),  and  here  we  choose  to  focus  on  species  with  but  a  single  type  of  hybridization. 

Third,  with  but  a  single  hybridization  there  are  a  variety  of  possibilities.  For 
exclusive  sp-hybridization  the  situation  seems  perhaps  the  most  clearly  resolved: 
all  classical  structures  are  just  cycles,  so  that  there  are  no  constitutional  isomers 
among  these  species,  though  upon  embedding  in  Euclidean  3-space  <£  3  these  cycles 
may  be  knotted  in  many  conceivable  ways.  For  solely  sp3-hybridization  each  site 
is  to  be  bonded  to  four  others,  such  as  is  the  case  for  the  graphs  of  Figure  5— -but 
in  all  known  cases  there  seems  to  be  excessive  strain  upon  embedding  in  & 3 ,  with 
constraints:  that  bonds  radiating  from  each  atom  should  be  at  an  angle  « 109°  to 
the  others,  that  these  bonds  are  not  too  bent,  and  that  they  are  of  similar  lengths. 
Evidently  the  remaining  case  where  the  sole  type  of  hybridization  is  sp2  is  the  case 
of  primary  interest — there  being  examples  of  structures  without  inordinate  strain 
already  noted  in  Figures  1  and  2. 

The  case  of  entirely  sp2-hybridized  classically  bonded  structures  then  is  the  area 
(of  natural  interest)  pursued  here.  In  addition,  to  the  polyhedral  and  torihedral 
structures  considered  earlier  (as  noted  in  the  introduction)  there  are  many  other 
possibilities  even  if  we  avoid  overstrained  structures  (such  as  indicated  in  the  pre¬ 
vious  paragraph  involving  sp3  carbons).  For  instance,  one  may  consider  the  manner 
of  interconnecting  three  graphitic  sheets  as  indicated  in  Figure  6(a),  then  imagine 
the  sheets  bent  around  as  indicated  in  6(b),  or  any  one  of  numerous  other  con- 


Figure  5.  Two  (mathematically  well-known)  graphs  with  each  site  connected  to  four 

other  sites. 
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Figure  6.  In  ( a)  a  local  way  of  interconnecting  sp2-carbons  is  indicated  so  that  an  overall 
entirely  sp2-hybridized  structure  as  in  (b)  or  (c)  can  be  constructed.  Note  that  the  atoms 
along  the  dotted  lines  have  their  trigonal  hybridization  axes  orthogonal  to  the  neighboring 
atoms  to  which  they  are  bonded. 


ceivable  possibilities.  Here  there  are  adjacent  sp2-atoms  with  dramatically  (near 
orthogonal)  different  orientations,  so  that  range  of  conjugation  is  limited.  Indeed 
these  carbons  along  the  dotted  lines  in  6(a)  and  (b)  tend  to  leave  their  ^-electrons 
unpaired,  so  that  such  species  should  have  an  (unstable)  polyradicaloid  character. 
Of  course  arranging  two  such  dotted-line  arrays  of  carbon  atoms  to  be  immediately 
adjacent  as  in  6(c),  atoms  along  the  two  dotted  lines  may  pair  to  one  another, 
thereby  avoiding  radicaloid  character.  Nevertheless  here  as  a  beginning  we  consider 
situations  where  not  only  is  there  uniformity  in  hybridization  but  also  in  local 
orientation  of  the  hybridization  axes  (as  is  equivalent  to  requiring  that  there  is  no 
boundary  for  the  conjugation).  Hopefully  some  of  the  ideas  elucidated  in  this  case 
will  carry  back  to  more  general  circumstances  such  as  mentioned  in  this  paragraph. 

Closed  Surfaces 

The  formal  construction  of  feasible  tri-a-valent  (and  uni-ir-valent )  carbon  struc¬ 
tures  should  take  into  account  in  some  fashion  the  manner  of  embedding  into 


506 


KLEIN  AND  LIU 


Figure  7.  In  ( a)  the  result  of  adding  one  handle  to  a  sphere — this  result  being  homotopic 

to  the  torus  of  (b). 


Euclidean  3-space  <£3.  Since  such  sp2-hybridized  carbon  atoms  generate  locally 
planar  structures,  it  is  natural  to  assume  a  constraint  that  the  molecular  graphs  are 
to  be  drawn  in  such  ( usually  smooth  but  always  sufficiently  well-behaved  to  at  least 
be  compact)  2-manifolds  (or  surfaces)  without  intersecting  bonds.  Since  there  are 
to  be  no  dangling  bonds  and  no  other  (different- valent)  atoms,  it  is  natural  to 
constrain  the  surfaces  to  be  closed,  i.e.,  without  edges  where  dangling  bonds 
would  occur. 

Thence  the  topology  [26,27  ]  and  differential  geometry  [28,29]  of  closed  surfaces 
should  be  of  relevance.  There  are  two  fundamental  types  of  closed  surfaces:  ori- 
entable  ones  with  two  sides  and  nonorientable  ones  with  a  single  side.  Examples  of 
orientable  surfaces  S  include  the  sphere  and  the  torus,  and  they  may  be  characterized 
in  terms  of  their  genus  y(S),  which  is  the  number  of  “handles”  added  to  a  sphere 
to  obtain  a  surface  homotopic  (i.e.,  topologically  equivalent)  to  S  (see  Fig.  7).  A 
common  example  of  a  one-sided  surface  is  the  Mobius  strip,  though  it  has  an  edge, 
and  so  is  not  closed.  A  nonorientable  closed  S  may  be  characterized  in  terms  of  its 
cross-cap  number  n(S),  which  is  the  number  of  cross-caps  added  to  a  sphere  to 
obtain  a  surface  homotopic  to  S  (see  Fig.  8).  The  fundamental  result  governing 
closed  surfaces  is: 

The  Dyck — Dehn — Heegard  Theorem — Every  closed  surface  S  falls  into  one  of 
the  two  categories:  either  S  is  orientable  with  n(S)  -  0  and  y(S)  >  0,  or  else  £  is 
nonorientable  with  n(S)  >  0  and  y(*S)  =  0. 


Figure  8.  A  sphere  in  which  a  cut  has  been  made,  with  “orientations”  indicated  on  the 
two  halves  of  the  cut.  Rejoining  the  two  cut  edges  such  that  the  two  arrows  match  head- 
to-tail  and  tail-to-head  recovers  the  sphere;  however,  rejoining  the  two  edges  such  that  the 
two  arrows  match  head-to-head  and  tail-to-tail  yields  a  cross-cap. 
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The  counts  n(S )  and  y(S )  so  given  are  homotopic  invariants.  It  is  an  immediate 
implication  of  this  theorem  that  the  simultaneous  addition  of  cross-caps  and  handles 
to  a  sphere  results  in  something  already  identified  in  the  theorem— in  fact,  when 
a  cross-cap  is  present  any  handle  also  present  turns  out  to  be  homotopically  re¬ 
placeable  by  two  cross-caps.  With  only  handles  or  cross-caps,  the  Euler  characteristic 
is  often  introduced  a s 


2  -  27 (S),  n(S)  =  0 
2  -n(S),  n(S)±  0 


(3.1) 


and  turns  out  to  be  perhaps  the  most  fundamental  topological  invariant  for  a  closed 
surface. 

Next  the  closed  surface  S  is  to  be  embedded  in  Euclidean  3-space  £3.  A  first 
fundamental  theorem  here  is: 

The  J ordan- Alexander  Theorem— If  a  closed  surface  S  is  embedded  without  self¬ 
intersection  in  £3,  then  S  must  be  orientable,  in  which  case  when  the  points  of  S 
are  removed  from  £3  the  remnant  falls  into  two  individually  connected  regions 
disconnected  from  one  another. 

Henceforth  we  consider  only  oriented  S  as  candidates  for  constructing  feasible 
carbon  isomers,  though  if  the  assumption  of  atoms  of  a  single  valence-state  were 
dropped  the  networks  on  the  surface  could  be  “tenuous”  enough  to  allow  surface 
self-intersection  [22],  and  thence  (multiply-hybridized)  structures  on  nonorientable 
surfaces  could  be  realized. 

With  the  presumption  of  a  non-self-intersecting  embedding  of  S  in  <£3,  a  general 
characterization  of  the  surface’s  shape  is  of  interest.  In  this  regard  curvatures  along 
different  directions  at  a  point  in  the  surface  may  be  considered,  with  kx  and  k2  the 
minimum  and  maximum  values  at  that  point.  (The  curvature  k  at  a  point  on  a 
smooth  curve  is  the  inverse  of  the  radius  of  the  circle  which  “kisses”  the  curve  at 
that  point.)  Then  the  Gaussian  curvature  (at  that  point  of  S)  is  defined  as 

k  =  ±  |  k\k2 1  (3.2) 

where  the  sign  is  +  if  the  kx  and  k2  orientations  of  curvature  are  toward  the  same 
side  of  S,  whereas  the  sign  is  -  if  the  k{  and  k2  orientations  of  curvature  are  toward 
opposite  sides  of  S.  A  fundamental  result  involving  an  area  integral  over  all  of  S 
is: 

Gauss-Bonnet  Theorem — For  an  orientable  closed  smooth  surface  S  in  £ 3,  the 
total  Gaussian  curvature  integrated  over  S  is  a  topological  invariant 

J  k  ds  =  27tX(5')  . 


That  is,  for  the  surfaces  S  around  the  outside  of  a  basketball,  or  of  an  (American) 
football,  or  of  a  discus  the  total  Gaussian  curvature  integrated  over  all  of  S  has  the 
same  value  4ir,  whereas  for  the  surfaces  around  a  doughnut  or  a  coffee  mug  the 
result  is  another  characteristic  value  0. 

A  special  natural  type  of  surface  [30]  embedded  in  e3  is  that  comprised  entirely 
of  planar  polygonal  faces.  Indeed  a  formal  study  of  such  so-called  polyhedra  (es- 
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pecially  more  “regular”  ones  with  X(S')  =  0)  dates  back  to  the  Greeks.  But  we  wish 
to  make  no  constraint  on  X(S)  and  to  emphasize  this,  we  will  here  refer  to  these 
so  unconstrained  closed  orientable  surfaces  with  polygonal  faces  as  topohedra.  At 
a  vertex  i  of  such  a  topohedron  where  several  faces  meet  one  may  define  an  angle 
defect 


0/  =  2?r  -  2  <t>ai  (3.3) 

a 

where  the  sum  is  over  the  face  angles  4>ai  for  the  faces  a  meeting  at  i.  For  example, 
for  a  cube  the  8  angle  defects  are  each  7t/2,  whereas  for  a  regular  tetrahedron  each 
of  the  4  angle  defects  is  i r.  Then  a  fundamental  result  for  topohedra  is: 

Descarte’s  Theorem — If  Sis  a  (closed)  topohedral  surface  in  <£3,  then 

vertices 

2  <t>i  =  2xX(5) 

i 

where  the  sum  is  over  all  angle  defects  0,  of  the  vertices  i  of  S. 

In  fact,  this  result  may  be  viewed  to  be  the  same  as  that  of  the  Gauss-Bonnet 
theorem  if  we  realize  the  angle  defects  are  essentially  the  net  Gaussian  curvatures 
associated  to  each  vertex — i.e.,  if  small  (nonoverlapping)  regions  around  each  vertex 
are  smoothed  out  and  k  is  integrated  over  the  zth  such  region,  then  the  result  is  0/ . 

A  topohedron  S  is  also  characterized  by  counts  v,  e ,  and  / of  its  vertices,  edges, 
and  faces.  But  that  S  is  closed,  places  an  important  constraint  on  these  counts: 

Euler’s  Theorem — For  a  topohedral  closed  S , 

v  -  e  +  f=  X(S). 

In  the  present  chemical  context  we  associate  atoms,  bonds  and  rings  to  the  vertices, 
edges  and  faces  of  such  a  topohedron. 

Embeddings 

Beyond  the  fundamental  results  of  the  preceding  section  characterizing  the  closed 
surface  S ,  there  is  also  the  mathematical  problem  [31,32]  of  characterizing  the 
embedding  of  the  graphical  molecular  framework  G  within  S .  Flere  an  embedding 
of  a  graph  G  into  a  surface  S  is  an  (injective)  mapping  where  vertices  and  edges  of 
G  are  mapped  to  points  and  open  line  segments  such  that  each  such  image  is 
entirely  disjoint  from  all  others,  and  each  line-segment  edge  image  has  (limiting) 
end  points  corresponding  to  the  vertices  that  the  edge  interconnects.  When  all  these 
images  of  vertices  and  edges  of  G  are  removed  from  S,  there  results  a  set  of  surface 
fragments,  called  faces.  If  all  these  faces  are  homeomorphic  to  an  ( open )  disc,  then 
the  embedding  is  said  to  be  cellular.  Thence  we  have  a  more  “graphical”  version 
of  the  last  theorem  of  the  previous  section: 

Euler’s  Theorem — If  a  cellular  embedding  of  G  with  v  vertices  and  e  edges  in  a 
closed  surface  S  gives  / faces,  then 
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v-e+f=x(S). 

Clearly  vertices  and  edges  of  G  might  be  correlated  with  vertices  and  edges  of 
polyhedra,  as  well  as  with  atoms  and  bonds.  The  theorem  applies  to  rather  general 
graphs  though  in  the  present  context  we  are  interested  in  3-connected  graphs,  being 
those  connected  graphs  which  require  the  removal  of  at  least  3  edges  to  separate  it 
into  two  fragments. 

Which  of  the  many  embeddings  of  a  graph  G  into  a  surface  S  might  be  “equiv¬ 
alent”  is  of  chemical  relevance,  because  such  equivalences  can  correspond  to  chem¬ 
ical  (stereo)  isomers.  Two  embeddings  \j/  and  p  are  combinatorially  equivalent  if 
there  is  a  one-to-one  correspondence  between  vertices,  edges,  and  faces  of  and  <p 
such  that  incidences  between  vertices,  edges,  and  faces  are  preserved.  That  is,  if  i, 
e,  and  a  are  vertex,  edge  and  face  images  under  $  corresponding  to  e\  and  ot 
under  p,  then 

i  ~  e  <=>  V  ~  e' 

e  ~  a  <=»  e'  ~  a'  (4.1) 

where  “  indicates  “incident  to”.  A  key  result  is: 

The  Steinitz- Whitney  Theorem— If  S  is  homeomorphic  to  a  sphere,  then  each 
planar  3-connected  graph  corresponds  to  a  unique  combinatorial  equivalence  class 
of  polyhedra. 

Though  a  powerful  classification  result  for  topologically  spherical  S ,  this  theorem 
does  not  seem  to  extend  (simply)  to  X(5')  0.  Still  for  the  X{S)  =  0  case  this 

theorem  reduces  the  characterization  of  these  equivalence  classes  to  a  standard 
graph-theoretic  problem. 

A  more  refined  equivalence  is  relevant  for  stereoisomerism.  Two  cellular  embed¬ 
dings  i p  and  <p  of  G  into  S  are  topocombinatorially  equivalent  if  there  exists  a  con¬ 
tinuum  of  cellular  embeddings  ^ , :  G  S  with  t  E  [0,1]  such  that:  first,  =  lfc 
second,  every  point  in  an  image  of  G  in  S  changes  continuously  with  t ;  and  third, 
=  p,  That  is,  the  atoms  and  bonds  in  S  are  imagined  to  be  able  to  be  moved 
about  (with  t  the  time  variable)  such  that  the  regions  of  occupation  do  not  come 
into  coincidence  at  any  time.  Presumably  then  this  corresponds  more  closely  to 
stereoisomer  classes  in  chemistry.  Again  the  case  of  X(*S')  =  0,  is  better  handled: 

Whitney-Steinitz  Refinement— If  S  is  homeomorphic  to  a  sphere,  then  each 
combinatorial  equivalence  class  corresponds  either  to  one  or  two  topocombinatorial 
equivalence  classes  depending  on  whether  mirror  embeddings  are  topocombina¬ 
torially  equivalent  or  distinct,  respectively. 

In  the  case  of  a  single  such  achiral  topocombinatorial  equivalence  class  there 
often  (maybe  always)  is  an  achiral  embedding  within  the  class.  If  the  condition 
X(S)  =  0  were  removed  from  the  hypothesis  in  this  theorem,  the  consequent  state¬ 
ment  would  not  generally  be  true.  For  example,  typical  topohedral  embeddings  on 
a  torus  may  be  viewed  as  a  rectangle  as  in  Figure  9  first  with  top  and  bottom  edges 
to  be  joined  together  and  second  with  left  and  right  edges  to  be  joined  together— 
but  typically  topocombinatorially  inequivalent  embeddings  result  depending  on 
the  “order”  in  which  the  two  joinings  are  done,  at  least  so  long  as  we  also  have  S 
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embedded  in  <£3.  That  is,  having  joined  together  the  same  bonds  crossing  the  edges 

of  the  rectangle,  the  same  graph  results  though  we  have  different  consequent  ste-  i 

reoisomers.  But  there  are  yet  more  topocombinatorial  possibilities  [22]  if,  e.g.,  on 

the  left  in  Figure  9  one  end  of  the  tube  is  twisted  through  a  full  rotation  before  i 

making  the  final  joining.  (The  cyclic  arrow  on  the  left  in  this  figure  indicates  the 

manner  of  this  twist.)  For  higher  genuses  beyond  the  torus  surely  many  more  such 

possibilities  arise. 

But  even  with  the  torus  further  possibilities  arise  in  connection  with  the  embedding 
of  S  into  <33.  Again  with  reference  to  the  middle-left  side  of  Figure  9  we  can  think 
of  the  open  cylinder  as  corresponding  to  the  sides  of  a  (somewhat  thick)  cord,  but 
before  forming  it  into  a  torus  the  cord  could  be  knotted.  As  a  consequence  knot 
theory  [33]  becomes  of  relevance.  But  in  the  present  context  this  issue  seems  sep¬ 
arable  [22]  from  much  of  the  rest  of  our  current  interest  and  will  not  be  considered 
further  here. 


Specialization 

The  theorematic  results  of  the  two  preceding  sections  may  be  (simply)  utilized 
to  establish  consequent  results.  For  trivalent  graphs  G  (to  be  embedded  in  an  S') 
the  number  of  incidences  between  vertices  and  edges  may  be  counted  in  terms 
either  of  the  number  v  of  vertices  or  of  the  number  e  of  edges  to  yield 


Figure  9.  Two  manners  of  forming  a  rectangle  into  two  different  tori,  which  with  a 
suitable  graph  embedded  on  the  sheet  in  a  fixed  manner  can  each  identify  to  different 

stereoisomers. 


ELEMENTAL  CARBON  ISOMERISM 


511 


3v  =  2e  .  (5.1) 

Further  if  the  number  of  « -sided  faces  is  denoted  by  /„,  then  the  number  of  incidences 
between  vertices  and  faces  can  similarly  be  counted  in  two  ways  to  yield 

3u  =  2  nfn  .  (5.2) 

n 

Thus  Euler’s  theorem  of  the  preceding  section  leads  to: 

Euleric  Specialization— For  trivalent  graphs  G  cellularly  embedded  in  a  closed 
surface  S, 

3/s  +  2/4  +/5  =  6  X(S)  +/7  +  2/8  +  3/9  +  •  •  •  . 

In  organic  chemistry  hexagonal  (benzenoid)  rings  play  a  special  role,  so  that 
some  sort  of  constraint  to  approach  this  circumstance  might  be  of  interest.  In  fact, 
rings  of  size  3  are  quite  strained,  while  4-membered  conjugated  rings  (and  to  a 
slightly  lesser  extent  8-membered  conjugated  rings)  are  unstable — this  being  the 
content  of  the  standard  4 n  +  2-Hiickel  rule.  Thence  it  seems  plausible  to  define  a 
cellular  embedding  of  a  trivalent  G  on  S  to  be  benzenoid  if  all  faces  are  hexagonal 
and  more  generally  to  be  qua-benzenoid  if  it  has  a  minimal  number  [consistent 
with  X(S’)]  of  nonhexagonal  rings  any  of  which  are  pentagonal  or  heptagonal.  The 
preceding  theorem  readily  leads  to  a  characterization  of  these  special  embeddings: 

Euleric  Corollary— For  trivalent  qua-benzenoid  graphs  cellularly  embedded  in 
a  closed  surface  S,  a\lfn  =  0forn¥z6  except  for 


/5  =  12  and  /7  =  0 

for 

X(S)  =  2 

fs  =  6  and^  =  0 

for 

x(S)  =  1 

/,  =  0  and  =  -2 X(S) 

for 

X(S)  <  0 

Notably  benzenoid  graphs  arise  if  and  only  if  X(*S)  =  0,  i.e.,  if  S  is  a  torus  or  a  so- 
called  “Klein  bottle”  (named  after  the  renowned  geometer,  Felix  Klein).  The  qua- 
benzenoids  on  spherically  homeomorphic  surfaces  are  usually  termed  fullerenes, 
for  which  a  further  refinement  applies  [  34  ] : 

Motzkin-Grunbaum  Theorem— For  every  even  vertex  count  v  >  24  there  exists 
at  least  one  fullerene,  and  the  sole  smaller  fullerene  is  the  dodecahedron  with 
v  =  20. 

In  fact,  the  proof  of  this  is  constructive,  providing  examples  of  such  structures. 
A  further  focus  notes  that  the  Hiickel  4n  +  2  rule  is  usually  extended  to  a  statement 
concerning  cycles  around  pairs  (or  even  triples)  of  fused  rings — and  in  particular, 
the  8-membered  cycle  around  any  two  fused  pentagons  should  contribute  toward 
instability.  Thence  fullerenes  with  no  abutting  pair  of  pentagons  are  defined  as 
preferable  or  isolated-pentagon-rule  fullerenes.  Parallel  to  the  preceding  theorem 
there  is  another  result  [  35  ] : 

Preferable-Fullerene  Theorem— For  every  even  vertex  count  v  >  70  there  exists 
at  least  one  preferable  fullerene,  and  the  sole  smaller  preferable  fullerene  is  the 
truncated  icosohedron  with  v  =  60. 

Again,  the  proof  is  constructive.  Further,  all  fullerenes  so-far  experimentally  ob¬ 
tained  in  macroscopic  quantities  fall  into  the  class  of  this  theorem,  with  the  two 
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smallest  such  preferable  fullerenes  (at  v  =  60  and  v  =  70)  corresponding  to  the 
two  species  most  readily  experimentally  obtained. 

There  are  further  results  which  concern  structural  stability  in  terms  of  a  classical 
view  of  the  distribution  of  double  bonds.  If  one  can  place  double  bonds  onto  the 
graph  embedding  with  exactly  one  double  bond  incident  at  each  site  (or  alternatively 
if  there  exists  a  spanning  subgraph  every  vertex  of  which  is  univalent),  then  such 
a  placement  (or  corresponding  univalent  subgraph)  is  termed  a  Kekule’  structure. 
Standard  classical  chemical  theory  views  such  structures  as  a  “necessity”  (or  nearly 
so)  for  a  stable  structure,  so  that  the  following  is  of  relevance: 

Kekule’-Structure  Theorem — Every  trivalent  graph  cellularly  embeddable  on  a 
sphere  admits  at  least,  three  Kekule’  structures. 

Amusingly,  the  proof  [  35  ]  utilizes  the  famous  “four-color  theorem.”  The  present 
result  thence  indicates  resonance  is  always  conceivable  for  such  carbon  cages.  Thus 
a  graph-theoretic  resonating  valence-bond  approach  seems  a  possibility  as  indeed 
we  have  pursued  [12,13,15,19]. 

Next  slightly  less  standard  but  still  classical  chemical-bonding  ideas  suggest  that 
[36]  conjugated  molecular  structures  are  more  chemically  stable  (and  more  “ar¬ 
omatic”  )  if  Kekule’  structures  are  admitted  with  a  larger  number  of  hexagonal  rings 
containing  three  double  bonds,  in  which  case  such  rings  are  said  to  be  conjugated 
(at  least  in  the  associated  Kekule’  structures).  Thus,  it  is  meaningful  [37,38]  to 
define  a  cellular  graph  embedding  to  be  a  Clar-sextet  structure  if  there  exists  a 
Kekule’  structure  wherein  every  double  bond  is  a  member  of  two  (i.e.,  the  maximum 
number)  conjugated  hexagonal  faces.  On  the  other  hand,  Fowler  et  al.  [14,39]  have 
noted  that  from  Hiickel  calculations  there  seems  to  be  a  stable  class  of  (preferable) 
fullerenes:  the  leap-frog  cages,  each  such  G  constructable  from  another  fullerene 
G\  (of  one  third  as  many  vertices)  via  a  two-fold  process: 

(a)  cap  every  face  of  the  embedding  Gy  on  a  sphere  S  to  obtain  G2\  then 

(b)  take  the  dual  of  G2  to  obtain  G. 


Figure  10.  The  capping-dualization  scheme  to  generate  leap-frog  cages. 
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The  process  is  indicated  in  Figure  10,  but  also  it  can  be  obtained  via  an  alternative 
two-step  process: 

(a)  take  the  dual  of  Gi  to  obtain  G\ ;  then 

(b)  truncate  G\  to  obtain  G. 

This  process  is  illustrated  in  Figure  11.  The  relevant  theorem  here  is  [38]: 

Leap-Frog-Clar-Sextet  Theorem— Clar-sextet  fullerenes  are  exactly  the  leap-frog 
fullerenes. 

Thence  this  theorem  provides  evidence  in  addition  to  other  evidence  for  stable 
^-electronic  structures  for  leap-frog  cages.  Questions  concerning  a-strain  of  the 
network  however  require  other  considerations  (as  indicated  in  the  up-coming 
“Cluster  Strain”  section  here).  Notably  [39]  (as  is  seen  from  Fowler’s  leap-frog 
transformation,  which  multiplies  the  number  of  vertices  in  Gj  by  3  in  forming  G) 
the  truncated  icosahedron  is  the  smallest  Clar-sextet  cage  (and  the  next  largest  has 
v  =  12  vertices  as  may  be  seen  from  the  Motzkin-Grunbaum  theorem) . 


Generation  and  Enumeration 

Beyond  the  theorems  of  the  preceding  sections  the  generation  of  explicit  cage 
graphs  is  desired — followed  by  a  characterization  of  these  cages.  For  the  toroidal 
case  an  attempt  [22]  has  been  made  to  identify  all  the  benzenoid  graphs  (such  as 
are  possible  via  the  “Euleric  corollary”  of  the  preceding  section)— and  even  all  the 
possible  topocombinatorial  equivalence  classes  of  embeddings  may  have  been  found. 
The  somewhat  related  problem  of  identifying  all  possible  benzenoid  graphs  for 
infinite  length  “buckytubes”  (a  number  of  which  have  been  recently  experimentally 
realized  [  40  ] )  has  been  simply  solved  [  4 1  ] .  The  case  of  fullerene  graphs  embeddable 
on  a  spheroidal  graph  has  a  simple  (one-to-one)  correspondence  to  combinatorial 
equivalence  classes  of  embeddings,  but  the  generation  of  all  these  graphs  seems  to 
offer  a  significant  challenge,  though  the  fullerene  cages  of  icosohedral  symmetry 


Figure  1 1.  The  alternative  dualizing-truncation  scheme  to  generate  leap-frog  cages. 
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is  complete  [13,14,42],  as  well  as  some  aspects  [15,16]  for  other  higher  sym¬ 
metry  cases. 

There  is  one  comprehensive  brute-force  approach  for  generation  of  graph  cages 
embeddable  on  a  spheroidal  surfaces,  though  the  computational  expense  increases 
rapidly  (perhaps  exponentially)  with  vertex  number  v,  though  a  previous  presen¬ 
tation  [18]  warrants  improvement.  In  this  approach  Schlegel  diagrams  are  grown, 
starting  from  an  initial  seed  (say  a  pentagonal  ring)  and  adding  rings  at  different 
locations  around  the  boundary  of  the  current  stage  of  the  growing  seed. 

At  any  (nonfinal)  stage  of  an  acceptable  growing  seed  T,  there  are  a  number 
(>  1 )  of  pairs  of  currently  divalent  sites  on  the  boundary  such  that  no  other  divalent 
sites  lie  between  the  two  on  “the”  boundary  between  these  two  sites,  as  in  the  left 
part  of  Figure  12.  Between  every  such  pair  a  new  ring  is  grown  (if  possible  with  the 
given  number  of  intervening  already  tri  valent  sites  on  the  boundary)  in  two  possible 
ways,  as  a  pentagon  or  as  a  hexagon,  each  result  as  indicated  in  the  right  part  of 
Figure  12.  To  minimize  the  length  of  the  rapidly  accumulating  list  L  of  growing 
seeds  T,  the  seeds  are  monitored  and  deleted  if  they  can  not  yield  a  desired  candidate 
cage.  This  is  done  keeping  track  of  the  numbers  of  different  types  of  faces,  edges, 
and  vertices  generated — where  these  types  are,  respectively,  defined  to  be  charac¬ 
terized  by:  the  size  of  a  face;  the  sizes  of  the  two  faces  on  either  side  of  an  edge; 
and  the  sizes  of  the  three  faces  incident  at  a  vertex.  Hence  the  admissible  (final) 
types  are:  5  or  6  for  faces;  (5,5),  (5,6)  or  (6,6)  for  edges;  and  (5,5,5),  (5,5,6), 
(5,6,6)  or  (6,6,6)  for  vertices.  We  denote  the  (final)  numbers  of  these  various  types 
of  graph  embedding  pieces  by  a  type  label  appended  to  every  subscript  to  /,  e  or  v. 
Then  defining  p  and  q  as  e(5>5)  and  u(5>5j5)  it  may  be  shown  that  [18]: 

Fullerene  Cage-Typing  Theorem — For  a  fullerene  with  p  =  e(5)5),  q  =  v(5^5), 
and  v  vertices  the  numbers  of  faces,  edges,  and  vertices  of  the  various  types  are 

/( 5)  =  12  £(5,5)  =  P  V {5,5,5)  ~  <1 

/(6)  =  (v/2)  -  10  £(5)6)  =  60-2 p  u(5s5)6)  =  2p-3q 

^(6,6)  =  v  —  60  +  p  ^(5,6,6)  =  60  —  4p  +  3q 

u(6,6,6)  =  v  —  60  +  2p  —  q  . 

Thence  in  growing  a  seed  the  current  counts  of  these  types  of  substructures  are 
monitored,  discarding  a  potential  growing  seed  if  the  number  of  any  one  type 
exceeds  the  final  values  as  given  above  for  target  graphs  with  chosen  fixed  values 
for  v,  p,  q.  Thus  a  list  C  of  candidate  cages  results. 


Figure  12.  Depiction  of  the  growth  step  to  be  independently  made  at  each  “subsequent5 
pair  of  currently  divalent  sites  such  as  here  indicated  by  asterisks. 
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A  final  matter  of  concern  is  the  elimination  of  redundant  graphs  from  the  cage 
list  C.  This  is  done  by  our  own  effective  graph  isomorphism  testing  algorithm  [43  ] , 
though  there  apparently  were  algorithms  already  developed  elsewhere  [44].  Our 
scheme  scales  with  time  much  the  same  (i.e.,  ~u3)  as  involved  in  diagonalizing  a 
graph’s  adjacency  (or  Hiickel)  matrix.  The  algorithm  has  the  broad  structure  in¬ 
dicated  in  Figure  13,  so  that  given  two  graphs  Gx  and  G2  the  algorithm  returns  one 
of  three  responses: 

( 1 )  Gx  and  G2  not  isomorphic. 

(2)  Gj  and  G2  isomorphic. 

(3)  Gx  and  G2  isomorphism  undetermined. 

But  notably  for  all  (of  many  thousands)  of  the  cages  we  have  tested  only  the  (fa¬ 
vorable)  first  or  second  answers  have  resulted.  In  fact,  such  a  favorable  response 
has  been  obtained  for  all  “molecular”  graphs  we  have  tested,  including  those  selected 
because  of  their  presumed  difficulty. 

For  example,  we  obtain  the  resultant  (combinatorial  equivalence  classes  of)  60- 
site  fullerenes  given  in  Table  I.  Notably  this  result  (particularly  the  number  in  the 


1st  stage  < 


V 


2nd  stage  < 


V 


C  start  ) 

i 

L± 

i 


compute  tabular  invariants 


l 


1* 


f  "failure*  ) 


Figure  13.  Gross  flowchart  for  graph  isomorphism  testing.  Here  T(GX)  and  T(G2)  are 
“tabular  invariants”  for  graphs  Gx  and  G2  while  A(Gx)  and  A(G2)  are  their  adjacency 

matrices. 
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Table  I.  Fullerene  isomers  for  v  =  60  sites. 


p\q 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

q — sum 

0 

1 

1 

1 

0 

0 

2 

1 

0 

1 

3 

3 

0 

0 

3 

4 

17 

0 

0 

17 

5 

81 

5 

0 

0 

86 

6 

215 

39 

0 

0 

0 

254 

7 

210 

147 

6 

0 

0 

363 

8 

145 

214 

54 

0 

0 

0 

413 

9 

23 

132 

131 

11 

0 

0 

0 

297 

10 

7 

28 

116 

42 

4 

0 

0 

197 

11 

0 

1 

31 

54 

10 

0 

0 

0 

96 

12 

I 

0 

6 

16 

25 

2 

0 

0 

0 

50 

13 

0 

0 

0 

2 

10 

7 

0 

0 

0 

19 

14 

0 

0 

0 

0 

3 

5 

2 

0 

0 

0 

10 

15 

0 

0 

0 

0 

0 

0 

2 

0 

0 

0 

0 

2 

16 

0 

0 

0 

0 

1 

0 

0 

0 

0 

1 

17 

0 

0 

0 

0 

0 

0 

0 

0 

0 

18 

0 

0 

0 

0 

1 

0 

0 

1 

19 

0 

0 

0 

0 

0 

0 

0 

20 

0 

0 

0 

0 

1 

1 

q  —  3  column)  corrects  an  earlier  table  [45]  where  rather  larger  but  incomplete 
initiating  seeds  were  used  in  an  attempt  to  limit  the  number  of  members  of  the  list 
G  (before  elimination  of  duplicates).  The  currently  reported  generation  assumes 
certainly  correct  initial  seeds  as  in  Figure  14,  so  that  all  cages  are  generated. 

Another  scheme  generates  graphs  making  a  “spirality”  assumption  and  the  as¬ 
sumption  of  absence  of  isospectrality — both  assumptions  seemingly  fairly  reliable 
though  there  seem  [46]  to  be  counter-examples  to  the  first.  For  the  second  as¬ 
sumption  the  results  from  the  application  of  our  algorithm  yield: 

Empirical  Spectrality  Theorem — Among  the  fullerene  polyhedra  there  are  no 
isospectral  pairs  of  fewer  than  72  vertices,  and  among  the  fullerenes  with  isolated 
pentagons  there  are  no  isospectral  pairs  of  fewer  than  98  vertices. 


Figure  14.  Respective  seeds  for  the  three  cases:  first  p  =  0;  second  p  >  1  and  q  =  0;  and 

third  q  >  1 . 
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There  is  yet  another  generation  scheme  [  47  ]  replacing  the  “spirality  assumption 
by  a  weaker  one — the  approach  is  much  like  ours  of  Figure  12  except  that  to  a 
growing  seed  pentagonal  and  hexagonal  faces  are  added  between  just  one  pair  of 
currently  divalent  vertices  selected  (in  a  “desirable”  way).  This,  of  course,  avoids 
many  cage  duplicates,  but  invokes  an  assumption  (perhaps  justified)  about  the 
form  of  the  consequent  Schlegel  diagram. 


Curvature  for  Clusters 

Crucial  to  cluster  geometry  are  the  curvature  ideas  of  the  “Closed  Surfaces” 
section.  But  in  fact  for  chemical  purposes  some  extensions  of  these  ideas  are  relevant. 
Thus  we  introduce  an  absolute  isotropic  curvature 

M  -  \kxk2\  (7T) 

and  an  absolute  anisotropic  curvature 

X  =  (|/ci|  -  \k2\)2.  (7.2) 

Of  course  just  as  for  the  Gaussian  curvature  the  corresponding  integrated  quantities 
are  of  interest,  and  should  satisfy  inequalities,  one  of  which  is: 

Absolute-Isotropic-Curvature  Theorem — For  an  orientable  closed  smooth  sur¬ 
face  S, 

J  \k\  ds  >  47r[l  +  y(S)]  . 

This  in  fact  “readily”  follows  from  the  Gauss-Bonnet  theorem,  as  we  now  indicate: 
Associated  to  S  there  is  a  minimum-sized  convex  surface  Sc  viewable  as  being 
obtained  from  S  by  tightly  wrapping  around  it  with  something  like  “cellophane,” 
so  that  all  portions  of  Sc  not  coincident  with  S'  are  flat  in  at  least  one  direction, 
whence  k  =  0.  The  portion  of  S  not  coincident  with  Sc  is  denoted  Sl,  and  one  has 
(from  the  Gauss-Bonnet  theorem) 

47t[1  —  y(S)]  =  f  k  ds  +  I  k  ds  .  (7.3) 

Jsc  Js‘ 


Then  noting  that  Sc  being  convex  has  y(Sc)  =  0,  we  have 

f  *  ds  =  4tt[1  -  y(S)]  -  4x[l  -  y(Sc)]  =  -4 iry(S)  .  (7.4) 

Js' 


Thus 


J  |  k  |  ds  =  J  |  k  |  ds  +  J\  |  k  |  ds  =  47t  +  J\  |  k  |  ds  >  4ir  +  k 


ds1 


(7.5) 


and  the  theorem  is  obtained. 

General  results  for  the  integrated  absolute  anisotropic  curvature  seem  to  be  more 
difficult  to  obtain.  But  there  is: 
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Extended  Willmore  Conjecture — For  a  closed  smooth  orientable  surface  S, 


-kmin\2ds>  8(v  -  2)wy(S) . 


In  fact  the  usual  Willmore  conjecture  [48,49]  is  for  y(S)  =  1  and  strictly  concerns 
the  integral  over  k\  +  but  the  present  conjecture  implies  the  usual  one. 

In  addition  to  these  results  belonging  to  classical  differential  geometry,  there  are 
some  further  analogous  combinatorial  results.  We  define  the  combinatorial  curvature 
at  face  a  of  a  cellular  embedding  of  G  on  S  to  be 

a  j  2^  \ 

A„  =  27t—  2  (’r- IT]-)  (7.6) 


where  the  sum  is  over  vertices  around  the  face  and  |  / 1  is  the  degree  of  vertex  i. 
This  definition  parallels  that  of  ( 3.3 )  where  ( 7r  -  2x/ 1  / 1 )  would  be  the  angle  turned 
through  at  vertex  i  if  the  embedded  edges  incident  at  i  radiated  straightly  outward 
at  uniformly  spaced  angles  around  i.  Also  in  parallel  with  this  and  (3.2),  one  may 
define  a  combinatorial  curvature  at  each  vertex  i, 


(7.7) 


where  the  sum  is  over  faces  incident  at  /  and  |  a  |  is  the  number  of  sides  to  the  face 
a.  Then  Euler’s  theorem  comes  into  a  close  parallel  with  the  differential-geometric 
curvature  theorems  of  the  third  section: 

Euleric  Combinatorial  Curvature  Theorem — If  G  is  cellularly  embedded  on  a 
closed  orientable  surface,  then 

2  A0  =  2*X{S)  =  2  A,  . 

a  i 

In  fact  this  theorem  is  just  a  restatement  of  the  earlier  Euler’s  theorem  as  may  be 
seen  after  substitution  of  the  definitions  for  and  A/ .  But  in  the  present  version 
of  the  theorem  the  analogy  with  Descartes’  theorem  or  the  Gauss-Bonnet  theorem 
is  more  closely  drawn. 


Cluster  Strain 

These  various  curvature  ideas  and  results  are  relevant  to  the  strain  arising  in 
realizable  cluster  geometries.  That  there  are  two  different  curvature  measures  (geo¬ 
metric  and  combinatorial )  is  important.  The  content  of  the  Gauss-Bonnet  theorem 
and  the  theorem  just  finished  is  that  globally  these  two  types  of  curvatures  must 
match  (i.e.,  be  equal).  But  even  local  matching  is  needed  if  strain  is  to  be  avoided. 
That  is  if  they  do  not  match  in  some  region  R  containing  the  ring  a,  then  there 
arises  a  consequent  strain  at  ring  a  involving  bond  dilatation  in  order  to  accom¬ 
modate  the  graph-theoretic  combinatorial  curvature  to  the  geometrically  realized 
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curvature.  A  strain  at  a  due  to  the  region  of  rings  within  a  graphical  distance  n  of 
a  then  seems  to  be  (proportional  to) 


en(a)  = 


IMH 


and  an  overall  curvature  mismatch  for  face  a 

e(«)  -  2  «n(«)  • 


(8.1) 


(8.2) 


In  addition  there  are  local  isotropic  and  anisotropic  curvature  contributions  to  the 
strain  involving  bond-angle  bending,  as  have  already  been  discussed  elsewhere  [15]. 
Thus,  the  total  strain  energy  associated  to  a  molecular  graph  G  cellularly  embedded 
in  a  surface  S  which  is  in  turn  embedded  in  <£3  is 

E(G,S)  =  a  f  \k\  ds  +  b  f  X  ds  +  c  2  «(«)  (8-3) 

Js  Js 


with  a,  b,  and  c  parameters  (actually  with  a  «  2b).  The  first  absolute-isotropic- 
curvature  term  here  is  bounded  by  the  first  theorem  of  the  preceding  section— and 
in  fact  this  bound  evidently  is  an  equality  if  the  surface  S  (as  embedded  in  &3)  is 
“unlumpy.”  The  second  (absolute-anisotropic-curvature)  term  and  the  third  (cur¬ 
vature-mismatch)  term  are  more  problematic  and  indeed  often  have  opposing  ten¬ 
dencies.  For  example,  for  a  large  fullerene  graph  G  say  as  in  Figure  15  it  is  seen 
that  G  is  embeddable  on  the  surface  of  a  sphere,  whence  there  is  no  anisotropic 
curvature  strain,  but  as  a  consequence  there  is  curvature  mismatch  in  the  regions 
of  hexagonal  faces  (these  having  0  combinatorial  curvature),  and  also  there  is 
mismatch  at  the  pentagonal  faces,  whose  combinatorial  curvature  exceeds  the  local 


Figure  15.  Conceivable  spherical  structure  for  C54 o- 


520 


KLEIN  AND  LIU 


geometric  curvature.  Alternatively,  S  may  be  deformed  to  the  shape  of  an  icoso- 
hedron  as  in  Figure  16,  whence  there  is  no  curvature  mismatch,  but  there  is  notable 
anisotropic  curvature  strain,  along  the  edges  of  the  icosohedron.  Evidently  a  shape 
between  that  of  an  icosohedron  and  a  sphere  is  preferable,  and  indeed  is  just  what 
can  be  recognized  in  quantum-chemical  computations  [50]  on  such  large  fullerenes. 

As  another  point  of  interest  one  can  inquire  what  might  be  the  conditions  that 
the  absolute-anisotropic  curvature  and  curvature  mismatch  terms  both  become 
their  minimum  value  0.  Evidently  the  extended  Willmore  conjecture  indicates  that 
in  order  to  achieve  this  we  must  have  7(5)  =  0,  and  we  might  further  constrain 
attention  to  the  fullerenes  (i.e.,  tri valent-graph  embeddings  in  a  sphere  such  that 
all  faces  are  pentagonal  or  hexagonal).  Then  we  have: 

Fullerenic  Conjecture — The  only  fullerenes  such  that  the  curvature  mismatch 
and  the  absolute-anisotropic  curvature  both  vanish  simultaneously  are  the  dode¬ 
cahedron  and  the  truncated  icosahedron. 

Of  course  the  dodecahedral  structure  is  unfavorable  for  other  reasons:  because 
it  is  open  shell  (at  icosahedral  symmetry)  and  because  the  absolute-isotropic  cur¬ 
vature  per  site  is  rather  high  (i.e.,  60/20  =  3  times  as  high).  Thence  the  truncated 
icosahedron  structure  is  singled  out  as  exceptional  in  several  ways:  first,  by  the 
preferable-fullerene  theorem  of  the  “Specialization”  section;  second,  by  the  Clar- 
sextet  leap-frog  theorem  along  with  the  comment  shortly  following  the  theorem) 
in  the  same  section;  and  third,  by  the  present  fullerenic  conjecture.  Thus  there  is 
a  3-fold  justification  of  Kroto  et  al.’s  original  suggestion  [6]  of  the  “uniquely  elegant” 
truncated-icosahedron  structure  for  Qo- 

Further  qualitative  insight  and  even  quantitative  results  should  we  believe  be 
possible  utilizing  our  general  strain  formulation  of  Eq.  (8.3).  This  we  plan  to  report 
on  further  elsewhere. 


Prospects 

Here  an  outline  towards  a  broad  extension  of  current  organic  stereochemical 
theory  has  been  attempted.  Beyond  the  matter  of  structural  possibilities,  many 


Figure  16.  Conceivable  more  icosahedrally  shaped  structure  for  C540. 
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questions  remain  unanswered,  most  prominently:  first,  as  to  what  sorts  of  novel 
“emergent”  properties  might  arise;  and  second  as  to  what  means  there  might  be  to 
synthesize  the  various  conceived  possible  structures.  Of  course,  current  quantum- 
chemical  methods  especially  of  a  semiempirical  nature  may  help  (and  indeed  to 
some  extent  already  have  helped)  address  this  first  question.  But  at  least  to  the 
present  authors  the  second  question  concerning  rational  synthesis  seems  quite 
opaque.  Still  the  range  of  possibilities  seems  intriguing,  and  might  encourage  others 
to  address  some  of  the  indicated  questions. 

Here  we  have  looked  at  the  structural  possibilities  going  beyond  the  simple  graph- 
theoretic  cluster  structure  to  embeddings  of  the  graph  in  appropriate  (locally  Eu¬ 
clidean)  closed  surfaces.  The  consequent  features  not  only  of  the  molecular  graph 
G  but  also  of  the  surface  S  and  of  the  embedding  of  G  in  S  seem  to  lead  far  toward 
a  characterization  of  the  conceivable  stereoisomers.  Thence  the  presentation  here 
may  be  viewed  as  a  development  of  classical  stereochemical  theory,  to  be  utilized 
both  by  experimentalists  and  quantum  chemists.  Moreover  the  curvature  ideas 
marshalled  in  the  section  “Curvature  for  Clusters”  are  natural  in  our  current  ste¬ 
reochemical  development  and  seem  in  the  “Cluster  Strain”  section  to  lead  to  insight 
concerning  strain  and  its  effects.  In  particular  the  global  shape  of  large  (hypothetic) 
fullerene  shapes  is  better  understood,  and  the  truncated-icosahedral  structure  of 
C60  is  verified  to  be  “uniquely  elegant.” 

Aside  from  the  relevant  topological  and  differential-geometric  aspects  emphasized 
here  there  are  a  number  of  more  conventional  chemical  graph-theoretic  questions. 
Here  the  sections  on  “Specialization”  and  “Generation  and  Enumeration”  bear 
more  so  on  these  graphical  aspects,  as  also  do  most  of  the  theoretical  articles  we 
have  referenced.  The  fullerene  cage-typing  theorem  of  the  “Generation  and  Enu¬ 
meration”  section  distinguishes  cages  rather  poorly  (as  seen  by  Table  I),  and  more 
faithful  graphical  characterizations  would  be  useful,  as  is  the  subject  of  other  work 
by  us  [51].  One  particular  related  point  concerns  iupac  nomenclature,  which  for 
the  fullerenes  has  been  recently  (correctly)  described  [52]. 

Still  further  developments  and  work  seem  likely— and  hopefully  useful  (even  at 
a  stage  preceding  full-blown  quantum-chemical  computations). 

Finally,  acknowledgment  is  made  to  the  Welch  Foundation  of  Houston,  Texas 
and  to  the  donors  of  the  Petroleum  Research  Fund  administered  by  the  American 
Chemical  Society. 
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Abstract 

A  “pedestrian”  approach  is  used  to  enumerate  the  spanning  trees  in  buckminsterfullerene.  The  approach 
is  called  “pedestrian”  because  calculations  were  carried  out  by  hand  and  pocket  calculator.  The  number 
of  spanning  trees  obtained  for  buckminsterfullerene  is  375,29 1,866,372,898,8 16,000,  a  number  identical 
to  the  value  reported  by  Brown  et  al.  [J.  Comput.  Chem.  12, 1118(1991)3  who  used  a  computationally 
more  involved  approach.  ©  1994  John  Wiley  &  Sons,  Inc. 


In  this  note  we  present  a  “pedestrian”  approach  to  the  enumeration  of  spanning 
trees  of  buckminsterfullerene  (see  the  soccerball-like  model  of  this  molecule  in  Fig. 
1 ).  A  spanning  tree  of  a  graph  G  is  a  spanning  acyclic  subgraph  of  G  [1] . 

The  number  of  spanning  trees  of  buckminsterfullerene  is  already  available  in  the 
literature.  Brown  et  al.  [2]  have  used  the  Gutman-Mallion-Essam  theorem  [3] 
(derived  from  the  Matrix-Tree  theorem  [4])  for  enumerating  the  number  of  span¬ 
ning  trees  in  a  graph  corresponding  to  buckminsterfullerene.  The  concept  of  a 
spanning  tree  underlies  the  method  of  McWeeny  [  5  ]  for  computing  x-electron 
“ring-current”  magnetic  properties  of  conjugated  molecules  [6].  It  is  also  used  in 
attempts  to  quantify  the  complexity  of  a  structure  [e.g.,  7], 

Our  procedure  is  based  on  the  spectrum  of  the  Laplacian  matrix  [8]  L  of  buck¬ 
minsterfullerene  and  consists  of  the  following  steps: 

(1)  Setting  up  the  Laplacian  matrix  L  of  buckminsterfullerene:  The  Laplacian 
matrix  is  defined  as  [7]: 

L  =  V  -  A  (1) 

where  A  is  the  adjacency  matrix  [9]  of  a  molecule  and  V  is  the  degree  (valency) 
matrix  which  is  a  diagonal  matrix  whose  entries  are  graph-theoretical  valencies. 
The  easiest  way  of  setting  up  the  Laplacian  matrix  of  buckminsterfullerene  is  by 
using  the  corresponding  Schlegel  diagram  [10].  The  Schlegel  diagram  is  the  rep¬ 
resentation  of  buckminsterfullerene  in  a  plane.  It  is  shown  in  Figure  2.  Note  that 
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Figure  1 .  A  soccerball-like  surface  structure  of  buckminsterfullerene. 


the  Schlegel  diagram  of  buckminsterfullerene  is  a  regular  graph  of  degree  3,  so  that 
all  diagonal  elements  of  matrix  V  are  equal  to  3. 

(2)  The  diagonalization  of  the  Laplacian  matrix:  In  Table  I  we  give  the  spectrum 
of  the  Laplacian  matrix  of  buckminsterfullerene.  It  has  been  obtained  by  hand 
computation  [11]  utilizing  the  irreducible  representations  of  the  Ih  symmetry  group 
to  which  buckminsterfullerene  belongs  [e.g.,  12,13].  Thus,  the  60  by  60  matrix 
was  broken  into  three  1  -by- 1  matrices  (two  of  which  occur  3  times),  five  2-by-2 
matrices  (two  of  which  occur  3  times,  two  more  of  which  occur  4  times,  and  the 
last  of  which  occurs  5  times)  and  one  3-by-3  matrix  (which  occurs  5  times). 


Figure  2.  The  Schlegel  diagram  of  buckminsterfullerene. 
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Table  I.  The  spectrum  of  the  Laplacian  matrix  of  buckmin- 
sterfullerene. 


*1  =  0 

*2  =  *3  =  =  [9V2  -  V10  -  2V(19  -  V5)]/4V2 

*5  =  x6  =  *7  =  *g  =  *9  =  (5  -  yi3)/2 
*io  =  x„  =  X\2  =  [9V2  +  yiO  -  2V(19  +  V5)]/4V2 
*13  =  *14  =  *15  =  *16  =  (7  -  yi7)/2 

*17  =  *18  =  *19  =  *20  =  *21  =  *22  =  *23  =  *24  “  *25  “  2 

*26  =  *27  =  *28  =  *29  =  *30  =  (7  ~~  l/5)/2 

*3.  =  *32  =  *33  =  [9V2  -  yio  +  2V(19  -  V5)]/4V2 
*34  =  *35  =  *36  =  (9  -  y5)/2 

*37  =  *38  =  *39  =  *40  =  *41  =  (5  +  yi3)/2 

*42  =  *43  -  *44  =  [9V2  +  yio  +  2V(I9  +  V5)]/4V2 

*45  =  *46  =  *47  =  *48  =  *49  =  (7  +  P)/2 
*50  =  *51  =  *52  =  *53  =  5 

*54  =  *55  =  *56  =  *57  =  (7  +  V17)/2 
*58  =  *59  =  *60  =  (9  +  V5)/2 


Note  that  the  first  Laplacian  eigenvalue  is  always  zero.  The  zero  value  for  the 
first  Laplacian  eigenvalue  is  the  result  of  the  special  structure  of  the  Laplacian 
matrix  [7,14]. 

(3)  The  computation  of  the  number  of  spanning  trees:  The  number  t(G)  of 
spanning  trees  of  a  graph  G  is  given  by  [  7, 1 5  ] : 

/(G)  =  (1/A0  I1*«(L)  (2) 

i= 2 

where  Nis  the  number  of  vertices  in  G  and  the  X/(L)  denote  the  Laplacian  spectrum. 
The  multiplication  starts  with  the  second  Laplacian  eigenvalue  since  the  first  ei¬ 
genvalue  is  always  zero  and  does  not,  of  course,  play  a  part  in  the  formula.  The 
above  formula  is  generally  valid  for  computing  the  number  of  spanning  trees  for 
any  graph  [16]. 

By  substituting  the  values  reported  in  Table  I  into  formula  (2)  we  obtain  the 
number  of  spanning  trees  in  buckminsterfullerene: 

t(G)  =  l21  X  35  X  54  X  ll5  X  19 3/2 2  X  3  X  5 
=  2 25  X  34  X  53  X  ll5  X  193 
=  375,291,866,372,898,816,000 


(3) 
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which  is  identical  to  the  value  reported  by  Brown  et  al.  [2] . 
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Abstract 

The  photodissociation  process  of  argon  cluster  ion  is  studied  by  using  the  molecular  dynamics  method 
with  non-adiabatic  transitions.  The  potential  energy  surfaces  and  the  electronic  states  are  calculated  with 
the  diatomics-in-molecules  (dim)  hamiltonian.  The  initial  configurations  are  sampled  from  the  classical 
trajectory  paths  on  the  potential  energy  surface  of  the  electronic  ground  state.  The  non-adiabatic  process 
is  treated  with  the  scheme  proposed  by  J.  C.  Tully  [J.  Chem.  Phys.  93,  1061  (1990)]  with  a  slight 
modification.  The  method  is  applied  to  the  photodissociation  process  of  Arf.  The  calculated  kinetic 
energy  distribution  and  angular  distribution  of  the  photofragment  are  in  good  agreement  with  the  ex¬ 
perimental  results,  but  the  branching  ratio  between  Ar+  and  ArJ  fragment  ions  is  not.  ©  1994  John  Wiley 
&  Sons,  Inc. 


Introduction 

Several  experimental  and  theoretical  studies  revealed  that  the  argon  cluster  ion, 
ArJ,  consists  of  the  trimer  ion  core,  Ar3,  and  the  surrounding  solvent  atoms  [1- 
10].  The  trimer  ion  has  a  strong  photoabsorption  band  in  the  visible  region,  and  it 
photodissociates  into  Ar+  ion  and  two  Ar  atoms  on  the  photoabsorption  [11-13]. 
So  do  the  larger  clusters,  but  the  fragment  ion  is  not  always  Ar+,  and  several  small 
cluster  ions  are  produced  as  a  result  of  the  photodissociation  [5,9,10].  All  the  excited 
states  of  these  small  cluster  ions  are  unstable,  so  that  the  fragment  ions  must  be  in 
the  electronic  ground  state.  Therefore,  the  non-adiabatic  transitions  from  the  photo- 
excited  state  to  the  ground  state  are  essential  in  the  photodissociation  process.  In 
these  experimental  conditions,  no  collisions  take  place  during  the  dissociation  pro¬ 
cesses.  Therefore,  the  transition  process  to  the  ground  state  is  a  typical  example  of 
the  intramolecular  energy  relaxation  process. 

In  this  study,  the  method  proposed  by  J.  C.  Tully  [14],  which  is  referred  as  CSH 
(coupled  surface  hopping  trajectory)  hereafter,  is  employed  to  treat  the  non-adiabatic 
transitions.  In  the  CSH  method,  the  nuclear  motion  is  treated  classically,  and  the 
electronic  state  that  governs  the  nuclear  motion  is  selected  statistically  among  the 
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adiabatic  electronic  states.  The  method  is  applied  to  the  photodissociation  process 
of  Atj  .  The  kinetic  energy  distribution,  the  angular  distribution,  and  the  size  dis¬ 
tribution  of  produced  photofragments  are  calculated  and  compared  with  the 
experimental  results. 


Hamiltonian 


Calculation 


The  adiabatic  potential  energies  and  the  electronic  states  are  calculated  from  the 
dim  hamiltonian,  #[1,4,1 5].  The  3/?-hole  valence  extreme  structures,  in  which  the 
positive  hole  in  the  cluster  is  localized  on  3j pw  (w  =  x,  y,  or  z)  orbital  of  one  of  the 
constituent  atoms,  are  taken  as  the  basis  set.  The  dimension  of  the  basis  set  is  3n. 
The  derivatives  of  the  potential  energies  with  respect  to  the  nuclear  coordinates 
can  be  calculated  from  the  derivatives  of  each  element  of  H  [16]: 


V<^|#|^)  =  <^-|V#|^>,  (1) 

where  'k,  is  the  zth  eigen  function  of  H  and  is  normalized.  The  dim  hamiltonian 
matrix  is  explicitly  represented  as  a  sum  of  the  dimeric  hamiltonian  matrix  HAB , 
where  A  and  B  are  constituent  atoms.  Because  HAB  is  a  function  of  the  internuclear 
distance  RAB ,  the  derivatives  of  the  matrix  elements  of  HAB  are  readily  obtained 
analytically,  as  are  the  derivatives  of  H.  The  non-adiabatic  coupling  vectors  are 
also  calculated  analytically  from  the  derivatives  of  H  [16]: 


where  Et  is  the  zth  eigenvalue  of  H. 


Ei  -  Ej 


(2) 


Initial  Configuration 

The  initial  configurations  and  momenta  of  the  nuclei  are  sampled  from  the  normal 
molecular  dynamics  (md)  calculation  of  the  cluster  ion  in  the  electronic  ground 
state.  The  initial  internal  energies  of  the  trajectories  are  taken  moderately  high 
enough  that  a  few  trajectories  dissociate  through  the  thermal  evaporation  of  the  Ar 
atoms;  the  typical  microcanonical  temperature  is  190-200  K.  After  about  6  ps  at 
high  temperature,  the  cooling  down  process  is  started.  The  clusters  are  cooled  down 
to  40  K,  by  removing  the  kinetic  energy  gradually.  The  annealing  process  takes 
about  15  ps,  followed  by  the  normal  trajectory  calculation  for  1.2  ps.  After  that, 
50  structures  are  sampled  with  the  time  interval  of  0. 1  ps.  The  total  number  of 
2500  structures  are  obtained  from  50  trajectories  generated  independently.  The 
photoexcited  state  from  which  the  CSH  trajectory  calculation  starts  is  selected  among 
the  adiabatic  states.  The  probability  to  be  selected  is  proportional  to  the  oscillator 
strength  of  the  transition  from  the  ground  state. 


Dynamics 

The  photodissociation  trajectory  is  calculated  by  using  the  CSH  method,  followed 
by  the  normal  md  calculation.  At  first,  each  trajectory  is  evolved  with  the  CSH 
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method  for  ca.  0.5  ps.  Then,  the  trajectory  is  examined  to  determine  whether  the 
electronic  state  drops  down  to  the  ground  state  and  whether  the  ground  state  is 
well  isolated  from  all  other  excited  states  in  energy.  If  both  of  the  conditions  are 
satisfied,  the  rest  of  the  trajectory  is  calculated  by  using  the  normal  MD  calculation, 
which  is  much  faster  than  the  csh  calculation. 

The  details  of  the  CSH  calculation  are  described  elsewhere  [13],  and  only  a  brief 
outline  is  shown  here.  At  the  starting  point  of  the  CSH  calculation,  the  electronic 
wave  function,  $,  is  taken  as  one  of  the  adiabatic  electronic  states,  The  electronic 
state  is  evolved  in  time  according  to  the  Schrodinger  equation, 

i  —  =  H{R(m.  (3) 

at 

Note  that  the  electronic  state  $  is  no  longer  the  eigen  function  of  H  after  the  time 
evolution,  because  H  changes  in  the  time  through  the  changes  of  the  nuclear  co¬ 
ordinates  R.  The  nuclear  motion  itself  is  governed  by  the  adiabatic  potential  energy 
surface,  E£R),  and  it  is  calculated  by  using  the  Newton  equation  of  motion.  The 
population  of  ^  in  4>, 

0«-  l<*tl*>l2,  <4) 

is  monitored,  and  the  non-adiabatic  transition  is  invoked  when  a%  decreases.  The 
timing  of  the  transition  is  determined  according  to  the  fewest  switching  algorithm 
[13,14].  The  destination  state,  is  selected  among  the  adiabatic  states,  by  ex¬ 
amining  (a)  the  increase  of  av  and  (b)  the  magnitude  of  the  non-adiabatic  coupling 
between  ^  and  [13]. 

The  fifth-order  Adams’  formula  is  employed  in  the  numerical  integration.  Because 
the  electronic  wave  function  oscillates  rapidly,  the  time  step  in  the  CSH  calculation 
is  taken  as  short  as  0.05  fs.  The  time  step  in  the  normal  md  calculation  is  2.4  fs. 

Results 

One  of  the  photodissociation  trajectories  of  Ar|  is  shown  in  Figure  1.  The  actual 
potential  energy  surface  on  which  the  nuclei  are  moving  is  plotted  in  a  solid  line, 
while  the  other  adiabatic  potential  energies  are  plotted  in  dashed  lines.  There  are 
2 1  adiabatic  states  in  Ar* ,  but  only  the  lowest  seven  states  are  shown  in  the  figure 
for  clarity.  The  trajectory  starts  from  the  sixth  excited  electronic  state  at  t  =  0.  After 
seven  non-adiabatic  transitions  indicated  by  vertical  arrows  at  the  top  of  the  figure, 
it  finally  settles  down  to  the  ground  electronic  state,  to  produce  a  dimer  ion  and 
five  neutral  atoms.  The  potential  energy  starts  to  oscillate  regularly  at  1  ps,  which 
reflects  the  vibration  of  the  dimer  ion. 

Several  snapshots  of  the  trajectory  are  shown  in  Figure  2.  The  atoms  labeled  as 
“1”,  “2”,  and  “3”  form  a  linear  ArJ  ion  core  before  the  photoexcitation.  Just  after 
the  excitation,  “1”  is  ejected  along  the  colinear  axis  of  the  ion  core  (t  =  0.3  ps), 
and  it  flies  away.  On  the  other  side  of  the  ion  core,  “3”  also  tries  to  escape,  but  it 
is  blocked  by  “4”  located  on  the  colinear  axis,  and  instead,  “4”  is  slowly  ejected 
along  the  axis.  From  0.3  to  0.9  ps,  the  rest  of  the  five  atoms  undergoes  several  non- 
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Figure  1.  The  potential  energy  curve  along  the  photodissociation  trajectory  of  Ar?  is 
plotted  in  a  solid  line  as  a  function  of  time.  The  adiabatic  potential  energies  of  the  lowest 
seven  states  are  also  plotted  in  dashed  lines.  The  trajectory  starts  from  the  sixth  excited 
state  at  /  =  0,  commits  seven  non-adiabatic  transitions  (as  indicated  by  arrows  at  the  top 
of  the  figure),  and  finally  settles  down  in  the  ground  state. 


adiabatic  transitions  and  charge  re-distributions,  and  finally,  the  charge  is  localized 
on  the  atoms  labeled  by  “3”  and  “7”  to  form  the  dimer  ion. 

The  photoabsorption  cross  section  of  Ar?  is  calculated  at  the  starting  points  of 
2500  photodissociation  trajectories  and  plotted  in  Figure  3.  The  resolution  of  the 
spectrum  is  20  nm.  To  make  a  meaningful  comparison  with  the  experimental 
results  [10],  we  concentrate  on  those  trajectories  such  that  their  excitation  energies 
fall  within  the  range  between  2.25-2.75  eV  (450-550  nm).  The  region,  indicated 
as  a  shaded  area  in  the  figure,  contains  1575  trajectories.  Among  those  trajectories, 
13%  of  them  are  not  completely  terminated:  1 1%  still  remain  in  the  excited  state 
at  2.7  ps  after  the  photoexcitation,  while  2%  do  drop  to  the  ground  state  but  the 
internal  energy  is  large  enough  for  further  dissociation.  The  remaining  87%  trajec¬ 
tories  are  terminated  normally:  54%  produce  an  Ar+  fragment  ion,  while  33%  end 
up  with  an  ArJ  fragment  ion. 

The  kinetic  energy  distribution  of  the  monomer  fragment  ion,  Ar+,  and  that  of 
the  neutral  fragments  are  calculated  from  those  terminated  trajectories,  and  are 
plotted  in  Figure  4.  The  kinetic  energy  of  the  neutral  fragments  shows  a  bimodal 
distribution  as  is  seen  in  Figure  4(b).  We  refer  to  the  components  peaked  at  0.4  eV 
as  the  fast  fragments  hereafter.  The  threshold  energy  is  arbitrarily  set  to  0.2  eV.  As 
shown  in  Figure  4(a),  the  fast  fragments  dominate  the  distribution  of  the  monomer 
fragment  ion. 

The  translational  energy  (a)  and  the  internal  energy  (b)  distributions  of  the  dimer 
fragment  ion  are  shown  in  Figure  5.  Unlike  the  monomer  ion  and  the  neutral 
fragments,  the  dimer  fragments  carry  little  translational  energy,  and  its  distribution 
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Figure  2.  The  snapshots  on  the  trajectory  shown  in  Figure  1 .  The  darkness  on  atoms 
refrects  charge  density  on  them.  The  lines  drawn  from  atoms  indicate  the  velocity,  which 
correspond  to  the  movements  in  the  next  1  ps. 


decreases  monotonously.  A  large  amount  of  energy  is  stored  in  the  vibrational 
mode,  as  is  evident  in  Figure  5(b).  As  the  binding  energy  of  the  dimer  ion  is  1.31 
eV,  the  produced  dimer  ion  is  considered  to  be  highly  vibrationally  excited,  and 
besides,  its  energy  distribution  is  far  from  the  thermal  one.  Note  that  the  plotted 
energy  is  that  of  the  dimer,  so  that  the  average  energy  per  atom  becomes  as  large 
as  the  kinetic  energy  of  the  monomer  fragment  ion  (ca.  0.4  eV). 

The  angular  distributions  of  the  fast  component  of  the  monomer  fragment  ion 
and  the  neutral  fragments  are  plotted  in  Figure  6,  as  well  as  that  of  the  dimer 
fragment  ion.  The  angle  is  measured  between  the  transition  dipole  moment  of  the 
photoexcitation  and  the  velocity  vector  of  the  photofragment.  The  distribution  of 
the  fast  monomer  ions  and  that  of  the  fast  neutral  fragments  are  anisotropic:  they 
are  mainly  ejected  along  the  transition  dipole  moment.  The  distribution  for  the 
dimer  ions  is  isotoropic. 

Discussion 

Excitation  Energy  Dependence 

In  this  study,  the  multi-dimensional  and  multi-state  non-adiabatic  dynamics  are 
directly  treated.  The  non-adiabatic  transitions  are  expected  to  occur  more  frequently 
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Figure  3.  The  photoabsorption  spectrum  of  Ar*  calculated  from  the  ensembles  of  the 
initial  configurations  for  the  photodissociation  trajectories.  The  resolution  of  the  spectrum 
is  20  nm.  The  shaded  area  (2.25-2.75  eV)  contains  63%  of  the  2500  trajectories. 


as  the  nuclear  velocity  becomes  large.  In  the  present  case  of  the  photodissociation 
of  Ar|,  the  nuclear  kinetic  energy  originates  from  the  initial  excitation  energy,  A E. 
To  examine  the  dependence  of  the  dynamics  on  A E,  the  2238  normally  terminated 
trajectories  among  2500  samples  are  grouped  into  three  parts:  A E  >  2.75  eV  (the 
left  blank  area  in  Fig.  3),  2.25  <  A E  <  2.75  eV  (the  shaded  area),  and  A E  <  2.25 
eV  (the  right  blank  area).  The  excitation  energy  A E  and  frequency  of  the  non- 
adiabatic  jumps  are  averaged  separately  in  each  part,  and  summarized  in  Table  I 
with  the  other  statistical  values.  The  number  of  non-adiabatic  jumps  does  become 
large  with  the  excitation  energy,  but  it  is  because  more  jumps  are  required  to  go 
down  to  the  ground  state,  if  the  trajectory  starts  from  higher  excited  states.  The 
number  of  jumps  per  unit  time  is  almost  independent  from  A E.  It  is  probably 
because  the  fast  fragments  are  ejected  in  the  early  stage  of  the  photodissociation  as 
typically  seen  in  atom  “  1  ”  of  Figure  2.  The  fast  fragments  carry  away  the  excess 
energy,  leaving  a  much-the-same  amount  of  the  internal  energy  in  the  residuals. 
Note  that  the  non-adiabaticity  is  determined  by  the  relative  velocities  among  atoms 
located  within  the  interacting  part  of  the  cluster. 

Origin  of  the  Fast  Fragments 

The  visible  photoabsorption  band  of  ArJ  is  mainly  attributed  to  the  -► 
2Xg  transition  [4,1 1-13,17-20].  The  transition  dipole  moment  is  parallel  to  the 
colinear  symmetric  axis  of  ArJ ,  and  when  it  is  excited,  both  of  the  side  atoms 
attached  to  the  center  atom  of  ArJ  are  ejected  along  the  colinear  axis.  Because 
the  charge  is  located  on  both  of  the  side  atoms  in  the  photoexcited  state  2SJ,  a 
fast  Ar+  fragment  [9,21-23],  a  fast  neutral  Ar  fragment  [10],  and  a  slow  Ar 
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Figure  4.  Kinetic  energy  distributions  of  monomer  ions  (a)  and  neutral  fragments  (b) 
produced  from  the  photoexcitation  of  Art  between  2.25-2.75  eV.  The  distribution  of 
neutral  fragments  is  bimodal,  while  the  distribution  of  ionic  fragments  has  only  the  fast 

component. 


fragment  are  produced  on  the  photoexcitation.  These  dissociation  processes  were 
elucidated  in  our  previous  article  [13].  Note  that  a  few  slow  Ar+  fragments  are 
produced  through  non-adiabatic  transitions  during  the  photodissociation. 

As  discussed  in  the  previous  article  [4],  the  photoexcited  state  of  the  larger 
clusters  remembers  the  character  of  ArJ.  Naturally,  the  fast  Ar+  fragment 
is  expected  to  be  produced  through  the  photoexcitation  of  Ar?  and  to  be  ejected 
along  the  transition  dipole  moment.  The  fast  Ar+  fragment  is,  however,  no  longer 
a  major  ionic  product  of  the  photodissociation.  The  non-adiabatic  transitions 
now  play  a  more  important  role  in  the  photodissociation  process  of  Ar7 .  Most 
of  the  charge  carried  by  the  fast  escaping  atoms  as  atom  “1”  in  Figure  2  is  drawn 
back  to  the  residual  part  of  the  cluster.  Fast  neutral  atoms  are  thus  produced, 
and  the  majority  of  the  neutral  fragments  that  constitute  the  slow  component 
in  the  velocity  distribution  are  produced  from  successive  evaporation  from  the 
residuals.  Note  that  not  all  the  fast  neutral  fragments  originate  from  either  of 
the  two  side  atoms  that  initially  form  an  ion  core  with  the  center  atom.  The 
number  of  fast  neutral  fragments,  the  number  of  those  originating  from  an  ion 
core,  and  their  kinetic  energies  are  averaged  and  summarized  in  Table  II,  as 


536 


IKEGAMI  AND  IWATA 


Internal  Energy  /  eV 

Figure  5.  The  translational  (a)  and  the  internal  (b)  energy  distribution  of  the  dimer  frag¬ 
ment  ion.  The  rotational  energy  is  included  in  the  internal  energy. 


well  as  the  number  of  fast  Ar+  fragments.  For  excitation  energies  between  2.25 
to  2.75  eV,  about  20%  of  the  fast  neutral  fragments  originate  from  the  solvent 
atoms.  These  atoms  are  likely  to  be  generated  from  a  rear-end  collision  be¬ 
tween  the  solvent  atoms  and  the  atom  ejected  from  an  ion  core,  as  atom  “4”  in 
Figure  2. 

When  Ar^  is  excited  higher  than  2.75  eV,  96%  of  the  trajectories  produce  Ar+ 
fragment  ions.  Nevertheless,  only  29%  of  the  trajectories  produce  fast  Ar+  fragments. 
It  is  because  when  excited  higher,  ejected  atoms  from  the  ion  core  get  much  kinetic 
energy,  which  makes  the  non-adiabatic  transitions  (or  the  charge  draw-back  process) 
easier.  Indeed,  only  16%  of  the  fast  Ar+  fragment  originates  from  the  ion  core.  On 
the  other  hand,  when  excitation  energies  are  between  2.25  and  2.75  eV,  57%  of  the 
trajectories  produce  fast  Ar+  fragments,  and  90%  of  them  originate  from  the  ion 
core.  Since  62%  of  the  trajectories  end  up  with  Ar+  fragments,  more  than  80%  of 
the  Ar+  fragment  ions  come  from  the  ion  core.  When  the  excitation  energy  is  lower 
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Figure  6.  The  angular  distribution  of  the  photofragments  of  ArJ  around  the  transition 
dipole  moment  of  the  photoexcitation.  The  distributions  of  fast  monomer  fragment  ions 
(solid  line),  fast  neutral  fragments  (dashed  line),  and  dimer  fragment  ions  (dotted  line)  are 

plotted. 


than  2.25  eV,  almost  no  Ar+  fragment  is  produced.  Since  the  binding  energy  of 
Atj  with  respect  to  Ar+  +  6  Ar  is  - 1 .79  eV  at  0  K  [4],  it  is  energetically  improbable 
to  produce  Ar+.  Instead,  the  cluster  utilizes  the  binding  energy  of  ArJ  to  evaporate 
solvent  atoms. 

Comparison  with  Experiments 

Nagata  and  Kondow  observed  the  kinetic  energy  distribution  of  the  neutral  frag¬ 
ments  from  their  time-of-flight  mass  spectra  [10].  They  found  a  bimodal  distribution 
in  the  photodissociation  of  Ar^  at  532  nm  (2.33  eV),  with  an  average  kinetic  energy 


Table  I.  Characteristics  of  the  trajectories  for  the  three  regions  of  the  excitation  energy,  A E. 


Excitation  energy /eV 

>2.75 

2.25-2.75 

<2.25 

No.  of  trajectories3 

90 

1365 

783 

Averaged  AF^/eV 

3.58 

2.44 

2.04 

No.  of  non-adiabatic  jumps 

12.4 

8.2 

4.4 

Freq.  of  non-adiabatic  jumps/ps-1 

7.6 

7.7 

7.0 

No.  of  fast  fragments0 

2.25 

2.27 

2.06 

Kinetic  energy  of  fast  fragmentsc/eV 

0.75 

0.38 

0.43 

a  Number  of  the  trajectories,  which  start  after  the  excitation  with  A  E  and  terminate  normally. 
b  Averaged  excitation  energy  A E  over  the  trajectories. 

c  A  fast  fragment  is  defined  as  the  one  which  has  more  than  0.2  eV  of  kinetic  energy.  Charges 
on  the  fragments  are  not  considered  here. 
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Table  II.  Characteristics  of  the  neutral  fragments  for  the  three  regions  of  the  excitation  energy. 


Excitation  energy /eV 

>2.75 

2.25-2.75 

<2.25 

No.  of  fast  neutrals3 

1.96 

1.70 

2.05 

No.  of  fast  neutrals  come  from  the  ion  coreb 

1.52 

1.35 

1.15 

Kinetic  energy  of  fast  neutrals/eV 

0.75 

0.39 

0.43 

No.  of  fast  Ar+  fragmentsc 

0.29 

0.57 

<0.01 

a  Total  number  of  fast  neutral  fragments  per  one  photodissociation  event. 
b  Number  of  the  fast  neutral  fragments  originating  from  the  side  atoms  of  the  ion  core. 
c  Number  of  fast  Ar+  fragments  per  one  photodissociation  event. 


of  the  fast  components  as  ca.  0.35  eV.  They  also  estimated  the  anisotropy  parameter 

of  the  fast  neutral  atoms  to  be  1.5-2,  and  the  number  of  the  fast  neutrals  ejected 
per  single  photodissociation  event  to  be  two.  The  present  calculation  reproduces 
the  experimental  results  very  well.  The  average  kinetic  energy  of  fast  neutral  frag¬ 
ments  calculated  from  Figure  4  is  0.38  eV,  the  angular  distribution  shown  in  Figure 
6  indicates  the  positive  anisotropy  parameter,  and  the  calculated  number  of  fast 
neutral  fragments  per  trajectory  is  1.7. 

There  is,  however,  a  discrepancy  between  the  present  calculation  and  the  exper¬ 
imental  result  in  the  branching  ratio  between  monomer  ion  productions  and  dimer 
ion  productions,  Ar+/Ar2 .  The  observed  branching  ratio  is  almost  0,  that  is,  no  Ar+ 
fragment  ion  is  observed  experimentally,  while  the  calculated  one  is  1.6  for  2.25  < 
A E  <  2.75.  Because  the  branching  ratio  calculated  for  A E  <  2.25  is  0.01  and  the 
one  for  A E  >  2.75  is  21.6,  it  is  a  steep  function  of  the  excitation  energy.  So  that  if 
the  energy  range  were  taken  narrower  around  2.33  eV  (532  nm)  in  the  calculations, 
the  branching  ratio  might  be  reduced. 

Conclusion 

The  photodissociation  process  of  the  argon  cluster  ion  is  studied  by  using  Tully’s 
scheme  with  a  slight  modification.  The  agreements  with  the  experimental  obser¬ 
vation  for  the  kinetic  energy  distribution  and  the  angular  distribution  for  the  pho¬ 
tofragments  of  Atj  are  satisfactory.  For  the  branching  ratio  between  Ar+  and  ArJ 
fragment  ion,  however,  the  calculation  overestimates  the  production  of  Ar+.  The 
poor  resolution  of  the  excitation  energy  seems  to  be  responsible  for  the  discrepancy, 
but  the  dependency  of  the  ratio  on  the  initial  conditions  of  the  trajectories,  such 
as  the  rotational  state  or  the  temperature  of  the  cluster,  must  be  examined. 
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Abstract 

The  minimum-energy  structures  and  bonding  properties  of  the  hydrogenated  lithium  clusters,  Li4H2 
and  LivH,  have  been  investigated  by  means  of  an  ab  initio  Monte  Carlo  simulated  annealing  method. 
The  minimum-energy  structures  of  Li4H2  and  Li7H  are  found  to  resemble  those  of  the  triangular  planar 
(D3h)  isomer  of  Li6  and  Td  isomer  of  Li8  clusters,  respectively.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Interactions  of  hydrogen  and  oxygen  with  small  metal  clusters  and  bulk  metal 
surfaces  are  of  fundamental  importance  for  many  chemical  processes  such  as  ca¬ 
talysis,  corrosion,  and  materials  processing,  and  play  an  important  role  in  chemical 
and  electronics  industries.  Recent  advances  in  experimental  techniques  have  enabled 
detailed  spectroscopic  studies  on  gas-phase  clusters  and  their  interaction  with  small 
molecules.  A  number  of  theoretical  studies  have  been  performed  to  model  the 
electronic  structure  and  reactivity  of  metal  clusters  and  molecule-metal  surface 
interactions  [1]. 

Kaldor  et  al.  [2]  have  pointed  out  that  there  are  important  differences  between 
the  reactions  on  metal  surfaces  and  the  reactions  with  bare  metal  clusters.  Clusters 
of  metal  atoms  react  readily  with  small  molecules  and  their  reactivity  exhibits  very 
characteristic  cluster-size  dependency.  In  small  clusters  of  up  to  15  atoms,  most  of 
the  atoms  .are  surface  atoms.  Owing  to  its  small  dimension,  the  surface  of  the 
clusters  are  strongly  curved  with  high  step  densities  [2,3].  As  the  steps  and  other 
defects  on  the  surfaces  enhance  the  reactivity,  bare  metal  clusters  are  usually  highly 
reactive.  The  capability  of  small  metal  clusters  to  undergo  geometric  relaxation 
during  the  reaction  with  small  molecules  is  quite  unique.  Because  numerous  struc¬ 
tural  isomers  with  low  energy  barrier  exist,  the  clusters  can  easily  undergo  structural 
changes  to  react  with  adsorbates  [2,3]. 
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Recently,  Przybylski  et  al.  [4]  have  studied  the  reaction  pathways  for  the  inter¬ 
action  of  H2  with  Li4  and  Li6  clusters  by  means  of  ab  initio  correlated  methods.  In 
that  study,  they  have  used  a  conventional  geometry  optimization  technique  at  the 
Hartree-Fock  (hf)  level  and  calculated  the  configuration  interaction  (ci)  energies 
for  the  optimal  HF  geometries  using  multireference  diexcited  (mrd)-ci  procedure. 
Kato  et  al.  [5]  have  studied  the  electronic  structure  of  Li„Hm  clusters  (m  <  n  <  4) 
using  the  hf  and  Cl  methods  with  a  5-21 G  basis  set.  In  a  previous  study  [6],  we 
have  investigated  the  minimum-energy  structures  of  a  hydrogenated  lithium  cluster, 
Li5H  using  the  first-principles  Monte  Carlo  (MC)  simulated  annealing  method. 

Most  ab  initio  studies  have  employed  conventional  geometry  optimization  tech¬ 
niques  to  determine  the  structure  of  small  metal  and  semiconductor  clusters.  Since 
the  energy-minimization  procedures  make  only  downhill  movements,  the  starting 
structure  and  the  minimized  structure  are  always  in  the  same  potential  energy  well 
in  the  conventional  energy-minimization  algorithm  [7].  With  such  an  approach, 
one  may  easily  fall  into  one  of  several  local  minima  if  the  starting  conformation  is 
in  the  local  energy  minimum  region  [7].  In  order  to  find  the  global  minimum,  a 
brute-force  strategy  has  been  employed  in  the  past  [8].  This  approach  attempts  to 
locate  this  global  energy  minimum  by  generating  all  of  the  possible  starting  ge¬ 
ometries,  and  minimizing  each  to  the  nearest  (local  or  global)  minimum.  However, 
as  the  number  of  atoms  increases  in  the  cluster,  the  number  of  local  minima  increases 
exponentially.  Therefore,  it  becomes  very  difficult  to  enumerate  all  of  the  possible 
geometries  to  locate  the  global  minimum  among  the  local  minima. 

Recently,  a  number  of  simulation  algorithms  have  been  developed  and  applied 
to  the  study  of  the  structure  and  dynamics  of  the  metal  and  semiconductor  clusters. 
Among  these  are  the  Monte  Carlo  (mc)  [6,9]  and  molecular  dynamics  (md)  [10- 
13]  simulated  annealing  algorithms  to  locate  the  global  minimum  energy  structure. 
The  former  attempts  to  locate  the  global  minimum  by  searching  configuration 
space  (i.e.,  by  sampling  canonical  ensemble)  using  the  MC  approach  known  as 
simulated  annealing  [7,14,15].  The  latter  attempts  to  locate  the  global  minimum 
by  evolving  the  system  with  time.  The  MC  and  MD  simulated  annealing  techniques 
have  been  successfully  applied  to  a  number  of  systems  including  rare  gas  atom 
clusters  using  the  Lennard-Jones  pair  potential  and  embedded-atom  potential  [15— 
17].  However,  in  the  case  of  metal  and  semiconductor  clusters,  phenomenological 
potentials  fail  dramatically  for  covalent  and  metallic  systems  where  many-body 
interactions  play  a  crucial  role. 

The  major  difficulty  in  computer  simulations  of  metal  and  semiconductor  clusters 
is  the  description  of  the  many-body  interactions.  Several  attempts  have  been  made 
to  include  ab  initio  the  many-body  effects  through  computer  simulation.  Rothlis- 
berger  and  Andreoni  [13]  have  reported  the  simulated  annealing  MD  method  of 
Car  and  Parrinello  [12]  for  alkali  metal  clusters  using  the  density-functional  ap¬ 
proach.  The  approach  attempts  to  include  the  many-body  interactions  in  terms  of 
the  Local  Density  Approximation  (lda).  Hartke  and  Carter  [10]  have  implemented 
an  ab  initio  md  simulation  by  employing  the  Generalized  Valence  Bond  (GVB) 
method.  The  md  simulated  annealing  is  performed  with  first-principles  forces  de¬ 
rived  from  the  GVB  wave  function  to  locate  the  global  potential  minimum.  Recently 
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Hammes-Schiffer  and  Andersen  [11]  have  reported  a  simulated  annealing  MD  study 
of  small  lithium  clusters  of  up  to  five  atoms  by  using  the  general  Hartree-Fock 
(ghf)  theory /STO-3G  basis  set.  These  methods  are  designed  to  solve  the  many- 
body  simulation  by  evolving  the  system  with  time. 

In  a  recent  study  [6],  we  have  implemented  an  algorithm  for  MC  simulated  an¬ 
nealing  based  on  an  ab  initio  correlated  electronic  structure  theory.  In  our  first- 
principles  MC  simulated  annealing  procedure,  we  describe  the  many-body  inter¬ 
actions  in  metal  and  semiconductor  clusters  in  terms  of  an  ab  initio  correlated 
method  at  the  level  of  second-order  Moller-Plesset  perturbation  theory  (mp2).  We 
have  chosen  the  mp2  level  of  theory  because,  in  a  number  of  studies,  ab  initio 
calculations  at  the  mp2  level  have  proven  to  be  the  most  efficient  correlated  method 
to  obtain  correct  geometrical  structures.  The  first-principles  MC  procedure  involves 
performing  MC  on  a  system  of  nuclei  without  prior  knowledge  of  the  quantum 
mechanical  potential  energy  surface.  The  MC  annealing  procedure  allows  for  an 
efficient  sampling  of  the  global  minimum,  and  thus,  is  ideally  suited  for  locating 
energy  global  minimum  in  metal  and  semiconductor  clusters  which  possess  a  large 
number  of  local  minima  separated  by  tunneling  barriers  of  varying  height.  In  contrast 
with  the  MD  methods,  the  interparticle  forces  are  not  required  in  the  MC  procedure, 
making  the  use  of  the  ab  initio  correlated  mp2  method  more  efficient  in  terms  of 
computer  time.  In  a  preliminary  study,  we  have  successfully  applied  our  ab  initio 
MC  simulated  annealing  method  to  the  geometry  optimization  of  the  lithium  clusters, 
Li„  (n  =  4,6)  and  the  hydrogenated  lithium  clusters,  Li4H2,  Li5H[6],  and  Li7H.  This 
article  outlines  our  first-principles  MC  simulated  annealing  algorithm  and  describes 
the  applications  to  Li4H2  and  Li7H.  In  the  next  section,  our  ab  initio  MC  simulated 
annealing  algorithm  is  outlined.  In  the  third  section,  the  results  of  the  applications 
to  the  lithium-hydrogen  clusters,  Li4H2  and  Li7H  are  presented. 

Methods 

In  our  first-principles  Monte  Carlo  simulated  annealing  procedure  [6],  we  employ 
the  correlated  ab  initio  method  to  approximate  the  total  energy  .Etotai  of  the  system 
as  a  sum  of  the  HF  and  MP2  energies;  Et otai  =  EHF  +  EMP2.  This  is  in  contrast  with 
the  conventional  approach  in  which  one  approximates  the  system  energy  as  a  sum 
of  model  potentials,  e.g.,  the  pair-wise  Lennard-Jones  potential  [16,17].  The  MC 
simulated  annealing  procedure  starts  at  a  given  temperature  by  calculating  the  hf 
and  mp2  energies  for  an  initial  cluster  configuration  Ro.  This  is  followed  by  making 
a  random  move  for  one  of  the  atoms  in  the  cluster  to  produce  a  new  configuration 
Knew  with  which  a  new  total  energy  i^otaiCRnew)  is  evaluated  by  computing  the  HF 
and  mp2  energies.  The  change  in  the  total  energy  of  the  cluster,  A  E,  associated 
with  the  random  move,  A  E  =  EXoXJRnew)  -  EtotJRo)  is  computed.  If  A E  is  negative, 
the  configuration  Rnew  is  accepted  as  Ro  for  the  next  random  move.  If  A E  is  positive, 
the  move  is  accepted  with  the  probability  given  by  the  Boltzmann  factor,  p  = 
exp(~A EjkT).  This  is  the  so-called  Metropolis  algorithm  [18]  which  makes  use  of 
the  energy  dependency  of  the  Boltzmann  distribution  at  a  given  temperature.  In 
the  Metropolis  algorithm,  uphill  moves  are  always  allowed  at  nonzero  temperature. 
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Consequently,  the  MC  annealing  procedure  does  not  have  the  problem  of  becoming 
trapped  in  one  of  the  local  minima  as  is  found  using  the  conventional  optimization 
techniques.  The  stepsize  ARmax ,  which  restricts  the  maximum  displacement  of  the 
atoms,  is  adjusted  in  such  a  way  that  about  50%  of  the  move  is  accepted  in  order 
to  maintain  the  detailed  balance  at  a  given  temperature.  For  a  given  temperature, 
a  number  of  Metropolis  samplings  are  performed  to  simulate  the  structural 
equilibrium. 

The  simulated  annealing  algorithm  pioneered  by  Kirkpatrick  et  al.  [14]  makes 
use  of  the  temperature  dependence  of  the  Boltzmann  distribution  [7].  The  algorithm 
is  based  on  the  “crystal  forming”  process,  i.e.,  during  the  simulation,  temperature 
is  lowered  slowly  from  a  sufficiently  high  temperature  to  a  “freezing”  temperature 
[7, 15].  However,  the  algorithm  is  not  completely  satisfactory  in  the  sense  that  there 
is  no  established  protocol  for  the  annealing  procedure  and  a  certain  number  of 
annealing  runs  are  necessary  to  evaluate  its  performance  [7,15].  Thus,  in  practice, 
the  annealing  procedure  is  repeated  several  times  to  ensure  that  the  global  minimum 
has  been  found.  The  algorithm  was  successfully  used  to  locate  the  global  minimum 
energy  conformations  of  polypeptides  and  proteins  where  a  very  large  number  of 
local  minima  exist  [7,15].  In  our  annealing  procedure,  we  adopt  the  temperature¬ 
quenching  scheme  according  to  Kirkpatrick  et  al.  [14]; 

Tnew  =  7  ?old  (0 

7  is  the  quenching  factor  that  determines  the  rate  of  temperature  drop. 

The  stepsize  ARmax  also  is  slowly  lowered  to  maintain  the  detailed  balance; 

ARmax(new)  =  7l/2ARmax(old)  (2) 

The  value  of  the  quenching  factor  y  used  in  our  study  varies  between  0.90  and 
0.99.  A  computer  program  has  been  developed  for  our  Monte  Carlo  simulated 
annealing  procedure  which  utilizes  the  HONDO  electronic  structure  program  package 
[19]  to  compute  the  hf  and  mp2  energies.  The  random  number  generation  routine 
URAND  of  Forsythe  et  al.  [20]  has  been  used.  In  each  MC  annealing  run,  we  start 
with  a  different  initial  temperature,  initial  geometry,  and  random  number  initial¬ 
ization.  In  the  annealing  process,  the  temperature  is  lowered  to  « 1  Kelvin  according 
toEq.  (1). 

In  order  to  study  the  interactions  of  hydrogen  with  lithium  clusters  and  the 
minimum-energy  structures  of  the  products,  MC  simulated  annealing  run  is  initiated 
with  a  reactant  (i.e.,  H  or  H2)  separated  at  least  3  A  apart  from  the  Li„  cluster  ( n 
—  4  or  7)  at  a  reasonably  high  temperature  (i.e.,  300-500  K).  This  procedure  is 
repeated  several  times  by  varying  the  initial  temperature,  quenching  factor,  and 
initial  configuration.  The  basis  sets  employed  are  double-zeta  plus  polarization 
(dzp)  basis  sets  reported  by  Dunning  and  Hay  [2 1  ]  for  both  H  and  Li.  The  atom¬ 
ization  energy  EA  is  calculated  according  to  Kato  et  al.  [5]; 

Ea  =  £MP2(Li„Hm)  -  nEMP2( Li)  -  mE{H)  (3) 

where  n  and  m  are  the  number  of  the  lithium  and  hydrogen  atoms  in  the  LinHw 
cluster. 
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Results  and  Discussion 

Figure  1  shows  a  typical  MC  simulated  annealing  run  that  maps  variation  of  the 
average  total  energy  of  the  H2  +  Li4  interaction  obtained  at  a  given  temperature  as 
a  function  of  temperature.  Contrary  to  the  conventional  energy  optimization  tech¬ 
niques,  the  total  energy  does  not  decrease  monotonically.  In  the  Metropolis  algo¬ 
rithm,  uphill  moves  are  always  allowed  at  nonzero  temperature:  In  a  number  of 
instances,  the  total  energy  is  seen  to  rise  due  to  the  Metropolis  sampling  scheme. 
This  particular  feature  enables  the  system  to  overcome  the  energy  barriers  of  the 
local  minima. 

A  total  of  four  MC  simulated  annealing  calculations  were  performed  on  the  H2 
+  Li4  interaction  by  varying  initial  conditions.  All  the  MC  annealing  runs  gave  the 
same  global  energy  minimum  structure  of  the  closed-shell  singlet  (!Ai,  C2v)  Li4H2. 
This  planar  structure  obtained  from  the  H2  +  Li4  interaction  is  identical  to  that 
obtained  earlier  by  Kato  et  al.  [5]  using  a  conventional  geometry  optimization 
technique.  The  minimum-energy  structure  of  the  planar  Li4H2  cluster  is  shown  in 
Figure  2.  Since  the  final  energy  of  the  system  obtained  from  a  MC  simulated  an¬ 
nealing  is  the  average  total  energy  obtained  by  the  Metropolis  sampling  scheme  at 
the  final  temperature,  the  system  energy  may  be  refined  by  a  standard  geometry 
optimization  technique:  Thus,  the  structure  and  energy  obtained  in  the  MC  simu¬ 
lations  were  refined  further  by  a  conventional  gradient  optimization  technique 
employing  a  larger  basis  set  of  Lie  and  Clementi  (i.e.,  ( 1 3s4p/7s4p)  for  Li  and  {Ss2p/ 
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5s2p)  for  H)  [22].  Table  I  shows  the  results  of  the  refinement  along  with  the  Cl 
results  reported  by  Kato  et  al.  [5]  and  Przybylski  et  al.  [4]  for  comparative  purpose. 
All  the  Li  —  Li,  Li — H,  and  H  —  H  bond  lengths  obtained  in  our  calculations  on 
Li4H2  cluster  agree  to  within  0.05  A  with  those  obtained  by  Przybylski  et  al.  who 
employed  (135,3/?/65,3/?)  basis  for  Li  and  (4s,2p/2s,2p)  basis  for  H. 

The  global  energy  minimum  structure  of  the  planar  Li4H2  cluster  is  analogous 
to  a  local  energy  minimum  structure  of  triangular-planar  (D3h)  Li6  cluster  [23]:  The 
Li4H2  structure  may  be  obtained  by  replacing  two  of  the  comer  lithium  atoms  in 
the  triangular  Li6  by  two  hydrogen  atoms.  Net  charges  on  the  atoms  obtained  by 
means  of  Mulliken.  population  analysis  are  given  in  Table  II.  The  analysis  reveals 
that  the  lithium  atom,  which  is  directly  bonded  to  the  two  hydrogen  atoms  (labeled 
Li2  in  Fig.  2),  possesses  a  large  net  positive  charge  (+0.77)  indicating  that  there  is 
a  significant  amount  of  charge  transfer  between  the  Li  and  hydrogen  atoms.  The 
hydrogen  atoms  possess  a  large  net  negative  charge  (—0.72  each)  due  to  their  large 
electronegativity.  This  difference  in  the  electronegativity  between  H  and  Li  causes 
substantial  changes  in  the  charge  distribution  as  well  as  the  stability  of  the  cluster 
isomers.  When  the  hydrogen  atoms  replace  the  lithium  atoms  in  the  pure  lithium 
cluster,  the  global  energy  minimum  structure  of  Li4H2  no  longer  resembles  the 
global  minimum  structure  of  Li6  cluster,  but  is  analogous  to  one  of  the  local  energy 
minimum  structures  of  Li6 . 

A  total  of  five  MC  simulated  annealing  calculations  were  performed  on  the  H  + 
Li7  interaction  by  varying  initial  conditions.  The  mc  simulated  annealing  study 
revealed  the  existence  of  two  low-lying  structural  isomers  of  closed-shell  singlet 
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Table  I.  Structural  parameters  and  energies  of  Li^ . 


Previous  work 


Present  work 

Ref.  [4] 

Ref.  [5] 

Bond  length  (A) 

R(Lil-Li3) 

3.12 

3.16 

3.14 

R(Li2-Li3) 

2.87 

2.89 

2.95 

R(Li3-Li4) 

3.07 

3.12 

3.22 

R(Li3-Hl) 

1.72 

1.73 

1.76 

R(Li2-H2) 

1.72 

1.72 

1.76 

R(H1-H2) 

3.16 

— 

3.22 

Total  energy 

(E)  (a.u .)* 

-31.00325 

— 

— 

£oPt  (a.u.)b 

-31.15790 

-31.10060 

— 

Ea  (kcal/mol)c 

147.6 

— 

115.7 

a  Average  total  energy  obtained  by  MC  simulated  annealing. 
b  Total  energy  obtained  after  refinement  using  Lie-Clementi 
basis  set. 

c  Atomization  energy. 


Li7H.  Figure  3  displays  the  structure  of  the  two  isomers  (labeled  A  and  B).  Four 
out  of  a  total  of  five  MC  annealing  runs  gave  the  structure  A.  The  fifth  run  gave 
the  structure  B  as  a  local  minimum.  In  the  global  energy  minimum  structure  (isomer 
A),  the  hydrogen  atom  is  directly  bonded  to  three  lithium  atoms,  while  in  isomer 
B  the  hydrogen  atom  is  bonded  to  two  lithium  atoms  and  energetically  less  stable. 
The  structure  and  average  total  energy  obtained  in  the  MC  simulations  were  refined 
further  by  a  conventional  gradient  optimization  technique  employing  the  basis  set 
of  Lie  and  Clementi  [22].  Table  III  shows  the  results  of  the  refinement.  The  total 
energy  of  the  isomer  B  is  only  about  12  kcal/mol  above  that  of  the  isomer  A. 

The  results  support  the  argument  [24]  that  the  most  stable  adsorption  position 
is  usually  the  site  with  the  largest  coordination  number  available  on  the  cluster 


Table  II.  Net  charges  obtained  from 
Mulliken  population  analysis  on  Li4H2. 

Atom 

This  work 

Kato  et  al.  (Ref.  [5]) 

Lil 

-0.19 

-0.17 

Li2 

0.77 

0.377 

Li3 

0.43 

0.286 

Li4 

0.43 

0.286 

HI 

-0.72 

-0.389 

H2 

-0.72 

-0.389 
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B 

Figure  3.  Two  structural  isomers  of  Li7H  obtained  by  mc  simulated  annealing. 


surface.  The  global  energy  minimum  structure  of  the  hydrogenated  Li7H  cluster 
obtained  from  the  MC  annealing  of  the  Li7  +  H  interaction,  has  the  coordination 
number  of  three  for  the  hydrogen  atom.  There  is  a  topological  equivalence  between 
the  structure  of  isomer  A  and  the  Td  structure  of  Lig  [25]:  The  Li7H  may  be  obtained 
by  replacing  one  of  the  bridge-capped  lithium  atoms  in  Li8  by  a  hydrogen  atom. 
This  structure  of  Li7H  also  resembles  the  structure  of  Li6H2  reported  by  Przybylski 
et  al.  [4]. 
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Table  III.  Structural  parameters  and  energies  of  Li7H. 


Isomer  A 

Isomer  B 

Bond  length  (A) 

R(Li  1  — Li2) 

2.94 

R(Lil-Li2) 

3.13 

R(Lil-Li3) 

3.18 

R(Lil-Li3) 

3.01 

R(Lil-Li4) 

2.93 

R(Lil-Li5) 

2.73 

R(Lil-Li5) 

3.18 

R(Lil-H) 

1.70 

R(Lil-Li6) 

2.95 

R(Li2-Li3) 

3.07 

R(Lil-Li7) 

3.17 

R(Li3-Li4) 

2.97 

R(Li2-Li3) 

3.22 

R(Li3-Li5) 

3.08 

R(Li2-Li4) 

2.75 

R(Li3-Li6) 

3.04 

R(Li2-Li6) 

2.73 

R(Li3-Li7) 

2.94 

R(Li2-Li7) 

3.23 

R(Li4-Li5) 

3.21 

R(Li2-H) 

1.82 

R(Li5-Li6) 

3.02 

R(Li3-Li4) 

3.24 

R(Li5-Li7) 

3.22 

R(Li4-Li5) 

3.26 

R(Li5-H) 

1.71 

R(Li4-Li6) 

2.74 

R(Li6-Li7) 

3.08 

R(Li4-H) 

1.82 

R(Li5-Li6) 

3.19 

R(Li6-Li7) 

3.21 

R(Li6-H) 

1.82 

Total  energy 

<£>  (a.u.)’ 

-52.79072 

-52.77513 

£op,  (a.u.)b 

-53.03524 

-53.01547 

Ea  (kcal/mol)c 

106.8 

94.4 

a  Average  total  energy  obtained  by  MC  simulated  annealing. 
b  Total  energy  obtained  after  refinement  using  Lie-Clementi 
basis  set. 

c  Atomization  energy. 


Table  IV.  Net  charges  obtained  from 
Mulliken  population  analysis  on  Li?H. 


Atom 

Isomer  A 

Isomer  I 

Lil 

0.22 

0.45 

Li2 

0.34 

-0.12 

Li  3 

-0.18 

0.17 

Li4 

0.33 

-0.21 

Li5 

-0.19 

0.48 

Li6 

0.34 

-0.06 

Li7 

-0.19 

-0.01 

H 

-0.68 

-0.70 
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Table  IV  shows  the  net  charges  on  the  atoms  contained  in  Li7H,  which  are 
obtained  by  Mulliken  population  analysis  for  the  two  stable  isomers.  Two  sets  of 
equal  net  charges  on  three  of  the  lithium  atoms,  i.e.,  (Li3,  Li5,  Li7)  and  (Li2,  Li4, 
Li6),  as  well  as  the  bond  lengths  indicate  that  the  structure  of  isomer  A  possesses 
the  C3v  symmetry  with  a  C3  rotation  axis  along  the  H  —  Lil  bond.  In  both  isomers, 
the  hydrogen  atom  possesses  a  large  negative  charge. 

Acknowledgment 

This  work  has  been  supported  by  the  National  Science  Foundation  through 
EPSCoR  program. 


Bibliography 

[1]  H.  Nakatsuji,  H.  Nakai,  and  M.  Hada  in  Metal-Ligand  Interactions:  From  Atoms ,  to  Clusters,  to 
Surfaces,  Salahub  and  Russo,  Eds.  (Kluwer,  1992),  pp.  251-285;  W.  Steele,  Chem.  Rev.  93,  2355 
(1993). 

[2]  A.  Kaldor,  D.  M.  Cox,  and  M.  R.  Zakin,  Adv.  Chem.  Phys.  70,  21 1  (1988). 

[3]  M.  F.  Jarrold  and  J.  E.  Bower,  J.  Chem.  Phys.  85,  5373  (1986);  M.  F.  Jarrold  and  J.  E.  Bower,  Z. 
Phys.  D  12,  551  (1989);  M.  D.  Morse,  J.  B.  Hopkins,  P.  R.  R.  Langridge-Smith,  and  R.  E.  Smalley, 

J.  Chem.  Phys.  79,  5316  (1983). 

[4]  K.  Przybylski,  J.  Koutecky,  V.  Bonacic-Koutecky,  P.  von  R.  Schleyer,  and  M.  F.  Guest,  J.  Chem. 
Phys.  94,  5533  (1991). 

[5]  H.  Kato,  K.  Hirao,  I.  Nishida,  K.  Kimoto,  and  K.  Akagi,  J.  Phys.  Chem.  85,  3391  (1981);  H.  Kato, 

K.  Hirao,  and  K.  Akagi,  Inorg.  Chem.  20,  3659  (1981). 

[6]  V.  Keshari  and  Y.  Ishikawa,  Chem.  Phys.  Lett.  218,  406  (1994). 

[7]  S.  R.  Wilson,  W.  Cui,  J.  W.  Moskowitz,  and  K.  E.  Schmidt,  Tetrahedron  Lett.  29,  4373  (1988);  J. 
Moskowitz,  K.  E.  Schmidt,  S.  R.  Wilson,  and  W.  Cui,  Int.  J.  Quantum  Chem.,  Quantum  Chem. 
Symp.  22, 61 1  (1988);  S.  R.  Wilson,  W.  Cui,  J.  W.  Moskowitz,  and  K.  E.  Schmidt,  J.  Comp.  Chem. 
12,  342  (1991). 

[8]  “Calc/Conf”  in  Chem-X,  Chemical  Design  Ltd.,  Oxford,  England;  Macromodel,  Copyright  1986, 
W.  C.  Still,  Columbia  Univ. 

[9]  U.  Nagashima,  K.  Morokuma,  and  H.  Tanaka,  J.  Phys.  Chem.  96,  4294  (1992). 

[10]  B.  Hartke  and  E.  A.  Carter,  J.  Chem.  Phys.  97,  6569,  (1992). 

[11]  S.  Hammes-Schiffer  and  H.  C.  Andersen,  J.  Chem.  Phys.  99,  523  (1993). 

[12]  R.  Car  and  M.  Parrinello,  Phys.  Rev.  Lett.  55,  2471  (1985);  E.  S.  Fois,  A.  Selloni,  M.  Parrinello, 
and  R.  Car,  J.  Phys.  Chem.  92,  3268  (1988). 

[13]  U.  Rothlisberger  and  W.  Andreoni,  J.  Chem.  Phys.  94,  8129  (1991). 

[14]  S.  Kirkpatrick,  C.  D.  Gelatt,  Jr.,  and  M.  P.  Vecchi,  Science  220,  671  (1983). 

[15]  H.  Kawai,  T.  Kikuchi,  and  Y.  Okamoto,  Protein  Eng.  3,  85  (1989);  Y.  Okamoto,  M.  Fukugita,  T. 
Nakazawa,  and  H.  Kawai,  Protein  Eng.  4,  639  (1991);  Y.  Okamoto,  T.  Kikuchi,  and  H.  Kawai, 
Chem.  Lett.  1275  (1992). 

[16]  D.  G.  Vlachos,  L.  D.  Schmidt,  and  R.  Aris,  J.  Chem.  Phys.  96,  6880  (1992). 

[17]  D.  D.  Frantz,  D.  L.  Freeman,  and  J.  D.  Doll,  J.  Chem.  Phys.  93,  2769  (1990). 

[18]  N.  Metropolis,  A.  W.  Rosenbluth,  M.  N.  Rosenbluth,  A.  H.  Teller,  and  E.  Teller,  J.  Chem.  Phys. 
21,  1087  (1953). 

[19]  HONDO;  QCPE  Program  #544,  Univ.  of  Indiana,  Bloomington,  IN. 

[20]  G.  E.  Forsythe,  M.  A.  Malcolm,  and  C.  B.  Moler,  Computer  Methods  for  Mathematical  Computations 
(Prentice-Hall,  Englewood  Cliffs,  NJ,  1977),  p.  246. 

[21]  T.  H.  Dunning,  Jr.  and  P.  J.  Hay  in  Modern  Theoretical  Chemistry,  Vol.  3,  H.  F.  Schaefer  III,  Ed. 
(Plenum,  New  York,  1977). 

[22]  G.  C.  Lie  and  E.  Clementi,  J.  Chem.  Phys.  60,  1275  (1974). 


HYDROGENATED  LITHIUM  CLUSTERS 


551 


[23]  I.  Boustani,  W.  Pewestorf,  P.  Fantucci,  V.  Bonacic-Koutecky,  and  J.  Koutecky,  Phys.  Rev.  B.  35, 
9437  (1987);  J.  Koutecky,  I.  Boustani,  and  V.  Bonacic-Koutecky,  Int.  J.  Quantum  Chem.  38,  149 
(1991). 

[24]  D.  Spanjaard  and  M.  C.  Desjonqueres,  in  Interaction  of  Atoms  and  Molecules  with  Solid  Surface, 
Physics  of  Solid  and  Liquid,  V.  Bortolani,  N.  H.  March,  and  M.  P.  Tosi,  Eds.  (Plenum  Press,  New 
York,  1990). 

[25]  V.  Bonacic-Koutecky,  P.  Fantucci,  and  J.  Koutecky,  Chem.  Rev.  91,  1035  (1991). 


Received  March  18,  1994 


On  the  Interaction  of  Cyanate  and  Thiocyanate 
Anions  with  Li+  and  Mg2+ 

MICHAEL  PROBST 

Institute  of  General  Inorganic  and  Theoretical  Chemistry,  Innsbruck  University, 
6020  Innsbruck,  Austria 


Abstract 

The  energy  surfaces  of  Mn+/SCN"  and  Mn+/OCN-  (M  =  Li+  and  Mg2+)  ion  pairs  have  been  calculated 
at  the  Hartree-Fock  and  mp2  levels  of  theory.  The  electrostatic  potential  and  the  actual  binding  energies 
are  compared.  Besides  linear  ion  pairs,  nonlinear  ones  are  also  found  and  are  in  some  cases  the  most 
stable  ones.  The  electrostatic  potential  and  the  actual  binding  to  cations  are  compared.  ©  1994  John  Wiley 
&  Sons,  Inc. 


Introduction 

Cyanate  and  thiocyanate  anions  are  interesting  from  the  viewpoint  of  coordination 
chemistry  because  they  are  small,  linear  molecules  and  represent  most  simple  types 
of  anions  with  more  than  one  coordination  site.  In  general  it  is  assumed  that  each 
end  represents  a  possible  coordination  site. 

Solutions  containing  ion  pairs  of  Zn2+,  Cd2+,  and  Hg2+  with  SCN  have  recently 
been  studied  by  Ohtaki  by  means  of  x-ray  diffraction  [1]  and  both  S-site  and  N- 
site  coordination  has  been  found.  Also,  there  is  indication  that  in  aqueous  solu¬ 
tions  of  the  corresponding  salts  ion  pair  formation  occurs  under  various  circum¬ 
stances  [2]. 

Despite  SCN"  and  OCN-  being  quite  common  anions,  they  have  been  subjected 
to  few  theoretical  investigations  so  far.  After  this  work  was  finished  an  article  by 
Parrini  and  Morales  [3]  came  to  our  attention.  Those  authors  studied  Li+/SCN- 
with  Hartree-Fock  calculations  and  smaller  basis  sets  (up  to  4-3 1G*).  Their  findings 
are  similar  to  this  article  but  they  do  not  mention  the  stable  linear  complex 
Li+ .  .  .  NCS-.  A  recent  study  of  the  cyanate  anion  [4]  discusses  geometries  in  the 
ground  and  excited  states. 

Therefore  we  have  selected  Li+  and  Mg2+  as  examples  of  mono-  and  divalent 
counterions  and  study  the  structure  of  the  ion  pairs  and  the  influence  of  ion  pair 
formation  on  the  intramolecular  SCN-  and  OCN-  geometry. 

After  mentioning  the  details  of  the  calculations,  we  present  the  results  for 
the  systems  Li+/SCN-  and  Li+/OCN-,  and  the  differences  if  Li  is  substituted 
with  Mg2+. 
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Details  of  the  Calculations 

Optimized  geometries  and  binding  energies  for  the  various  complexes  have  been 
calculated.  All  calculations  were  performed  with  the  6-3 1  lG(d,p)  basis  set  [5].  With 
this  rather  large  basis,  inadequacies  due  to  the  basis  set  limitations  can  be  assumed 
to  be  small  for  the  systems  under  study  here.  Therefore,  no  counterpoise  corrections 
or  similar  treatment  was  performed.  All  complexes  were  calculated  within  the  mp2 
approximation  except  the  ones  where  we  included  water  molecules  to  model  sol¬ 
vation  effects.  For  those,  only  Hartree-Fock  calculations  could  be  afforded.  Several 
test  calculations  were  performed  at  the  mp4  level  and  most  of  the  geometry  opti¬ 
mizations  were  performed  at  the  Hartree-Fock  level,  too. 

Interaction  Energies  in  Li+/SCN~ 

The  interaction  energies  of  the  ion  pairs  Li+  •  *  •  SCN"  and  Li+  •  •  •  NCS~  at  the 
optimized  geometries  are  summarized  in  Table  I.  The  Hartree-Fock  binding  energies 
are  slightly  lower  than  the  corresponding  mp2  values.  The  mp2  values  given  in 
Table  I  correlate  only  the  valence  electrons.  Test  calculations  showed  that  if  all 
electrons  are  correlated,  the  corresponding  MP2a]i  energies  are  again  slightly  lower 
and  are  about  8%  lower  than  the  Hartree-Fock  values.  Differences  between  mp2 
and  mp4  binding  energies  were  less  than  1  kcal/mol  for  both  Li+  •  •  •  SCN-  and 
Li+  •  •  •  NCS-.  In  the  following,  only  the  mp2  energies  are  discussed  unless  men¬ 
tioned  otherwise. 

Coordination  of  Li+  to  the  N-site  is  preferred  over  coordination  to  the  S-site  by 
22  kcal/mol.  However,  a  calculation  of  the  force  constants  at  the  equilibrium  ge- 


Table  I.  Interaction  energies  in  the  various  systems. 


SCN” 

OCN- 

Li+ 

Mg2+ 

Li 

i+ 

Mg2 

:+ 

Anion  Cation  Method 

HF 

mp2 

HF 

mp2 

HF 

mp2 

HF 

mp2 

S-  or  O-site  coordination 

-115 

-120 

-263 

-272 

-149 

-150 

-305 

-307 

N-site  coordination 

-139 

-143 

-306 

— 3 1 5a 

-161 

-162 

-337 

-339 

“T-shaped”  coordination 

-143 

-315 

Complexes  with  three  H20 

molecules.b 

S-  or  O-site  coordination 

-177 

-411 

-201 

-455 

N-site  coordination 

-194 

-447 

-209 

-475 

“T-shaped”  coordination 

-199 

-440 

-213 

-464 

The  interaction  energies  are  given  in  kcal/mol. 

a  The  optimal  nonlinear  conformation  is  0. 1  kcal/mol  lower  in  energy  than  the  linear  one  (-3 1 5.3  vs. 
-315.2  kcal/mol). 

bThe  energies  include  the  binding  energy  of  the  water  molecules  to  the  complexes:  A E  = 
T((H20)3AT+XCN-)  -  £(AP+)  -  £(XCN-)  -  3£(H20). 
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ometry  showed  that  the  Li+  •  •  *  SCN-  complex  is  not  a  stable  local  minimum 
whereas  the  Li+  •  •  •  NCS-  complex  is.  The  energy  hypersurface  of  the  system  shows 
a  second  stable  minimum  at  a  conformation  where  Li+  is  coordinated  above  the 
carbon  forming  a  T-shaped  complex  (denoted  as  S — C(Li) — N)  which  is  shown 
in  Figure  1  (top).  This  complexation  changes  the  S  —  C  N  angle  from  180.0°  to 
162.6°  with  N  and  S  being  bent  towards  the  cation.  The  Li  —  S  C  N  dihedral 
angle  is  0°,  that  is,  the  complex  is  planar.  The  relative  stability  of  this  local  minimum 
is  -143  kcal/mol,  making  it  practically  identical  in  stability  to  the  linear 
Li+*  •  •  NCS-  conformation.  In  the  ongoing  discussion,  however,  we  do  not  neglect 
the  Li+  •  •  •  SCN-  configuration  because  in  some  environments  it  might  still  become 
a  stable  complex  or  at  least  a  real  minimum  of  the  potential  surface. 

In  order  to  be  able  to  draw  conclusions  more  relevant  to  aqueous  salt  solutions 
we  performed  calculations  on  the  complexes  (H20)3Li+  •  •  •  SCN  and 
(H20)3Li+  •  •  •  NCS-  (Fig.  5).  The  complexes  might  be  used  as  models  of  hydrated 
Li+  cations  with  one  hydration  shell  water  molecule  substituted  by  the  SCN-  anion. 
It  is  known  from  molecular  dynamics  experiments  that  in  dilute  aqueous  solutions 
Li+  prefers  octahedral  coordination  to  water  [6]  but  we  are  mainly  interested  in 
knowing  whether  the  water  molecules  have  any  qualitative  effect  on  the  order  of 
the  binding  energies  and  geometries  of  the  ion  pairs  and  therefore  we  studied  the 
computationally  less  demanding  “pseudotetrahedral”  coordination.  The  geometries 
of  the  complexes  with  three  water  molecules  were  optimized  at  the  Hartree-Fock 
level.  We  neglected  further  hydration  at  the  site  of  the  anion  because  it  is  energetically 
less  important  that  hydration  of  the  cation.  The  order  of  stability  for  the  linear 
molecules  stays  the  same  as  for  the  unhydrated  cations.  Li+  •  •  •  NCS-  is  17  kcal/ 
mol  more  stable  than  Li+  •  •  •  SCN-.  Here,  too,  the  latter  is  not  a  stable  conformation 
but  a  saddle  point  only.  For  the  T-shaped  complex  with  three  molecules  of  water 
around  the  Li+,  S  —  C(Li(H20)3)  —  N,  we  could  not  perform  a  completely  con¬ 
verged  geometry  optimization  due  to  the  very  shallow  potential  surface.  Nevertheless, 
the  binding  energy  after  100  iterations  was  almost  constant  and  5  kcal/mol  lower 
than  in  the  linear  coordinated  (H20)3Li+  •  •  •  NCS-.  No  indication  of  a  change  back 
to  a  linear  coordination  was  found.  Therefore  the  T-shaped  complex  could  well  be 
present  in  aqueous  solution. 

The  unexpected  T-shaped  nonlinear  arrangement  of  Li+  and  SCN-  was  inves¬ 
tigated  further.  A  map  of  the  electrostatic  potential  of  SCN-  (Fig.  2)  shows  indeed 
a  local  minimum  besides  the  sulfur  atom  away  from  the  molecular  axis.  In  the 
optimized  nonlinear  S  —  C(Li) — N  structure  described  above,  however,  Li+  is 
closer  to  the  carbon  atom  than  to  the  sulfur  atom  (making  the  complex  not  L- 
but  T-shaped).  The  electrostatic  potential  maps  show  the  mp2  potential.  Maps 
with  the  Hartree-Fock  potential  were  virtually  identical.  Since  the  electrostatic 
potential  measures  the  interaction  of  the  molecule  with  a  proton  in  the  limit  of 
negligible  polarization,  we  investigated  if  mainly  electronic  polarization  of  the  SCN- 
anion  or  the  bending  of  S — C — N  (which  can  be  regarded  as  geometric  “polar¬ 
ization”)  are  responsible  for  the  difference  between  electrostatic  potential  minima 
and  actual  minimum  energy  structure.  A  calculation  of  the  T-shaped  complex  with 
a  rigid  linear  SCN-  gave  nearly  the  same  binding  energy  (-140  kcal/mol)  as  the 
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Figure  2.  Contour  maps  of  the  MP2/6-31  lG(d,p)  electrostatic  potential  of  SCN  (left) 
and  OCN“  (right)  in  the  molecular  plane.  Dashed  lines  denote  negative  and  full  lines 
positive  values.  The  contour  lines  are  labeled  in  atomic  units. 


one  where  all  geometric  degrees  of  freedom  were  relaxed  (—142  kcal/mol).  If  instead 
of  the  electrostatic  potential  the  actual  interaction  between  Li+  and  S CN~  as  a 
function  of  the  position  of  Li+  is  calculated  (upper  left  plot  of  Fig.  3),  the  minimum 
near  the  carbon  atom  can  be  found.  Additionally,  the  contour  plot  shows  that  the 
“valley”  with  a  depth  of  - 1 30  to  - 1 40  kcal/mol  reaches  around  most  of  the  anion 
except  the  vicinity  of  the  sulfur  atom  where  there  is  no  indication  of  a  binding 
site.  It  should  be  mentioned  that  a  Mulliken  population  analysis  of  SCN"  gives 
an  equal  distribution  of  the  electronic  excess  charge  between  S  and  N  (S:  -0:51, 
C:  0.05,  N:  -0.54). 

Geometrical  Changes  upon  Complex  Formation 

Complexation  at  the  S-site  causes  both  the  S  C  and  the  C  N  bond  to  be 
shortened  by  about  0.007  A,  a  quite  small  geometrical  change.  Complexation  at 
the  N-site  shortens  the  S — C  bond  by  0.064  A  whereas  the  C  —  N  bond  gets  longer 
by  0.011  A.  The  same  trend  is  found  for  the  (H20)3Li*  •  *SCN  and 
(H20)3Li  •  •  *  NCS  complexes  with  slightly  smaller  changes  in  the  bond  length.  In 
the  T-shaped  complex  the  S  —  C  bond  gets  shorter  by  0.0 14  A  and  the  C  —  N  bond 
becomes  longer  by  0.008  A.  Whereas  for  the  two  linear  configurations  mp2  and 
mp4  calculations  give  nearly  identical  results,  a  mp4  calculation  of  the  T-shaped 
complex  gives  an  elongation  of  0.005  A  for  the  S  C  distance  and  no  change  of 
the  C  —  N  bond,  indicating  that  such  small  changes  are  beyond  the  limits  of  our 
method.  We  can  conclude  that  for  Li+  •  •  *  SCN"  and  S  C(Li)  N  the  internal 
geometry  of  SCN"  hardly  changes  whereas  for  Li+  •  •  •  NCS"  it  does  change  signif¬ 
icantly.  Table  II  summarizes  the  geometrical  parameters  of  the  complexes. 
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-4-2  0  2  4  -4-2  0  2  4 

Figure  3.  Isoenergy  contour  maps  of  the  interaction  energy  between  Li+  (left  plots) 
and  Mg2+  (right  plots)  moving  in  a  plane  containing  SCN-  (upper  plots)  and  OCN- 
(lower  plots).  mp2  energies  calculated  with  the  6-3 1 1  G(d,p)  basis  set.  The  values  given  are 

in  kcal/mol. 


Energies  and  Geometries  in  Li+/OCN 

The  cyanate  anion,  OCN-  is  normally  found  to  behave  similarly  to  SCN-.  We 
performed  the  same  type  of  calculations  as  described  above  for  SCN-  in  order  to 
find  out  which  differences  can  be  found  in  the  binding  of  cations  to  OCN-.  The 
energies  are  summarized  in  Table  I  and  the  structures  of  Li+*  *  -  OCN-  and 
Li+  •  •  •  NCO-  are  shown  in  Figure  4. 

Again  the  binding  energies  on  Hartree-Fock  and  mp2  levels  are  very  close.  Binding 
to  the  N-site  is  preferred  by  12  kcal/mol.  For  the  complex  with  three  water  molecules 
this  value  decreases  to  8  kcal/mol.  In  contrast  to  the  thiocyanates,  binding  to  the 
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Table  II.  Geometric  parameters  in  the  complexes.  Values  calculated  with  mp2/6-3  1  lG(d,p).  Distances 

in  A  and  angles  in  degrees. 


KS-C) 
or  r(0 — C) 

r(C — N) 

r(M — S) 
or  r(M — N) 

<(S— C— N) 
or  <(0— C— N) 

SCN" 

1.6690 

1.1913 

180 

OCN-' 

1.2322 

1.2046 

180 

Li-  •  -SCN 

1.6625 

1.1841 

2.0684 

180 

Li-  •  -NCS 

1.6050 

1.2025 

1.7699 

180 

T-shaped  S— C(Li)N 

1.6866 

1.1970 

2.2907 

163 

Mg-  •  -  SCN 

1.6614 

1.1828 

2.1346 

180 

Mg-  •  -NCS 

1.5575 

1.2280 

1.8439 

177 

T-shaped  S— C(Mg)N 

1.6868 

1.2004 

2.3959 

156 

Li-  •  -OCN 

1.2600 

1.1846 

1.6476 

180 

Li-  •  -NCO 

1.1925 

1.2101 

1.7403 

180 

Mg-  •  -OCN 

1.2864 

1.1778 

1.7676 

180 

Mg-  •  -NCO 

1.1670 

1.2220 

1.8203 

180 

O-site  results  here  in  a  stable  minimum  as  was  found  by  inspecting  the  eigenvalues 
of  the  force  constant  matrix.  A  contour  map  of  the  interaction  energy  between  Li+ 
and  OCN-  as  a  function  of  the  position  of  Li+  (lower  left  plot  of  Fig.  3)  shows  that 
for  this  ion  pair  no  T-shaped  complex  exists  as  a  stable  minimum.  However,  from 
this  figure  it  appears  that  a  slightly  nonlinear  Li  —  O  —  C  angle  would  be  preferred. 
This  is  only  true,  however,  for  the  map  of  the  interaction  energies  where  the  equi¬ 
librium  geometry  of  OCN-  is  kept  frozen  as  the  cation  samples  the  surface.  If  the 
internal  OCN-  geometry  is  relaxed,  too,  the  energy  minima  are  found  in  linear 
conformations  only. 

The  map  of  the  electrostatic  potential  of  OCN-  (right  plot  of  Fig.  2)  again  shows 
a  minimum  near  O  at  angle  of  about  45°  from  the  C — O  axis  and  the  linear 
configuration  is  only  a  saddle  point.  This  minimum,  however,  is  less  pronounced 
than  the  corresponding  one  for  SCN-.  The  difference  between  the  electrostatic 
potential  and  the  actual  binding  energy  can  be  explained  by  the  fact  that  the  energy 
gain  by  polarizing  the  anion  is  mostly  possible  in  the  direction  of  its  molecular 
axis,  therefore  changing  the  kind  of  “lone  pair”  orientation  predicted  by  the  elec¬ 
trostatic  potential. 

The  geometrical  parameters  of  the  cyanates  are  shown  in  Table  II.  Complexation 
to  the  oxygen  causes  an  elongation  of  the  O  —  C  bond  by  0.028  A  whereas  the 
C — N  bond  is  shortened  by  0.020  A.  Complexation  of  the  N-site  changes  the 
intramolecular  distances  in  the  opposite  way:  r0-c  gets  smaller  by  0.040  A  and 
rc- N  increases  by  0.006  A.  The  pattern  of  the  bond  length  changes  in  Li  •  •  •  NCO 
is  therefore  the  same  as  for  the  corresponding  thiocyanate. 

The  main  difference  between  lithium  thiocyanate  and  lithium  cyanate  ion  pairs 
is  that  the  first  one  is  able  to  form  a  nonlinear  complex  that  equals  in  stability  the 
linear  Li+  •  •  •  NCS-  complex  and  that  linear  Li+  •  •  •  SCN-  is  not  a  minimum  on 
the  potential  hypersurface.  In  case  of  lithium  cyanate  the  two  linear  complexes  are 
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Figure  5.  The  [(H20)3MgSCN]+  complex  with  the  stable  T-shaped  (left)  and  the  unstable  linear  coordination  (right)  of  the  metal 
cation.  The  figures  show  the  hf/6-31  lG(d,p)  optimized  geometries. 
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the  only  stable  minima  of  the  potential  surface.  The  binding  of  Li+  cation  to  OCN 
is  about  20  kcal/mol  more  stable  than  the  binding  to  SCN-. 


The  Complexation  of  SCN  with  Mg2+ 

In  order  to  study  the  interaction  of  a  doubly  charged  cation  with  thiocyanate 
and  cyanate  anion,  we  performed  calculations  on  the  systems  [Mg/SCN]+  and 
[Mg/OCN]+. 

For  [Mg/SCN]+,  a  similar  situation  as  for  Li/SCN  is  found  concerning  com¬ 
plexation  at  the  S-site  and  the  formation  of  a  T-shaped  complex:  An  approach  of 
the  cation  at  the  S-site  does  not  result  in  a  stable  linear  Mg2+  •  •  •  SCN-  cationic 
complex.  In  contrast  to  Li/SCN,  however,  linear  Mg2+  •  •  •  NCS-  is  only  a  saddle 
point  too  and  the  energy  minimum  with  Mg2+  at  the  N-site  occurs  at  a  nonlinear 
conformation  with  a  C — N — Mg  angle  of  154°.  Also  the  optimal  S — C — N 
angle  deviates  slightly  from  linearity  (177°).  The  complexes  are  shown  in  Figure  1 
(lower  part).  The  potential  surface  is  quite  flat  so  that  linear  Mg2+  •  •  •  NCS-  has 
practically  the  same  energy  as  the  true  minimum  structure. 

Again,  both  N-site  and  T-shaped  complexes  have  practically  identical  binding 
energies.  This  can  be  seen  also  from  the  map  of  the  interaction  energy  (upper  right 
plot  of  Fig.  5).  The  same  pattern  as  for  the  lithium  coordination  with  slightly  larger 
changes  of  the  bond  length  can  be  found.  Both  S  —  C  and  C  —  N  bond  length  are 
increased  by  about  0.01  A  for  the  T-shaped  complex  whereas  complexation  on  the 
N-site  increases  the  C  —  N  bond  by  0.04  A  and  makes  the  S — C  bond  shorter  by 
0.11  A. 


The  Complexation  of  OCN  with  Mg2+ 

For  this  ion  pair,  both  O-site  and  N-site  coordinated  complexes  are  the  only  local 
minima  of  the  energy  hypersurface.  Mg2+  •  •  •  NCO-  is  more  stable  by  about  32 
kcal/mol.  This  is  a  much  bigger  difference  than  for  Li+.  The  potential  energy  surface 
is  extremely  shallow  with  respect  to  deviations  from  linearity.  The  energy  minimum 
at  the  oxygen  site  is  a  bent  complex  (Fig.  4)  with  a  C — O  —  Mg  angle  of  160° 
whereas  linear  Mg2+  •  *  •  NCO-  is  found  to  be  stable  by  inspection  of  the  force 
constant  matrix. 

For  changes  of  the  internal  geometry  of  the  cation,  the  same  alternating  pattern 
can  be  found  as  for  the  other  linear  complexes.  Coordination  of  the  cation  leads 
to  an  elongation  of  the  bond  at  the  coordination  site  by  0.05  A  (O  —  C)  and  0.03 
A  (C — N)  whereas  the  other  bond  in  the  anion  becomes  shorter  by  0.01  A  and 
0.07  A,  respectively. 


Summary 

It  was  found  that  the  complexes  which  can  be  made  up  of  the  four  possible 
combinations  of  the  cations  Li+  and  Mg2+  and  the  anions  SCN-  and  OCN-  show 
remarkable  differences.  For  Li+/OCN-  and  Mg2+/OCN-,  linear  or  slightly  bent 
complexes  are  stable  minima  of  the  energy  surface,  with  N-site  coordination  sig- 
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nificantly  preferred  by  10  (Li+)  and  33  (Mg2+)  kcal/mol.  For  Li+/SCN"  the  linear 
complex  Li+*  •  •  NCS  and  a  T-shaped  complex  S — C(li) — N  have  very  similar 
stability.  Linear  coordination  to  the  S-site  does  not  result  in  a  minimum.  The  latter 
is  also  true  for  [Mg/SCN]+.  For  this  complex  coordination  at  the  N-site  results  in 
a  nonlinear  complex.  Hydration  of  the  complexes  by  three  molecules  of  water 
decreases  the  energetic  differences  and  intramolecular  geometric  changes  by  20- 
50%  but  did  not  qualitatively  change  the  above  observations.  These  calculations 
might  therefore  be  relevant  to  structural  questions  in  solution  chemistry  where 
normally  only  linear  complexes  [1]  and,  for  the  “hard”  cations  Li+  and  Mg2+,  N- 
site  coordination  is  considered.  Additionally,  the  results  show  that  pure  electrostatic 
calculations  would  give  wrong  geometries  of  the  complexes. 
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Abstract 

SC-meh-mo  calculations  with  electron  repulsion  correlation  and  configuration  interaction  included, 
have  been  carried  out  on  Sm(Cp*)2  in  both  its  linear  and  bent  geometrical  forms.  Except  for  relatively 
small  factors,  there  are  virtually  no  major  differences  in  the  electronic  structures  of  the  two  comfomers. 
In  both  cases,  the  bonding  is  found  to  be  very  close  to  being  nearly  totally  ionic.  An  estimation  of  the 
ionic  bond  energy  is  made  and  discussed.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

A  number  of  fc(pentamethylcyclopentadienyl)  compounds  having  the  general 
formula  M(Cp*)2 ,  where  Cp*  is  the  pentamethylcyclopentadienyl  ligand,  usually 
expressed  in  its  nonradical,  anion  form,  (Cp*)-.  It  is  generally  accepted  that  these 
systems  fall  into  two  structural  classifications;  those  that  have  linear  (parallel)  and 
bent  Cp*  dispositions  relative  to  the  Cp*-M-Cp*  centroid  angles.  If  M  =  Ge(II), 
Sn(II),  or  Ph(II),  the  bent  conformation  is  observed  in  both  the  solid  and  gas  phases 
[1].  This  is  believed  to  be  a  consequence  of  the  nonbonding  electron  pair  on  the 
metal  interacting  with  the  electron  II-system  of  the  Cp*  rings.  However,  when 
M  =  Ca,  Sr,  or  Ba,  the  structures  are  also  bent;  but  if  M  =  Mg,  it  is  linear  in  both 
solid  and  gas  phases  [2].  Since  there  are  no  nonbonding  electron  pairs  on  these 
metals  to  interact  with  the  Cp*  rings,  bent  structures  would  not  be  anticipated  in 
these  latter  cases.  Conversely,  if  M  is  a  transition  metal  having  d  valence  electrons, 
the  structures  appear  to  be  invariably  linear;  but  large-scale  ab  initio  LCAO-SCF 
calculations  have  not  been  able  to  reproduce  accurate  M-Cp*  equilibrium  bond 
distances,  no  matter  how  extensive  the  basis  sets  used  [3].  This  failure  is  apparently 
due  to  the  lack  of  d-electron  correlation  in  these  calculations.  It  is  equally  interesting 
that  for  M  =  Yb(II),  the  structure  is  also  bent  in  both  solid  and  gas  phases  [4].  If 
M  =  Sm(II)  or  Eu(II),  the  structures  are  bent  in  the  solid  phases  [5].  Although  the 
gas  phase  structures  have  not  been  reported,  other  studies  imply  that  these  would 
be  bent  also  [6]. 

The  only  electronic  structure  calculations  reported  on  Ln(Cp*)2  complex  thus 
far  are  the  Extended  Huckel  (EH-MO)  calculations  on  Sm(Cp*)2  and  Lu(Cp*)2  [7], 
and  the  sw-Xa  computation  on  Yb(Cp*)2  [8]. 
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The  recent  work  of  Hollis,  Burdett,  and  Bosnich  suggests  that  interligand  van  I 

der  Waals  contact  interactions  are  largely  responsible  for  the  bent  vs.  linear  geo-  j 

metrical  conformations  of  M(Cp*)2  systems  [6].  Thus  it  is  presumed  that  electronic  : 

factors  play  little  or  no  role  in  determining  the  equilibrium  geometrical  disposition  j 

of  these  complexes. 

The  objective  of  this  study  is  to  examine  electronic  structures  and  bonding  in 
the  lanthanide  M(Cp*)2  complexes.  The  results  obtained  thus  far,  specifically  for 
Sm(Cp*)2 ,  is  the  subject  of  this  report. 


Method  of  Calculation 

The  approximate  quasirelativistic  sc-meh-mo  method  which  has  been  previously 
applied  with  considerable  success  in  a  number  of  other  instances,  has  been  utilized 
in  this  study  as  well  [9].  The  pertinent  STO  wave  functions  and  orbital  energy 
parameters  were  computed  from  a  quasirelativistic  HF-SCF  atomic  program  [10]. 
The  Sm  AO  configurations  included  a  frozen  core  plus  4ffl5d66sc6p^  valence  states 
for  various  q  charges  of  Sm9,  0  <  #  <  4,  3  <  a  <  6,  0  <  &  <  1,  0  <  c  <  2,  0  <  d  < 
1 .  The  Cp*  was  calculated  as  a  full  Ci0Hi5  molecule  in  the  - 1 ,  0,  + 1  charged  states. 
These  results  were  fitted  into  the  charge-dependent,  orbital  energy  equations  required 
for  the  calculations. 

One  of  the  major  difficulties  with  computations  on  lanthanides  by  this  technique 
is  that  the  charge  and  configuration  dependent  hamiltonian  elements  for  the  4f  and 
5d  levels  generate  MOs  that  are  energetically  more  stable  than  any  other  levels. 
Consequently,  the  Sm  configurations  become  4f14  and  5d9  rather  than  4f65d°  or 
4f55d]  appropriate  for  Sm2+  in  the  Sm(Cp*)2 .  The  only  way  to  compensate  for  this 
problem  is  to  run  the  calculation  initially  with  only  the  Sm  valence  6s2  level,  omitting 
the  4f,  5d,  and  6p  orbitals.  In  this  way,  reasonable  atom  charges  and  populations 
for  the  Sm  and  Cp*  components  are  obtained.  This  is  followed  by  a  second  cal¬ 
culation  with  atom  charges  fixed  from  the  initial  results,  but  including  the  4f,  5d, 
and  6p  orbitals.  The  resulting  4f 14  and  5d9  configurations  are  corrected  for  electron 
repulsion  correlation.  This  is  evaluated  as  the  difference  in  electron  repulsion  energies 
for  4f14  vs.  4f6  and  4f55d]  (averaged)  and  the  5d9  vs.  4f55d!  configurations.  The 
necessary  Slater-Condon  parameters  obtained  from  the  quasirelativistic  HF-SCF 
atomic  program  cited  above,  were  used  to  generate  the  atomic  inputs  for  this  phase 
of  the  computation.  The  repulsive  energy  corrections  are  large,  amounting  to  36.426 
and  20.188  eV  for  the  4f  and  5d  levels,  respectively,  at  a  Sm  charge  of  1.47  (bent 
Sm — Cp*  bonds). 

The  final  third  calculation  is  done  with  the  electron  repulsion  correlation  for  the 
4f  and  5d  levels  included.  Following  this,  a  one-electron  configuration  interaction 
procedure  with  spin-orbit  coupling,  is  carried  out  on  all  MO’s.  This  procedure  is 
hereafter  designated  scmeh-rcci,  (self-consistent  modified  extended  Huckel  with 
repulsion  correlation  and  configuration  interaction).  Or,  as  in  this  case,  when 
quasirelativistic  AO  functions  are  employed  (for  Sm  anway),  the  method  is  labeled 
QR-SCMEH-RCCI. 


SCMEH-MO  CALCULATIONS  ON  LANTHANIDE  SYSTEMS.  I 


567 


Results  and  Discussion 

The  pertinent  final  data  for  both  the  bent  C2v)  and  linear  (D5h)  geometries  of 
Sm(Cp*)2  are  presented  in  Tables  I  and  II.  Filled  homo’s  for  both  cases  are  almost 
totally  Cp*  character,  except  for  the  small  degree  of  5d  character,  which  are  only 
1.8%  and  0.4%  for  linear  and  bent  forms,  respectively.  Of  course,  there  is  a  4/6 
sextet  manifold  (a  spin  heptet  state)  of  singularly  occupied  levels  lying  about  4.4 
eV  above  the  filled  homo,  in  each  case.  The  lumo  is  almost  100%  4f  character, 
but  the  next  to  lowest  unoccupied  mo  is  about  97%  Cp*  character,  and  lies  only 
0.22  eY  above  the  lumo. 

Based  on  the  very  small  degree  of  direct  covalent  bonding,  it  can  be  deduced 
that  the  Sm — Cp*  bonding  in  Sm(Cp*)2  is  some  93%  vs.  85%  ionic,  in  bent  and 
linear  forms,  respectively.  However,  owing  to  the  appreciable  charge  difference 
between  Sm  and  Cp*  and  the  facile  polarization  of  the  ligand,  secondary  polarization 
interaction  may  play  a  significant  role  in  reducing  this  degree  of  ionicity. 

Not  taking  into  account  this  latter  factor,  the  energy  to  break  the  ionic  Sm — 
Cp*  bond  will  be  that  required  to  overcome  the  total  attractive  and  repulsive  in¬ 
teractions  involving  the  Sm — Cp*  bond.  Using  the  charges  provided  in  Table  II 
together  with  the  Sm-C(ring)  distance  of  2.79  A  and  the  closest  Cp*-Cp*  contact 


Table  I.  mo  parameters  for  Sm(Cp*)2 . 


Percent  AO  Character 


Bent 


mo  Cp* 


(no.) 

E(eV) 

Occ 

4f 

5d 

6s 

6p 

(total  2’s,2p) 

33 

-4.141 

0 

1.7 

0.8 

_ 

_ 

97.5 

34 

-4.356 

0 

99.7 

_ 

— 

— 

0.3 

35 

-4.443 

1 

99.9 

— 

— 

— 

0.1 

36 

-4.444 

1 

99.9 

— 

— 

— 

0.1 

37 

-4.446 

1 

99.2 

— 

— 

— 

0.8 

38 

-4.446 

1 

99.5 

— 

— 

— 

0.5 

39 

-4.448 

1 

98.2 

— 

— 

— 

0.8 

40 

-4.449 

1 

98.5 

— 

— 

— 

0.5 

41 

-8.021 

2 

— 

0.4 

- 

— 

99.6 

42 

-8.129 

2 

— 

0.5 

— 

— 

99.5 

33 

-4.075 

0 

0.1 

3.1 

— 

— 

96.6 

34 

-4.287 

0 

99.2 

0.4 

— 

— 

0.4 

35 

-4.372 

1 

100.0 

— 

— 

— 

— 

36 

-4.373 

1 

100.0 

— 

— 

37 

-4.375 

1 

99.9 

— 

— 

— 

0.1 

38 

-4.375 

1 

95.2 

4.6 

— 

— 

0.2 

39 

-4.377 

1 

99.3 

0.5 

— 

— 

0.2 

40 

-4.378 

1 

99.9 

— 

— 

— 

0.1 

41 

-8.023 

2 

— 

1.8 

— 

— 

98.2 

42 

-8.119 

2 

— 

1.4 

— 

— 

98.6 

Linear 
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Table  II.  mo  characteristics  of  Sm(Cp*)2 . 


Bent 

Parallel 

(QR-SCMEH-RCCI) 

Orbital  parameters 

(EH  MO)a 

(QR-SCMEH-RCCl) 

%  AO  character  per  doubly 


occupied  mo 


4f 

_ 

— 

0.4 

5d 

— 

0.2 

0.6 

6s 

— 

4.3 

11.3 

6p 

— 

2.8 

2.7 

Total  Lowdin  population 

4f 

— 

5.957 

5.952 

5d 

8.20 

0.074 

0.210 

6s 

0.49 

0.512 

0.677 

6p 

0.41 

1.008 

1.074 

Charges 

Sm 

-7.10b 

1.47 

1.50 

Cp* 

3.55b 

-0.735 

-0.750 

a  From  Ref.  [7]. 

b  Derived  from  data  given  in  Ref.  [7]. 


distance  of  3.5  A  [6],  the  total  electrostatic  energy  for  the  Sm — Cp*  bond  is  com¬ 
puted  to  be  80.7  kcal/mol.  An  experimental  value  for  Sm(Cp*)2  has  not  been  re¬ 
ported,  but  the  values  listed  for  the  lanthanide  tricyclopentadienides  Ln(Cp)3,  range 
from  78.0  to  82.0  kcal/mol  [11].  Since  these  latter  compounds  have  been  charac¬ 
terized  to  be  ionic,  the  results  obtained  from  the  present  calculations  are  clearly 
reasonable. 

It  is  interesting  to  compare  these  results  with  that  reported  for  Sm(Cp*)2  via  the 
EH-MO  method  [7].  The  AO  orbital  character,  populations,  and  charges  are  given 
in  Table  II.  It  can  be  readily  deduced  from  the  data  given  in  Ref.  [7]  that  the  total 
populations  for  Sm  5d,  6s,  and  6p  orbitals  are  8.20,  0.49,  and  0.41,  respectively. 
This  results  in  a  net  Sm  charge  of  -7.10,  which  is,  of  course,  totally  unrealistic. 
Hence,  these  data  are  of  little  or  no  value  for  determining  the  bonding  in  Sm(Cp*)2 . 

Finally,  it  may  be  concluded  that  electronic  bonding  factors  do  not  play  any 
meaningful  role  in  determining  the  preferred  equilibrium  geometrical  conformation 
of  Sm(Cp*)2 .  Thus  the  findings  of  Hollis,  Burdett,  and  Bosnich,  which  suggest  that 
van  der  Waals  forces  play  the  dominant  role  in  establishing  the  molecular  geometry 
of  this  system,  appear  to  be  vindicated. 

Acknowledgments 

We  are  indebted  to  the  University  of  New  Orleans  Chemistry  Department  and 
the  Computer  Research  Center,  for  providing  the  facilities  and  resources  to  do  this 
work. 


SCMEH-MO  CALCULATIONS  ON  LANTHANIDE  SYSTEMS.  I 


569 


Bibliography 

[1]  P.  Jutzi,  F.  Kohl,  P.  Hofmann,  C.  Kruger,  and  Yi-H.  Tsay,  Chem.  Ber.  113,  757  (1980);  J.  L. 
Atwood  and  W.  E.  Hunter,  J.  Chem.  Soc.  Chem.  Commun.,  925  (1981);  L.  Femholt,  A.  Haaland, 
P.  Jutzi,  F.  Kohl,  and  R.  Seip,  Acta  Chem.  Scand.  A38,  211  (1984). 

[2]  R.  Blum,  K.  Faegri,  Jr.,  and  H.  V.  Volden,  Organometallics  9,  372  (1990);  R.  A.  Williams,  T.  P. 
Hanusa,  and  J.  C.  Huffman,  Organometallics  9,  1128  (1990). 

[3]  H.  P.  Luthi,  J.  H.  Ammeter,  J.  Almlof,  and  K.  Faegi,  Jr.,  J.  Chem.  Phys.  77,  2002  (1982). 

[4]  R.  A.  Andersen,  J.  M.  Boncella,  C.  J.  Bums,  R.  Blom,  A.  Haaland,  and  H.  V.  Volden,  J.  Organometall. 
Chem.  C49,  312  (1986);  R.  A.  Andersen,  R.  Bloom,  J.  M.  Boncella,  C.  J.  Bums,  and  H.  V.  Volden, 
Acta  Chem.  Scand.  A41,  24  (1987);  R.  A.  Andersen,  R.  Bloom,  C.  J.  Burns,  and  H.  V.  Volden,  J. 
Chem.  Soc.  Chem.  Commun.,  768  (1987). 

[5]  W.  J.  Evans,  L.  A.  Hughes,  and  T.  P.  Hanusa,  Organometallics  5,  1285  (1986). 

[6]  T.  K.  Hollis,  J.  K.  Burdett,  and  B.  Bosnick,  Organometallics  12,  3385  (1993). 

[7]  J.  V.  Ortiz  and  R.  Hoffmann,  Inorg.  Chem.  24,  2095  (1985). 

[8]  J.  C.  Breen,  D.  Hohl,  and  N.  Rosch,  Organometallics  6,  712  (1987). 

[9]  E.  A.  Boudreaux,  in  Vibrational  Spectra  and  Structure,  J.  R.  Durig,  Ed.  (Elsevier,  Amsterdam, 
1993),  Vol.  20,  pp.  189-238. 

[10]  E.  A.  Boudreaux,  S.  P.  Doussa,  and  M.  Klobukowski,  Int.  J.  Quant.  Chem.  20,  239  (1986). 

[11]  F.  R.  Hartley  and  S.  Patin,  The  Chemistry  of  the  Metal-Carbon  Bond  (Wiley,  London,  1982), 
Chap.  2. 

Received  February  17,  1994 


Structural  and  Electronic  Studies  of  Ga3As3,  Ga4As3, 

and  Ga3As4 

P.  PIQUINI,  S.  CANUTO,  and  A.  FAZZIO 

Institute)  de  Fi'sica,  Universidade  de  Sdo  Paulo,  CxP  20516,  01452-990  Sdo  Paulo,  SP,  Brazil 


Abstract 

Ab-initio  self-consistent-field  calculations  are  performed  on  Ga3As3,  Ga3As4,  and  Ga4As3  clusters. 
Geometries,  electron  affinities,  ionization  potentials,  and  bondings  are  analyzed  for  these  clusters.  Geometry 
optimizations  using  analytic  energy  gradients  followed  by  single  point  mp2  calculations  are  performed 
to  decide  for  the  equilibrium  structure.  The  alternation  pattern  observed  for  the  ionization  potential  and 
electron  affinity  are  verified  for  these  clusters  after  correlation  energy  is  included.  Our  results  are  compared 
with  previous  theoretical  and  experimental  calculations.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

The  understanding  of  the  experimentally  observed  results  regarding  geometry 
and  relative  stability  of  covalent  clusters  is  of  great  interest  and  challenge.  These 
clusters  can  be  produced  by  Laser  Vaporization  and  Supersonic  Clustering  technics 
[1].  An  important  point  in  these  studies  is  the  understanding  of  the  relationship 
between  the  physical  properties  of  a  cluster  and  its  size  and  composition.  Theoretical 
efforts  have  concentrated  on  calculations  of  the  geometric  properties  of  these  new 
species  generated  by  laser  vaporization  [2-15]. 

Recently,  interesting  experimental  results  on  geometric  and  electronic  properties 
of  GaAs  clusters  have  been  obtained  by  different  groups  [  1 6-20  ] .  In  particular,  the 
Rice  group  [1,16]  has  observed  an  alternation  pattern  in  the  ionization  potential 
(IP)  and  electron  affinity  (ea)  for  GawAs„  clusters  up  to  m  +  n  a  20  atoms.  The 
experimental  analysis  [1]  indicate  that  for  clusters  with  m  +  n  even  the  IP  is  greater 
than  6.4  eV  whereas  for  m  +  n  odd  the  IP  is  lower  that  this  value  that  corresponds 
to  the  energy  of  the  ArF  excimer  laser.  There  are  still  many  questions  regarding 
the  relative  stability  and  properties  of  these  GaAs  clusters. 

In  this  article  we  report  the  results  of  our  theoretical  studies  on  the  geometries, 
ip,  EA,  and  bonding  of  Ga3As3 ,  Ga3 AS4 ,  and  Ga4As3  clusters.  The  Ga3As3  cluster 
has  been  studied  previously  by  other  theoretical  groups  [8-10] .  Lou  et  al.  [9]  have 
determined  the  geometry  and  vertical  ip  and  ea  within  the  Local-Density  Approx¬ 
imation  (  lda  ) .  Al-Laham  and  Raghavachari  [11]  have  determined  the  geometry 
and  orbital  atomic  population  using  an  effective-core-potential  [21]  calculation. 
Graves  and  Scuseria  [10]  have  calculated  the  energetics  and  geometry  of  this  six- 
atom  cluster  within  a  SCF  all-electron  calculation.  The  seven-atom  clusters,  however, 
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have  been  nearly  ignored  but  Lou  et  al.  [9]  have  estimated  the  geometries,  ip  and 
ea,  within  the  lda  . 

Our  calculation  on  these  three  species  is  an  extension  of  our  previous  article  [13] 
where  we  have  studied  GaAs  clusters  up  to  5  atoms  with  an  ab-initio  Hartree-Fock 
SCF  all-electron  calculation  with  a  midi  basis  set  [22].  The  complexity  of  an  all¬ 
electron  calculation  becomes  more  and  more  pronounced  as  we  increase  the  number 
of  Ga  and  As  atoms  in  the  cluster.  Thus  in  this  study  we  have  utilized  a  double- 
zeta  valence  basis  set  which  gives  a  good  compromise  between  accuracy  and  com¬ 
putational  cost. 

Methods  and  Results 

The  calculations  are  performed  at  the  Hartree-Fock  ab-initio  self-consistent-field 
level  followed  by  single  point  second-order  perturbation  theory  (mp2)  [23].  We 
have  utilized  the  effective-core-potential  gaussian  basis  set  proposed  by  Wadt  and 
Hay  [21]  which  is  a  double-zeta  plus  diffuse  basis  set  representation  of  the  outer 
electrons  of  each  Ga  and  As  atoms.  All  calculations  were  made  using  the  GAMESS 
program  [24]. 

The  possible  geometrical  arrangements  increase  as  the  number  of  atoms  in  a 
mixed  cluster  increase  [5].  Hence,  we  have  implemented  an  experimental-  and 
theoretical-based  strategy  to  guide  our  search  of  the  minimum  energy  configuration. 
In  the  case  of  the  Ga3As3  system,  we  chose  the  initial  configurations  for  the  geometry 
optimization  as  those  obtained  by  edge-capping  and  face-capping  the  previous 
theoretically  proposed  structures  of  Ga3As2  and  Ga2As3  clusters  [13,9],  as  shown 
in  Figure  1 .  This  theoretically  proposed  structure  for  the  Ga2As3  is  in  agreement 
with  a  recent  ESR  study  [  20  ]  that  indicates  the  two  Ga  and  the  three  As  atoms  to 
be  symmetrically  equivalent.  For  the  Ga3As4  and  Ga4As3  cases,  the  initial  config- 


Figure  1 .  The  calculated  SCF  structures  for  Ga2As3  and  Ga3As2 .  Full  circles  correspond 
to  As  and  open  circles  to  Ga  atoms. 
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(1) 


(a) 


(b) 


(c) 


Figure  2.  The  calculated  SCF  +  single-point  mp2  structures  for  Ga3As3  (C,  symmetry), 
Ga3As4  (C  symmetry)  and  Ga4As3  (near  Cs  symmetry).  The  numbers  identify  the  atoms 

in  the  Table  I. 


urations  are  that  obtained  by  edge  and  face-capping  the  determined  Ga3As3  min¬ 
imum  energy  structure.  The  strategy  utilized  for  the  initial  configuration  is  based 
on  the  experimental  fact  that  the  GaAs  clusters  are  photofragmented  atom  by  atom, 
without  fission  of  the  cluster  [1].  We  also  note  that  in  the  case  of  Ga„Asm  clusters 
with  n  +  m  <  5,  all  proposed  structures  can  be  obtained  in  this  way  [7-9,13]. 
Besides  these  configurations  we  have  also  taken  as  initial  configuration  those  struc¬ 
tures  proposed  in  other  theoretical  studies  [9,1 1]. 

We  optimized  these  structures  without  any  symmetry  constraints.  Once  satisfied 
the  gradient  criterion  (0.0003  Hartree/Bohr)  we  calculated  the  vibrational  fre¬ 
quencies  to  verify  if  the  optimized  structure  was  a  local  minimum.  Once  the  SCF 
local  minima  were  obtained  we  have  performed  a  single  point  mp2  calculation. 

The  more  stable  structures  obtained  here  are  shown  in  Figure  2.  In  the  case  of 
Ga3As3  we  have  obtained  a  face-capped  trigonal  bipyramidal  structure  [Fig.  2(a)] 
which  is  in  perfect  agreement  with  previous  theoretical  calculations  [9,1 1] . 

The  calculated  Ga3As4  equilibrium  structure  was  obtained  when  we  relaxed  a 
face-capped  structure  where  the  As  approximates  the  face  labeled  I  in  Figure  2(a). 
The  configuration  obtained  after  relaxing  a  pentagonal  bipyramidal  structure,  pro¬ 
posed  by  Lou  et  al.  [9],  have  led  us  to  the  structure  shown  in  Figure  3(b).  [Fig. 


Figure  3.  The  figure  in  the  left  is  the  initial  pentagonal  bipyramidal  configuration  of 
Ga3As4  used  for  a  structure  optimization  run.  The  figure  in  right  is  the  final  structure 
obtained  at  a  SCF  level  starting  from  the  pentagonal  bipyramid. 
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3(a)  shows  the  initial  configuration]  which  is  3.2  kcal/mol  above  in  energy  than 
the  configuration  shown  in  Figure  2(b). 

The  calculated  Ga4As3  structure,  shown  in  Figure  2(c),  was  obtained  by  ap¬ 
proximating  one  Ga  atom  to  the  face  labeled  II  in  Figure  2(a).  This  calculated 
structure  is  6.0  kcal/mol  below  in  energy  than  that  obtained  when  we  start  the 
optimization  procedure  with  the  pentagonal  bipyramidal  structure  proposed  by 
Lou  et  al.  [9].  The  final  calculated  geometrical  parameters  for  the  three  systems 
studied  are  shown  in  Table  I. 


CATIONS  NEUTRALS 


ANIONS 

[-0.02] 


Figure  4.  The  calculated  net  charges  (in  brackets)  of  the  neutral  species  and  the  vertical 
cation  and  anion.  The  solid  lines  represent  bond-order  values  greater  than  0.1. 
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Table  I.  Calculated  geometric  parameters. 


Distances  (A) 

Ga3As3 

Ga3As4 

G&4AS3 

(D-(3) 

3.59 

(1)— (3)  4.66 

(D-(5) 

2.78 

(l)-(4) 

2.91 

(l)-(4)  3. IB 

(D-(6) 

3.91 

U)-(6) 

4.45 

(l)-(5)  2.71 

(D-(7) 

3.17 

(2)-(3) 

5.34 

(l)-(7)  4.20 

(2)-(3) 

4.59 

(2)-(6) 

3.22 

(2)-(4)  2.75 

(2)-(4) 

5.09 

(3)— (5) 

3.35 

(2)-(5)  3.28 

(2)-(5) 

2.69 

(3)-(6) 

2.64 

(2)— (7)  4.30 

(4)— (6) 

2.95 

(4)— (5) 

2.69 

(4)— (5)  2.66 

(4)— (7) 

2.75 

(4)-(6) 

2.60 

(4)-(7)  3.72 

(5)-(6) 

3.56 

(5)— (6)  3.56 

(5)-(7) 

3.96 

(5)-(7)  2.56 

(6)-(7) 

2.57 

Numbers  in  the  parentheses  refer  to  those  in  Figure  2. 


An  interesting  result  obtained  here  is  the  change  in  bond-order  when  the  neutral 
cluster  accepts  or  donates  an  electron.  In  Figure  4  we  show  the  positively  and 
negatively  charged  clusters  obtained  after  a  SCF  calculation  of  their  total  energies, 
bond-order,  and  charges  using  the  geometry  of  the  neutral  species.  The  solid  lines 
represent  bond-order  values  greater  than  0.1.  The  numbers  in  brackets  are  our 
calculated  values  for  the  net  charges.  We  verify  that  the  introduction  of  one  electron 
in  the  positively  charge  and  neutral  systems  increases  the  Ga  As  bond  and  de¬ 
creases  the  As  —  As  bond.  We  also  note  that  at  the  SCF  level  of  calculation  the  Ga 
atoms  have  a  tendency  to  bind  to  a  maximum  of  two  As  atoms.  The  introduction 
(removal)  of  one  electron  in  the  neutral  duster  increases  (decreases)  the  p  contri¬ 
bution  and  decreases  ( increases )  the  s  contribution  in  the  atomic  orbital  population 
on  each  atom.  By  these  findings  we  infer  that  the  hybridization  of  the  s  and  p 
orbitals  makes  an  important  role  in  the  effective  realization  of  the  Ga — As  binding. 

Another  interesting  aspect  related  to  these  clusters  is  the  experimental  suggestion 
of  an  alternating  pattern  regarding  the  ip  and  ea  with  the  number  of  atoms  in  the 
cluster  [1,16].  Thus,  using  the  optimized  geometries  obtained  at  the  SCF  level  we 
have  calculated  the  ip  and  EA  of  the  three  clusters  considered  here. 

The  Table  II  shows  our  results  for  the  vertical  IP  and  ea  obtained  both  at  the 
SCF  and  mp2  levels  but  using  the  SCF  geometries.  Typically  the  ip  of  the  GaAs 
clusters  lie  between  5eV  and  7eV  with  the  bulk  value  of  crystalline  GaAs  being 
5.49eV  [25].  As  to  the  ea,  the  cluster  values  lie  between  0.4eV  and  1.40eV,  with 
the  bulk  value  of  the  crystal  being  4.07eV  [26].  Our  results  are  in  fair  agreement 
with  these  numbers.  We  also  note  that  partial  inclusion  of  the  electron  correlation 
effects,  in  second-order  mp2,  decreases  the  ip  of  the  three  clusters.  The  results  ob¬ 
tained  for  the  seven-atom  clusters  are  below  6.4eV  after  inclusion  of  this  correlation 
correction.  For  the  EA,  the  correlation  correction  introduced  by  mp2  increases  for 
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Table  II.  Comparison  of  scf  and  second-order  Moller-Plesset 
results  for  the  vertical  iP(eV)  and  EA(eV)  of  Ga3As3,  Ga4As3,  and 
Ga3As4 . 


Ga3As3 

Ga3As4 

Ga4As3 

ip(scf) 

6.87 

6.46 

6.37 

ip(mp2) 

6.60 

5.50 

5.77 

ea(scf) 

0.40 

0.68 

0.84 

ea(mp2) 

0.36 

0.82 

1.38 

the  seven-atom  clusters  and  decreases  for  the  six-atom.  These  numbers  are  in  good 
agreement  with  the  experimental  results  but  at  this  level  of  calculation  we  do  not 
find  the  As-rich  (Ga3As4)  ea  higher  than  the  Ga-rich  (Ga4As3),  as  suggested  before. 
Our  calculated  values  for  the  ea  should  be  less  accurate  than  those  for  the  ip  because 
of  the  diffuse  character  of  the  anions.  In  any  case,  the  partial  introduction  of  the 
electron  correlation  energy  is  very  important  for  the  quantitative  verification  of  the 
experimentally  observed  even /odd  alternation  pattern  in  the  ip  and  EA  [1,16],  the 
even  (odd)  clusters  having  IP  greater  (lower)  than  6.4eV. 

Conclusions 

We  have  made  ab-initio  SCF  plus  single-point  mp2  calculations,  employing  ef- 
fective-core-potential,  to  study  the  electronic  and  structural  properties  of  Ga3As3, 
Ga4As3 ,  and  Ga3As4  clusters.  Our  calculated  structure  for  the  Ga3 As3  cluster  is  in 
perfect  agreement  with  previous  theoretical  studies  using  local-density  [9]  and  SCF 
followed  by  high-order  Moller-Plesset  calculations  in  the  effective-core-potential 
model  [11].  For  the  seven-atom  clusters  theoretical  investigations  are  more  scanty 
and  only  a  local-density  calculation  [9]  has  been  performed  before.  For  these  systems 
we  have  obtained  equilibrium  configurations  that  are  6.0  kcal/mol  (Ga4As3)  and 
3.2  kcal/mol  (Ga3As4)  below  the  pentagonal  bypiramid  structures  with  C2v  sym¬ 
metry  proposed  previously.  These  small  energy  differences  may  suggest  the  existence 
of  several  local  minima.  We  also  suggest  that  the  alternation  pattern  observed  ex¬ 
perimentally  for  the  ip  and  EA  [  1 , 1 6  ]  demands  the  inclusion  of  electron  correlation 
effects  for  a  proper  theoretical  description. 
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Abstract 

Optimum  eQuilibrium  geometries,  harmonic  vibrational  frequencies,  and  infrared  intensities  within 
the  double  harmonic  approximation  are  computed  for  diborane,  B2H6,  dialane,  A12H6,  and  digallane, 
Ga2H6,  at  both  the  SCF  level  of  theory  and  the  second-order  perturbation  theory  [E(2)]  using  three  large 
basis  sets:  6-31  lG(d,p),  6-31  lG(2d,2p),  and  6-31  lG(2df,2p).  In  particular,  the  results  obtained  with 
the  latter  basis  set  make  this  present  work  the  first  study  to  include  f- type  polarization  functions  in  a 
systematic  investigation  of  the  molecular  structure  and  properties  of  all  three  molecules  in  the  series. 
Because  of  the  good  agreement  of  the  present  theoretical  results  with  experimental  data  and  with  previous 
theoretical  studies  which  employed  a  higher  treatment  of  electron  correlation,  this  study  serves  to  show 
that  large  basis  sets  can  in  part  compensate  for  the  lack  of  a  more  advanced  treatment  of  electron 
correlation  in  these  electron-deficient  systems.  In  addition,  this  study  establishes  the  level  of  basis  set 
needed  for  future  work  on  these  systems  including  a  thorough  description  of  the  total  electronic  density 
at  a  correlated  level.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Diborane  is  the  prototypical  molecule  for  ju-hydrido  bridging  in  electron-deficient 
compounds.  As  a  result,  many  experimental  [1-10]  and  theoretical  [11-21]  studies 
have  been  made  on  this  system  and  its  corresponding  monomer.  In  contrast,  much 
less  work  has  been  published  on  dialane  and  digallane.  Experimental  results  are 
especially  rare  for  these  systems:  dialane  has  still  not  been  isolated  and  digallane 
only  just  synthesized  in  1989  by  Downs,  Goode,  and  Pulham  [22] .  However,  quite 
a  few  theoretical  studies  have  been  published  recently  [23-28].  Among  them  was 
a  thorough  investigation  by  Shen  and  Schaefer  in  which  optimum  equilibrium 
geometries,  harmonic  vibrational  frequencies,  and  infrared  intensities  were  com¬ 
puted  for  all  three  systems  through  the  level  of  configuration  interaction  limited  to 
single  and  double  excitations  (cisd)  using  a  standard  double-zeta  plus  polarization 
(dzp)  basis  set  [28].  Their  study  also  included  optimum  geometries,  vibrational 
frequencies,  and  infrared  intensities  for  diborane  and  dialane  and  optimum  ge¬ 
ometries  for  digallane  at  the  level  of  the  single  and  double  excitation  coupled- 
cluster  method  with  the  same  dzp  basis  set. 
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The  fact  that  dialane  has  not  been  synthesized  yet  is  somewhat  of  an  enigma. 
Ab  initio  predictions  of  its  binding  energy  (ca.  31  kcal/mol)  [24,28]  indicate  its 
larger  stability  as  compared  to  digallane  (binding  energy  ca.  22  kcal/mol)  [24,28]. 
Possibly,  factors  other  than  simple  thermodynamical  ones  are  responsible  for  such 
unique  properties  of  this  aluminum  hydride.  Therefore,  we  have  undertaken  a 
systematic  study  including  all  three  corresponding  systems  in  order  to  lighten  the 
shadow  of  this  unusual  chemistry. 

In  the  present  work  we  report  the  optimized  geometries,  harmonic  vibrational 
frequencies,  and  infrared  absorption  intensities  within  the  double  harmonic  ap¬ 
proximation  for  M2H6  (M  =  B,  Al,  Ga)  using  three  relatively  large  basis  sets 
at  both  the  SCF  level  of  theory  and  the  second-order  perturbation  theory 
[mbpt(2)  =  mp2  =  E(2)] .  The  main  reason  for  introducing  more  geometries  and 
vibrational  frequencies  into  the  literature  is  to  compare  our  results  with  previous 
theoretical  values  to  see  what  effects  extra  polarization  functions  have.  In  particular, 
this  is  the  first  study  to  include  fit ype  polarization  functions  in  a  systematic  inves¬ 
tigation  of  all  three  molecules  in  the  series.  Thus,  we  hope  to  determine  the  level 
of  basis  set  needed  for  future  work  on  these  systems  such  as  a  thorough  description 
of  the  total  electronic  density  at  a  correlated  level.  Such  a  determination  may  yield 
new  insight  into  the  nature  of  bridge  bonds  in  these  systems  and  related,  more 
complex  species. 


Computational  Details 

In  the  present  study,  optimized  equilibrium  geometries,  vibrational  frequencies, 
and  infrared  intensities  are  determined  for  each  of  the  three  molecular  species  at 
both  the  SCF  level  and  the  second-order  perturbation  theory  level  [E(2 )]  using  the 
GAUSSIAN92  program  package  [29].  Rotational  constants  are  also  determined 
at  each  optimum  geometry.  Each  molecule  has  D2h  symmetry  (see  Fig.  1 ),  and  the 
ground  electronic  state  for  each  system  is  !Ag.  Therefore,  the  SCF  calculations  all 
employ  a  single-configuration  RHF  wave  function. 

Three  basis  sets  of  valence  triple-zeta  quality  are  used  for  each  system.  For  B2H6 
and  A12H6,  the  three  basis  sets  are  6-311G(d,p),  6-31  lG(2d,2p),  and  6- 
31  lG(2df,2p).  The  6-3 11G  basis  is  a  standard  basis  set  in  GAUSSIAN92  and 


M  =  B,  Al,  Ga 


Figure  1 .  General  structure  for  diborane,  dialane,  and  digallane. 
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indicates  a  split-valence  basis  set  comprising  a  single-zeta  core  description  and  a 
triple-zeta  description  for  the  valence  orbitals  [30].  The  (d,p),  (2d,2p),  and  (2df,2p) 
indicate  the  types  and  number  of  polarization  functions  included  on  the  heavy 
atoms  and  on  the  hydrogens,  respectively.  Each  d  function  represents  five  energy- 
equivalent  complex  ^-functions,  and  the  f  function  represents  seven  energy- 
equivalent  complex/functions.  Thus,  for  diborane  the  6-3 1  lG(d,p)  basis  set  consists 
of  72  basis  functions  contracted  from  110  primitive  gaussian  functions;  the  6- 
31  lG(2d,2p)  basis  set  consists  of  100  basis  functions  contracted  from  138  primitive 
gaussian;  and  the  6-31  lG(2df,2p)  basis  set  consists  of  1 14  basis  functions  contracted 
from  152  primitive  gaussians.  For  dialane  the  6-31  lG(d,p)  basis  set  consists  of  88 
basis  functions  contracted  from  1 38  primitive  gaussian  functions;  the  6-3 1  lG(2d,2p) 
basis  set  consists  of  1 1 6  basis  functions  contracted  from  1 66  primitive  gaussians; 
and  the  6-31  lG(2df,2p)  basis  set  consists  of  130  basis  functions  contracted  from 
1 80  primitive  gaussians. 

Basis  sets  for  gallium  similar  to  those  that  we  used  for  diborane  and  dialane  are 
not  included  as  standard  basis  sets  in  GAUSSIAN92,  so  we  input  similar  basis  sets 
[  3 1  ]  via  the  input  decks.  Because  these  basis  sets  consist  of  a  triple-zeta  description 
for  valence  orbitals,  we  refer  to  them  as  TZ(d,p),  TZ(2d,2p),  and  TZ(2df,2p).  The 
Tz(d,p)  basis  set  consists  of  98  basis  functions  contracted  from  184  primitive  gauss¬ 
ian  functions;  the  TZ(2d,2p)  basis  set  consists  of  126  basis  functions  contracted 
from  212  primitive  gaussians;  and  the  TZ(2df,2p)  basis  set  consists  of  140  basis 
functions  contracted  from  226  primitive  gaussians. 

Second-order  perturbation  theory  is  the  least  expensive  level  of  correlated  cal¬ 
culation  with  regards  to  computer  cpu  time  and  disk  space,  and  yet  it  often  accounts 
for  >80%  of  the  electron  correlation  energy  [32-33].  At  this  level,  relatively  large 
molecules  with  heavy  atoms  may  be  studied  with  large  basis  sets.  Such  calculations 
as  those  reported  in  this  study  may  be  run  on  workstations  with  limited  amounts 
of  memory  and  disk  space.  In  contrast,  a  CISD  calculation  with  a  similar  sized  basis 
set  would  be  more  expensive  and  require  more  memory.  A  CISD  calculation  requires 
computational  steps  that  scale  in  cost  according  to  the  sixth  power  of  the  size  of 
the  basis  set. 

In  the  second-order  calculations  reported  in  this  study,  core  molecular  orbitals 
(MOs)  are  frozen.  In  diborane,  this  amounts  to  the  freezing  of  two  1 5-type  MOs.  In 
dialane  ten  1 5-,  25-,  and  2p-type  core  MOs  are  frozen;  in  digallane  a  total  of  28 
Is-,  2s-,  2 p-,  35-,  3 p-,  and  3d-type  MOs  are  frozen.  No  virtual  orbitals  are  deleted 
for  any  of  the  correlated  calculations.  Using  the  smaller  basis  set  [6-31  lG(d,p)], 
optimum  geometries,  vibrational  frequencies,  and  infrared  intensities  for  diborane 
and  dialane  are  computed  with  no  frozen  core  for  comparison  purposes. 

As  Shen  and  Schaefer  report,  the  number  of  vibrational  modes  belonging  to  each 
symmetry  type  for  these  molecules  is  different  depending  upon  which  of  the  six 
possible  right-handed  coordinate  systems  is  chosen  [28] .  They  chose  the  coordinate 
system  in  which  the  x-axis  passes  through  the  heavy  atom  nuclei  and  the  y-axis 
passes  through  the  bridging  hydrogen  nuclei.  We  follow  the  choice  of  Lammertsma 
and  Leszczynski  in  which  the  x-axis  passes  through  the  heavy  atom  nuclei  and  the 
z-axis  passes  through  the  bridging  hydrogen  nuclei  [24].  In  this  orientation  there 
are  the  following  types  of  normal  vibrational  modes: 
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Ag 

four 

Au 

one 

Big 

two 

Biu 

two 

B2g 

two 

b2u 

three 

B3g 

one 

B3u 

three. 

Because  all  of  the  molecules  studied  here  have  an  inversion  center,  all  vibrational 
modes  of  gerade  symmetry  will  be  IR  forbidden  within  the  doubly  harmonic  ap¬ 
proximation  in  which  the  vibrational  modes  are  taken  to  be  harmonic  and  the 
dipole  moment  function  is  assumed  to  be  linear. 

Results  and  Discussion 

Results  of  geometry  optimizations  for  diborane  are  given  in  Table  I  along  with 
the  experimental  equilibrium  geometry  of  Duncan  and  Harper  [9].  Note  the  con¬ 
sistency  and  trends  in  changes  of  the  theoretical  bond  distances  from  one  basis  set 
to  the  next.  The  SCF  level  B — B  interatomic  distance  for  the  smallest  basis  set  differs 
by  only  0.006  A  from  the  SCF  result  of  the  largest  basis  set.  The  B — H  bond  distances 
are  even  more  invariant.  The  terminal  B — H  bond  distance  differs  by  only  0.002 
A  at  this  level,  and  the  bridge  B — H  bond  distance  differs  by  0.005  A.  The  changes 
in  these  distances  from  one  basis  set  to  the  next  are  slightly  larger  at  the  E( 2)  level: 
0.017  A,  0.004  A,  and  0.007  A,  respectively,  but  are  still  quite  consistent.  The  bond 
angle  also  varies  little  for  the  applied  basis  sets.  This  consistency  indicates  that  an 
even  larger  basis  set  would  probably  have  little  effect  on  our  best  predicted  geometry. 
The  agreement  between  the  SCF  and  E(2)  results  for  diborane’s  equilibrium  ge¬ 
ometry  is  also  quite  good.  The  terminal  B — H  bond  distance  changes  the  least  with 
the  addition  of  electron  correlation;  and,  as  Shen  and  Schaefer  point  out,  the  B — 
B  interatomic  distance  is  the  most  sensitive  to  electron  correlation  [28  ] .  Interestingly, 
the  two  interatomic  distances  associated  with  the  central  ring  in  the  molecule  are 
calculated  to  be  shorter  at  the  E (2)  level  than  at  the  SCF  level.  This  result  is  most 
likely  due  to  the  fact  that  the  unusual  bonding  in  hydrogen  bridge  systems  is  not 
completely  defined  by  a  single  configuration  reference  function.  Specifically,  the 
B — B  bonding  molecular  orbital  is  unoccupied  in  the  SCF  reference  state.  The  cor¬ 
related  calculation  incorporates  this  orbital  and  other  important  contributions  to 
the  bonding  and  results  in  shorter  distances. 

Not  only  are  these  theoretical  results  for  B2H6  internally  consistent,  they  also 
agree  quite  well  with  the  experimental  results  of  Duncan  and  Harper  [9].  The  6- 
31  lG(2df,2p)  E(2)  distances  are  all  within  0.009  A  of  the  experimental  data,  and 
the  bond  angle  differs  by  only  0.6°.  In  fact,  two  of  these  three  bond  distances  are 
within  the  error  bars  of  the  experiment.  This  agreement  is  even  better  than  the 
highly  correlated  dzp  CCSD  results  of  Shen  and  Schaefer  [28].  Their  calculated 
interatomic  distances  at  that  level  differ  from  the  experimental  results  by  up  to 
0.042  A,  and  the  bond  angle  differs  by  1.5°. 
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In  Table  II  the  computed  harmonic  vibrational  frequencies  and  infrared  intensities 
for  diborarie  are  presented  along  with  the  experimental  vibrational  frequencies  of 
Duncan  { 10] .  Once  again,  the  theoretical  results  are  quite  consistent  from  one  basis 
set  to  the  next  at  both  the  SCF  and  E(2)  levels.  The  E(2)  frequencies  computed 
with  the  6-3 1  lG(2d,2p)  basis  set  are  all  within  1 8  cm-1  from  those  calculated  with 
the  6-31  lG(2df,2p)  basis  set;  all  but  three  are  within  10  cm-1.  The  differences 
between  the  SCF  and  correlated  results  are  more  pronounced  for  the  harmonic 
vibrational  frequencies  and  the  infrared  intensities  than  the  differences  between 
these  levels  for  the  geometrical  parameters.  However,  such  a  result  is  common.  The 
somewhat  unusual  result  is  that  four  of  the  vibrational  modes  have  higher  frequency 
values  than  the  corresponding  SCF  results.  Normally  one  expects  to  see  the  opposite 
trend  [34,35].  Shen  and  Schaefer  also  point  out  that  some  of  their  correlated  fre¬ 
quencies  are  greater  in  value  than  the  corresponding  SCF  results,  and  they  explain 
that  this  result  is  due  to  the  fact  that  some  of  the  correlated  interatomic  distances 
are  smaller  than  the  SCF  distances  [28]. 


Table  II.  B2H6  harmonic  vibrational  frequencies  (cm  ‘)  and  IR  intensities  (km/mol).a 


Sym 

6-31 1G  (d,p) 

6-31 1G  (2d,2p) 

6-31 1G  (2df,2p) 

Exp.c 

SCF 

E(2)b 

SCF 

E(2)b 

SCF 

E(2)b 

Ag 

819 

822 

819 

823 

821 

830 

788 

1291 

1230 

1297 

1235 

1295 

1226 

1187 

2230 

2210 

2241 

2221 

2243 

2222 

2096 

2694 

2660 

2703 

2662 

2702 

2670 

2526 

A„ 

888 

869 

889 

870 

889 

869 

833 

Big 

990 

954 

993 

954 

992 

953 

918 

2763 

2743 

2771 

2745 

2769 

2752 

2597 

Biu 

1061  (21) 

1010(19) 

1061  (26) 

1008  (25) 

1061  (26) 

1008  (24) 

974 

2006  (43) 

2031  (10) 

2013  (39) 

2035  (9) 

2018  (39) 

2053  (8) 

1924 

B2g 

900 

904 

900 

918 

903 

916 

860 

1886 

1926 

1897 

1936 

1902 

1944 

1756 

b2u 

404(14) 

359  (15) 

409  (15) 

380(16) 

408  (15) 

372  (17) 

367 

1091  (5) 

983  (1) 

1094  (3) 

993  (0.9) 

1092  (3) 

974  (0.6) 

951 

2779  (262) 

2756(174) 

2787  (245) 

2759  (172) 

2785  (246) 

2766  (163) 

2613 

B3g 

1160 

1085 

1160 

1082 

1158 

1066 

1023 

B311 

1270  (132) 

1217(87) 

1276  (128) 

1225  (83) 

1275  (126) 

1215  (76) 

1175 

1772  (752) 

1764  (523) 

1785  (734) 

1780  (513) 

1789  (728) 

1777  (489) 

1615 

2674(185) 

2645  (144) 

2683  (182) 

2646(146) 

2683  (182) 

2654(144) 

2518 

a  Infrared  intensities  are  shown  in  parentheses.  Blanks  indicate  identically  zero  values  for  modes  for¬ 
bidden  in  the  infrared  spectrum. 

b  Two  1 5-type  core  molecular  orbitals  are  frozen;  no  virtual  orbitals  are  deleted. 
c  See  Ref.  [10]. 
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Our  computed  harmonic  vibrational  frequencies  compare  quite  well  with  the 
experimental  anharmonic  frequencies  reported.  As  expected,  the  theoretical  results 
are  all  too  high  due  to  the  fact  that  they  are  harmonic.  The  6-31  lG(2df,2p)  E(2) 
level  results  are  higher  than  the  experimental  frequencies  by  5%,  3%,  6%,  6%,  4%, 
4%,  6%,  3%,  7%,  7%,  1 1%,  1%,  2%,  6%,  4%,  3%,  10%,  5%,  respectively  (in  the  order 
given  in  Table  II).  These  results  do  not  compare  quite  as  well  as  the  dzp  CCSD 
results  of  Shen  and  Schaefer;  but  the  difference  is  slight.  The  average  discrepancy 
of  their  results  with  experiment  is  just  under  4%;  our  results  differ  on  average  by 
just  over  5%. 

Table  III  contains  the  results  of  the  geometry  optimizations  for  A12H6 .  As  before, 
the  different  basis  sets  yield  interatomic  distances  and  bond  angles  that  vary  little 
at  both  the  SCF  and  the  E(2)  levels.  In  addition,  the  difference  between  the  SCF 
and  E(2)  results  are  not  quite  as  large  as  they  were  for  diborane.  This  trend  of 
vanishing  differences  with  the  increase  of  atomic  mass  has  been  noted  previously 
for  some  molecules  containing  elements  from  the  IVth  and  Vth  groups  of  the 
periodic  table  [36].  Similar  to  the  B— B  distance  in  B2H6,  the  Al— A1  interatomic 
distance  is  the  most  sensitive  to  electron  correlation,  but  it  is  not  as  sensitive  as  the 
corresponding  diborane  parameter.  Also  similar  to  the  results  of  B2H6,  we  note 
that  the  interatomic  distances  associated  with  the  ring  in  the  molecule  are  smaller 
at  the  correlated  level  than  at  the  SCF  level.  Again,  this  is  most  likely  due  to  the 
ring  structure  being  somewhat  poorly  described  by  a  single  configuration  reference 
function. 

Perhaps  the  most  interesting  observation  from  the  comparison  of  the  optimized 
equilibrium  geometries  of  diborane  and  dialane  is  the  fact  that  the  H — Al — H  angle 
in  dialane’s  ring  is  less  than  85°  at  both  levels  of  theory  for  all  basis  sets  used.  The 
corresponding  angle  in  diborane  is  greater  than  95°  in  all  cases  we  considered.  A 


Table  III.  A12H6  total  energies,  optimized  equilibrium  geometries,  and  rotational  constants. 


6-31 1G  (d,p) 

6-31 1G  (2d,2p) 

6-31 1G  (2df,2p) 

SCF 

E(2)a 

SCF 

E(2)a 

SCF 

E(2)a 

Total  energy 

(a.u.) 

-487.32011 

-487.49018 

-487.32603 

-487.50822 

-487.32708 

-487.51667 

re( BB),  A 

2.623 

2.606 

2.616 

2.608 

2.617 

2.607 

Terminal  re 

(BH),  A 

Bridge  re  (BH), 

1.571 

1.568 

1.569 

1.569 

1.570 

1.571 

A 

Terminal 

1.741 

1.730 

1.739 

1.732 

1.740 

1.733 

Z(HBH) 

127.1° 

128.1° 

126.9° 

127.6° 

126.9° 

127.9° 

Rotational  const. 

47.57 

47.55 

47.69 

47.60 

47.70 

47.42 

(GHz) 

4.52 

4.59 

4.54 

4.58 

4.54 

4.58 

4.31 

4.37 

4.34 

4.37 

4.33 

4.37 

a  Ten  Is-,  2s-,  and  2p-type  core  molecular  orbitals  are  frozen;  no  virtual  orbitals  are  deleted. 
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recent  study  of  B2H6  which  appraised  the  bonding  from  a  valence-bond  point  of 
view  concluded  that  the  directional  B — H  interactions  were  the  chief  interactions 
in  this  species,  but  that  the  direct  through  space  B — B  interactions  were  not  negligible 
[21].  If  this  is  true,  the  bonding  between  the  two  borons  may  pull  them  closer 
together  than  a  distance  for  which  the  bridge  bonds  alone  could  account,  thereby 
accounting  for  the  large  H — B — H  angle.  Similar  reasoning  from  the  opposite  di¬ 
rection  would  imply  that  little  or  no  direct  bonding  exists  between  the  two  aluminum 
atoms  in  dialane.  Comparison  of  the  largest  coefficients  from  the  second-order 
calculations  of  diborane  and  dialane  support  this  conclusion;  the  coefficients  are 
larger  for  B2H6.  Similarly,  the  simple  fact  that  the  energy  for  diborane  changes 
more  with  correlation  than  does  that  of  dialane  indicates  that  the  SCF  reference 
state  in  which  the  M — M  bonding  orbital  is  unoccupied  describes  A12H6  better 
than  B2H6 . 

The  theoretical  harmonic  vibrational  frequencies  and  infrared  intensities  for  di¬ 
alane  are  presented  in  Table  IV.  As  before,  the  theoretical  results  are  quite  consistent 
from  one  basis  set  to  the  next  at  both  levels  of  theory.  The  E(2 )  frequencies  calculated 


Table  IV.  A12H6  harmonic  vibrational  frequencies  (cm  ‘)  and  IR  intensities  (km/mol).a 


Sym 

6-31 1G  (d,p) 

6-31 1G  (2d,2p) 

6-31 1G  (2df,2p) 

SCF 

E(2)b 

SCF 

E(2)b 

SCF 

E(2)b 

Ag 

390 

396 

391 

389 

389 

387 

813 

779 

814 

782 

814 

775 

1620 

1613 

1604 

1582 

1594 

1569 

2049 

2035 

2044 

2015 

2041 

2009 

Au 

445 

433 

442 

433 

442 

431 

Big 

497 

480 

497 

479 

497 

476 

2046 

2043 

2039 

2020 

2036 

2015 

Biu 

685  (346) 

658  (320) 

681  (327) 

656  (308) 

680  (324) 

652  (302) 

1326  (537) 

1344  (423) 

1315  (475) 

1315  (386) 

1309  (467) 

1312  (367) 

B2g 

508 

502 

506 

506 

506 

502 

1451 

1473 

1440 

1453 

1435 

1452 

B2u 

241  (18) 

227  (16) 

242  (21) 

227  (18) 

242  (20) 

224  (17) 

964  (348) 

913  (298) 

939  (268) 

899  (240) 

935  (261) 

880  (225) 

2054  (536) 

2050  (438) 

2046  (497) 

2027  (414) 

2043  (497) 

2022  (397) 

B3g 

824 

788 

808 

787 

807 

783 

®3u 

765  (845) 

733  (731) 

768  (783) 

739  (696) 

769  (778) 

731  (676) 

1559  (1386) 

1562  (1221) 

1547  (1315) 

1540  (1165) 

1539(1316) 

1535  (1154) 

2041  (122) 

2028  (12) 

2036(133) 

2008  (125) 

2033  (133) 

2002  (123) 

a  Infrared  intensities  are  shown  in  parentheses.  Blanks  indicate  identically  zero  values  for  modes  for¬ 
bidden  in  the  infrared  spectrum. 

b  Ten  Is-,  2s-,  and  2/j-type  core  molecular  orbitals  are  frozen;  no  virtual  orbitals  are  deleted. 
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with  the  6-31  lG(2d,2p)  basis  set  all  differ  from  those  computed  with  the  larger 
basis  set  by  less  than  20  cm-1;  all  but  two  are  within  8  cm-1  of  each  other.  The 
differences  between  the  scf  and  E(2)  results  are  again  more  pronounced  for  the 
computed  frequencies  and  intensities  than  for  the  geometrical  parameters. 

Absent  from  both  Table  III  and  Table  IV  is  a  column  for  experimental  results 
because  dialane  has  not  been  observed  to  date.  However,  we  may  compare  our 
results  to  the  dzp  CCSD  results  of  Shen  and  Schaefer  [28] .  Even  though  the  basis 
set  they  used  is  much  smaller  than  any  of  those  incorporated  in  the  present  study, 
they  used  a  higher  level  of  theory.  All  of  the  interatomic  distances  from  our  6- 
31  lG(2df,2p)  E(2)  results  are  within  0.014  A  of  the  dzp  ccsd  distances,  and  the 
angle  differs  by  only  0.5°.  Based  on  this  agreement  and  on  the  fact  that  our  results 
for  diborane  were  slightly  closer  to  the  experimental  values  than  their  dzp  ccsd 
results,  we  believe  that  our  geometry  should  give  good  agreement  to  experiment  if 
ever  dialane  is  observed  experimentally. 

To  compare  the  vibrational  frequencies,  one  must  remember  that  the  choice  of 
molecular  orientation  is  different  in  the  two  theoretical  studies.  Thus,  the  BJg  and 
Blu  vibrational  modes  in  the  present  study  correspond  to  the  B2g  and  B2u  in  the 
work  of  Shen  and  Schaefer  and  vice  versa.  Our  harmonic  vibrational  frequencies 
computed  at  the  E( 2)  level  with  the  6-31  lG(2df,2p)  basis  set  differ  from  the  dzp 
CCSD  results  of  Shen  and  Schaefer  by  3%,  2%,  4%,  1%,  3%,  3%,  1%,  2%,  4%,  3%, 
1%,  5%,  8%,  1%,  5%,  2%,  3%,  2%,  respectively  (in  the  order  given  in  Table  IV).  It 
is  interesting  to  note  that  all  of  our  frequencies  are  lower.  This  was  not  the  case  for 
diborane,  where  only  five  of  our  6-31  lG(2df,2p)  E(  2)  level  frequencies  were  lower 
than  the  corresponding  dzp  CCSD  results  of  Shen  and  Schaefer,  12  frequencies  were 
higher,  and  one  was  the  same.  Due  to  the  close  agreement  of  these  two  theoretical 
sets  of  frequencies  and  due  to  the  fact  that  calculated  harmonic  vibrational  fre¬ 
quencies  are  usually  higher  than  corresponding  anharmonic  experimental  frequen¬ 
cies,  we  again  believe  our  results  will  yield  good  agreement  to  experiment. 

Table  V  presents  results  which  compare  the  effects  of  frozen  core  and  full  electron 
correlation  on  E(2)  geometries  of  diborane  and  dialane  computed  with  the  6- 


Table  V.  Effects  of  frozen  core  on  E(2)  geometries  using  a  6-31 1G  (d,p)  basis  set 


b2h6 

A12H6 

Frozen*  core 

Full  electron 
correlation 

Frozenb  core 

Full  electron 
correlation 

Total  energy  (a.u.) 

-53.05651 

-53.09193 

-487.49018 

-487.74034 

>V(MM),C  A 

1.769 

1.765 

2.606 

2.602 

Terminal  r^MH),  A 

1.188 

1.188 

1.568 

1.567 

Bridge  r^MH),  A 

1.316 

1.315 

1.730 

1.728 

Terminal  Z-(HMH) 

122.4° 

122.3° 

128.1° 

128.1° 

a  Two  ls-type  core  molecular  orbitals  are  frozen;  no  virtual  orbitals  are  deleted. 
b  Ten  Is-,  2s-,  and  2/?-type  core  molecular  orbitals  are  frozen;  no  virtual  orbitals  are  deleted. 
c  M  =  B  or  Al. 
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31G(d,p)  basis  set.  The  interatomic  distance  between  the  heavy  atoms  appears  to 
be  the  geometrical  parameter  affected  the  most  by  frozen  core  molecular  orbitals, 
but  even  these  differences  are  small — 0.004  A  for  both  molecules.  The  other  geo¬ 
metrical  parameters  are  affected  even  less.  They  vary  by  either  0.001  A,  0.1°,  or 
not  at  all. 

The  effects  of  freezing  and  not  freezing  core  molecular  orbitals  on  harmonic 
vibrational  frequencies  and  infrared  intensities  of  diborane  and  dialane  computed 
at  the  E( 2)  level  with  a  6-3 1  lG(d,p)  basis  set  are  seen  in  the  results  listed  in  Table 
VI.  The  differences  in  wave  numbers  between  the  frozen  core  frequencies  and  full 
correlation  frequencies  for  B2H6  are  4,  0,  3,  0,  2,  2,  1,  2,  8,  6,  6,  1,  0,  0,  1,  1,2,  1, 
respectively.  The  differences  in  infrared  intensities  between  these  two  levels  for 
B2H6  are  0,  1,0,  0,  2,  3,  6,  1,  respectively  (in  the  order  of  nonzero  intensities  given 
in  Table  VI).  The  differences  in  wave  numbers  between  the  frozen  core  frequencies 


Table  VI.  Effects  of  frozen  core  on  E(2)  harmonic  vibrational  frequencies  (cm  ')  and  infrared  intensities 
(km/mol)a  using  a  6-31 1G  (d,p)  basis  set. 


Sym 

b2h6 

A12H6 

Frozenb  core 

Full  electron 
correlation 

Frozenc  core 

Full  electron 
correlation 

Ag 

822 

826 

396 

393 

1230 

1230 

779 

779 

2210 

2213 

1613 

1616 

2660 

2660 

2035 

2051 

Au 

869 

871 

433 

434 

b16 

954 

956 

480 

481 

2743 

2742 

2043 

2042 

Biu 

1010(19) 

1012(19) 

658  (320) 

659  (318) 

2031  (10) 

2039  (9) 

1344  (423) 

1351  (417) 

B2g 

904 

910 

502 

504 

1926 

1932 

1473 

1482 

b2u 

359  (15) 

360  (15) 

227  (16) 

226  (16) 

983  (1) 

983  (1) 

913  (298) 

914(296) 

2756  (174) 

2756  (172) 

2050  (438) 

2057  (435) 

B3g 

1085 

1086 

788 

791 

B3u 

1217(87) 

1218(84) 

733  (731) 

733  (726) 

1764  (523) 

1766  (517) 

1562(1221) 

1569  (1215) 

2645  (144) 

2644(143) 

2028  (120) 

2035  (121) 

a  Infrared  intensities  are  shown  in  parentheses.  Blanks  indicate  identically  zero  values  for  modes  for¬ 
bidden  in  the  infrared  spectrum. 

b  Two  1  j'-type  core  molecular  orbitals  are  frozen;  no  virtual  orbitals  are  deleted. 
c  Ten  Is-,  2s-,  and  2p-type  core  molecular  orbitals  are  frozen;  no  virtual  orbitals  are  deleted. 


DIBORANE,  DIALANE,  AND  DIGALLANE 


589 


and  full  correlation  frequencies  for  A12H6  are  3,  0,  3,  16,  1,  1,  1,  1,  7,  2,  9,  1,  1,7, 
3,  0,  7,  7,  respectively.  The  differences  in  infrared  intensities  between  these  two 
levels  for  A12H6  are  2,  6, 0,  2,  3,  5,  6,  1,  respectively.  The  agreement  between  results 
computed  with  frozen  core  and  those  with  no  frozen  core  presented  in  Table  V 
and  Table  VI  leads  us  to  believe  that  freezing  core  molecular  orbitals  when  the  6- 
31  lG(d,p)  basis  set  is  used  will  have  little  effect  on  overall  theoretical  predictions 
in  the  series  of  molecules  under  present  study.  For  the  larger  basis  sets,  the  core 
molecular  orbitals  should  be  stabilized  more;  therefore,  freezing  core  MOs  in  the 
larger  basis  sets  is  even  more  justified. 

Results  of  geometry  optimizations  for  digallane  are  given  in  Table  VII  along 
with  the  gas-phase  electron  diffraction  structure  from  Pulham  et  al.  [37].  In  contrast 
to  the  similar  results  for  the  different  basis  sets  at  both  the  SCF  and  E(2)  levels  for 
both  diborane  and  dialane,  the  theoretical  geometries  for  digallane  are  much  more 
basis  set  dependent  at  both  levels  of  theory.  The  general  trend  for  the  theoretical 
geometrical  parameters  to  become  smaller  as  the  basis  set  is  enlarged  is  much  more 
pronounced  in  these  results  of  digallane  than  it  is  in  the  computations  of  diborane 
presented  in  Table  I.  This  trend  is  actually  reversed  in  dialane,  in  which  most  of 
the  calculated  geometrical  parameters  become  slightly  larger  at  both  levels  of  theory 
as  the  basis  set  is  increased. 

Even  though  the  geometry  for  digallane  is  more  basis  set  dependent  than  the 
geometries  of  diborane  and  dialane,  the  SCF  and  E(2)  results  for  digallane  are  quite 
similar  for  all  three  basis  sets.  In  addition,  the  agreement  between  the  6- 
31  lG(2df,2p)  E(2)  level  results  and  the  experimental  data  is  quite  good.  The  dif¬ 
ferences  between  the  computed  interatomic  distances  at  this  level  and  the  experi¬ 
mental  results  are  0.014  A,  0.035  A,  and  0.035  A,  respectively.  These  last  two 
parameters  are  both  Ga-— H  bond  distances,  and  the  theoretical  results  for  both  are 
within  the  error  bars  of  the  experiment  [37].  In  addition,  these  6-31  lG(2df,2p) 
E(2)  results  are  on  average  closer  to  the  experimental  values  than  the  dzp  CCSD 
results  of  Shen  and  Schaefer  [28]. 

Two  further  points  should  be  made  with  regards  to  the  geometry  of  digallane. 
First,  as  with  dialane  the  H— Ga— H  ring  angle  is  computed  to  be  less  than  85°  at 
both  levels  of  theory  for  all  three  basis  sets.  In  addition,  the  experimental  geometry 
yields  this  ring  angle  to  be  approximately  82°.  If  we  use  the  same  argument  discussed 
earlier,  this  result  would  appear  to  indicate  that  only  very  little  direct  through  space 
Ga — Ga  interaction  exists.  The  second  point  is  related,  but  more  obvious.  In  com¬ 
paring  the  interatomic  distances  reported  for  dialane  and  digallane,  we  note  that 
they  are  quite  similar.  This  similarity  originates  from  the  fact  that  aluminum  and 
gallium  have  similar  valence  radii.  Such  a  result  is  expected  for  a  pair  of  third-  and 
fourth-row  elements  [38].  However,  even  though  similar,  our  predicted  distances 
associated  with  the  ^-hydrido  bridge  are  actually  shorter  for  digallane.  If  shorter 
bonds  indicate  a  greater  degree  of  chemical  bonding,  then  we  may  assume  that 
digallane  is  more  tightly  bound  than  dialane.  This  fact  may  be  one  of  the  reasons 
that  dialane  has  eluded  experimental  observation  thus  far.  It  is  interesting  to  note 
that  the  dzp  CCSD  results  of  Shen  and  Schaefer  predicted  the  opposite  trend  in 
these  interatomic  distances  of  A12H6  and  Ga2H6  [  28  ] . 


Table  VII,  Ga2H6  total  energies,  optimized  equilibrium  geometries,  and  rotational  constants. 
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In  Table  VIII  the  computed  harmonic  vibrational  frequencies  and  infrared  in¬ 
tensities  for  digallane  are  presented  along  with  the  experimental  vibrational  modes 
reported  by  Pulham  et  al.  [  37  ] .  Once  again,  the  theoretical  results  are  quite  consistent 
from  one  basis  set  to  the  next  at  both  levels  of  theory.  The  E(2)  vibrational  fre¬ 
quencies  calculated  with  the  6-31  lG(d,p)  basis  set  all  differ  from  those  computed 
with  the  6-3 1  lG(2d,2p)  basis  set  by  less  than  30  cm"1 ;  all  but  two  of  the  frequencies 
are  within  19  cm"1.  The  E(2)  vibrational  frequencies  calculated  with  the  6- 
31  lG(2d,2p)  basis  set  are  all  within  27  cm"1  of  those  computed  with  the  6- 
31  lG(2df,2p)  basis  set,  and  all  but  two  are  within  15  cm"1  of  each  other.  With  the 
exception  of  the  two  lower  frequency  vibrational  modes  of  B3u  symmetry,  the  theo¬ 
retical  infrared  intensities  also  compare  quite  well  between  the  different  basis  sets 
employed.  Except  for  these  two  B3u  modes,  the  intensities  computed  at  the  E(2) 
level  with  the  larger  two  basis  sets  are  all  within  20  km/mol  of  each  other. 


Table  VIII.  Ga2H6  harmonic  vibrational  frequencies  (cm"1)  and  ir  intensities  (km/mol).a 


Sym 

TZ  (d,p) 

tz  (2d,2p) 

TZ  (2df,2p) 

Exp.c 

SCF 

E(2)b 

SCF 

E(2)b 

SCF 

E(2)b 

Ag 

239 

239 

237 

235 

242 

241 

789 

752 

795 

755 

804 

762 

1584 

1570 

1572 

1553 

1600 

1580 

2081 

2054 

2108 

2071 

2118 

2086 

Au 

479 

460 

483 

470 

480 

466 

Blg 

503 

480 

509 

489 

501 

481 

2067 

2052 

2096 

2071 

2104 

2084 

Biu 

700  (198) 

666  (176) 

704(196) 

672  (183) 

694(198) 

660(179) 

— 652 

1276  (400) 

1292  (309) 

1274  (378) 

1284  (301) 

1285  (366) 

1304(281) 

1202 

B2g 

394 

421 

406 

429 

410 

429 

1349 

1365 

1360 

1376 

1365 

1382 

B2u 

242  (9) 

221  (7) 

252  (10) 

235  (9) 

253  (11) 

236  (9) 

879  (221) 

814(182) 

888  (209) 

842  (181) 

879  (196) 

823  (163) 

760 

2074  (493) 

2058  (413) 

2102  (481) 

2077  (419) 

2111  (489) 

2090  (414) 

1993 

B3g 

834 

787 

840 

809 

830 

795 

B31I 

726  (760) 

695  (619) 

732  (750) 

697  (638) 

744  (729) 

709  (598) 

671 

1423  (1375) 

1417  (1226) 

1431  (1321) 

1426  (1182) 

1434(1302) 

1428  (1157) 

1273 

2074  (136) 

2048  (130) 

2100(143) 

2065  (137) 

2110(149) 

2079  (141) 

1976 

a  Infrared  intensities  are  shown  in  parentheses.  Blanks  indicate  identically  zero  values  for  modes  for¬ 
bidden  in  the  infrared  spectrum. 

b  Twenty-eight  \s- ,  2s-,  2p-,  3s-,  3 p-,  and  3^-type  core  molecular  orbitals  are  frozen;  no  virtual  orbitals 
are  deleted. 

c  See  Ref.  [37].  Cited  here  are  fundamental  vibrational  frequencies  for  digallane  in  the  vapor  phase  at 
ca.  270  K.  The  exception  is  the  approximate  value  at  652  cm-1,  which  is  an  average  for  values  obtained 
in  solid  argon  and  nitrogen  matrices  at  20  K. 
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Our  computed  harmonic  vibrational  frequencies  for  digallane  also  compare  quite 
well  to  the  experimental  anharmonic  frequencies  reported  [37] .  As  with  diborane, 
the  theoretical  results  are  all  too  high  due  to  the  fact  that  they  are  harmonic.  The 
6-31  lG(2df,2p)  E(2)  level  results  differ  from  the  experimental  frequencies  by  1%, 
8%,  8%,  5%,  6%,  12%,  and  5%,  respectively  (in  the  order  of  observed  experimental 
frequencies  reported  in  Table  VIII).  The  average  discrepancy  of  our  theoretical 
frequencies  with  the  experimental  data  is  just  over  6.5%.  The  corresponding  dzp 
CCSD  results  of  Shen  and  Schaefer  [28]  are  higher  than  the  experimental  frequencies 
by  5%,  8%,  10%,  5%,  8%,  1 1%,  and  6%,  respectively,  yielding  an  average  discrepancy 
of  just  under  7.5%.  This  comparison  gives  more  support  to  our  contention  that 
calculations  made  with  the  large  basis  sets  employed  in  this  study  at  the  E(2)  level 
can  compensate  for  the  lack  of  a  more  advanced  treatment  of  electron  correlation 
in  these  electron  deficient  systems. 


Conclusions 

One  of  the  principle  objectives  of  this  article  was  to  show  that  reliable  geometries 
and  harmonic  vibrational  frequencies  can  be  calculated  at  the  second-order  per¬ 
turbation  theory  level  provided  that  a  good  basis  set  is  used.  For  the  three  molecules 
considered  in  this  study,  experimental  data  exists  for  two.  For  both  of  these  mol¬ 
ecules,  the  6-31  lG(2df,2p)  E(2)  computed  geometries  are  in  closer  agreement  to 
experiment  than  the  dzp  ccsd  results  of  Shen  and  Schaefer,  and  the  theoretical 
harmonic  vibrational  frequencies  are  almost  as  good.  The  larger  basis  sets  used  in 
this  study  appear  to  compensate  for  the  lack  of  extended  correlation  in  these  systems. 
This  result  implies  that  a  higher  level  of  correlation  with  these  same  large  basis  sets 
would  yield  even  better  results. 

Due  to  a  computational  bottleneck  for  larger  molecular  systems,  especially  those 
involving  heavy  elements  such  as  aluminum  and  gallium,  correlated  calculations 
with  such  large  basis  sets  are  often  not  feasible.  However,  our  study  proved  that  a 
TZ  basis  set  augmented  by  one  set  of  d  polarization  functions  on  the  heavy  atoms 
and  one  p -orbital  on  the  hydrogens  gives  a  good  balance  between  necessary  computer 
resources  and  reliability  of  predictions  at  the  E(2)  level  of  theory. 

Another  objective  of  this  study  was  to  find  a  level  of  basis  set  appropriate  for 
future  investigations  of  correlated  densities  in  series  of  related  molecules.  As  pre¬ 
viously  mentioned,  a  recent  study  of  the  bonding  in  diborane  which  took  a  valence- 
bond  point  of  view  concluded  that  while  the  directional  B — H  interactions  are  the 
chief  interactions  in  the  system,  the  direct  through  space  B — B  interactions  are  not 
negligible  [21].  Many  contour  maps  of  the  electronic  charge  density  for  this  system 
have  been  computed  at  the  scf  level.  Perhaps  a  contour  map  from  a  correlated 
wave  function  determined  with  a  large  basis  set,  such  as  those  used  in  this  study, 
will  provide  an  interesting  comparison  for  this  valence  bond  interpretation. 
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Abstract 

Electronic  structure  investigations  on  a  broad  range  of  gold  compounds,  including  naked  and  ligated 
gold  clusters,  are  reviewed.  The  calculations  have  been  carried  out  with  a  recently  introduced  relativistic 
variant  of  the  linear  combination  of  Gaussian-type  orbitals  density-functional  (lcgto-df)  method  which 
affords  all-electron  investigations  for  very  large  systems.  The  accuracy  of  the  method  will  be  evaluated 
for  the  gold  dimer.  Then  the  electronic  structure  of  the  naked  cluster  Au55  is  studied,  both  in  Ih  and  Oh 
symmetry.  Nonrelativistic  and  relativistic  results  obtained  by  the  present  method  are  compared  to  those 
of  the  much  simpler  jellium  model.  Since  triphenylphosphine  is  among  the  most  common  ligands  in 
gold  chemistry  a  series  of  mononuclear  gold  phosphine  compounds  MeAuPR3  with  increasingly  complex 
ligands  PR3  (R  =  H,  CH3,  C5H6)  is  discussed.  The  calculations  reveal  the  success  and  the  limitations  of 
simpler  phosphines  often  employed  as  model  ligands  in  theoretical  studies.  Some  aspects  of  the  phosphine 
gold  interaction  in  these  simpler  compounds  carry  over  to  the  main  group  element  centered  gold  clusters. 
Thereby  one  arrives  at  a  rationalization  of  the  particularly  high  stability  of  the  carbon-centered  octahedral 
cluster  cation  [(R3PAu)6C]2+  as  compared  to  the  neighboring  isoelectronic  boron  and  nitrogen-centered 
clusters.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Gold  is  a  noble  metal,  yet  it  affords  a  surprisingly  rich  structural  chemistry  [1- 
3].  This  is  perhaps  best  exemplified  by  the  discovery  of  the  beautiful  octahedral 
cluster  cations  [(Ph3PAu)6C]2+  [4]  and  [(Ph3PAu)6P]3+  [5,6]  (with  Ph  =  C6H5) 
which  boast  carbon  and  phosphorus  as  central  atoms,  respectively,  in  the  unusual 
six-fold  coordination.  A  further  example  is  provided  by  the  unexpected  square- 
pyramidal  C4v  structure  of  the  four-fold  coordinate  cluster  cation  [(Ph3PAu)4As]  + 
[7]  which  is  in  contrast  to  that  of  the  tetragonal  cluster  [(Ph3PAu)4N]+  [8].  Even 
some  rather  large  clusters  are  known  with  a  well-defined  stoichiometry  and  structure, 
the  compound  Au55(PPh3)12Cl6  being  arguably  the  most  prominent  one  [9].  A 
large  variety  of  physical  measurements  on  this  compound  is  at  hand  [10]  and  an 
accurate  electronic  calculation  is  highly  desirable  to  assist  in  their  interpretation, 
particularly  concerning  the  question  to  which  extent  the  AU55  cluster  core  exhibits 
metallic  behavior. 
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Gold  cluster  compounds,  with  and  without  ligands,  find  widespread  applications 
ranging  from  novel  materials  for  nonlinear  optics  [11]  to  labeling  agents  in  bio¬ 
chemistry  [12].  On  the  other  hand,  more  fundamental  interests  are  fueled  by  the 
fact  that  gold  compounds  feature  very  prominent  relativistic  effects  as  the  ratio  of 
nonrelativistically  and  relativistically  calculated  atomic  radii  exhibits  a  maximum 
value  for  gold  within  the  periodic  table  of  elements  [13],  Thus  gold  compounds 
require  a  relativistic  treatment  of  the  electronic  structure.  We  have  recently  proposed 
a  scalar-relativistic  extension  of  the  linear  combination  of  Gaussian-type  orbitals 
density-functional  (lcgto-df)  method  which  has  been  implemented  in  a  self- 
consistent  fashion  [14-16].  The  method  affords  a  variationally  stable  and  numer¬ 
ically  efficient  treatment  even  for  larger  compounds.  In  the  next  section  we  will 
briefly  describe  the  salient  features  of  the  method  and  discuss  some  computational 
details  concerning  the  applications  to  be  presented  in  later  sections  of  the  article. 

To  demonstrate  the  accuracy  of  the  method,  we  will  discuss  results  for  the  gold 
dimer  Au2,  a  widely  used  benchmark  system  for  relativistic  electronic  structure 
methods,  and  compare  them  to  those  obtained  by  other  methods.  Then  we  will 
proceed  to  a  study  of  the  cluster  Au55  which  is  presented  here  for  the  first  time. 
Differences  and  common  features  in  the  electronic  structure  of  this  compound 
between  various  symmetries  and  computational  models  will  be  discussed.  Particular 
attention  will  be  paid  to  the  question  to  which  extent  the  spherical  jellium  model 
may  serve  as  a  basis  for  a  qualitative  description  of  the  valence  level  spectrum  of 
this  metal  cluster.  One  of  the  most  common  ligands  encountered  in  gold  chemistry 
is  triphenylphosphine,  PPh3.  Electronic  structure  investigations  on  PPh3  ligated 
gold  clusters  are  commonly  carried  out  with  the  help  of  model  phosphine  ligands. 
To  explore  the  validity  and  the  limitations  of  this  approach,  we  will  analyze  the 
effects  of  phosphine  substituents  for  the  series  of  gold(I)  compounds  MeAuPR3 
(with  R  =  H,  Me,  Ph). 

The  carbon-centered  octahedral  cluster  [(LAu)6C]2+  (with  L  =  PPh3)  is  available 
in  high  experimental  abundancy  [4]  whereas  the  clusters  [(LAu)6B]+  and 
[(LAu)6N] 3+  have  not  been  observed  so  far.  Therefore,  we  finally  address  the  prob¬ 
lem  how  the  carbon-centered  gold  cluster  cation  differs  from  its  isoelectronic  boron- 
and  nitrogen-centered  analogues.  To  this  end  we  will  analyze  the  changes  in  the 
electronic  structure  of  the  octahedral  gold  phosphine  clusters  along  the  series  B  -► 
C  -►  N. 


The  Quasirelativistic  lcgto-df  Method 

The  four-component  Dirac-Kohn-Sham  equations  [17,18], 

h ks i/'7  =  {col  •  7r  +  (3c 2  +  vn  +  vc  +  vxc)ypi  =  etfi  ,  ( 1 ) 

provide  a  convenient  starting  point  for  a  relativistic  density-functional  method. 
Here,  the  effective  one-particle  “Hamiltonian”  /zks  contains  the  nuclear  potential 
vn,  the  classical  electronic  Coulomb  potential  vc,  and  the  exchange-correlation  po¬ 
tential  vxc.  The  latter  also  comprises  relativistic  contributions,  such  as  radiative 
corrections  [18].  Since  chemistry  is  largely  dominated  by  the  electronic  solutions 
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the  elimination  of  the  positronic  degrees  of  freedom  from  Eq.  ( 1 )  by  some  trans¬ 
formation  is  computationally  advantageous. 

The  quasi-relativistic  extension  [14-16]  of  the  LCGTO-DF  method  [19,20]  is 
based  on  the  Douglas-Kroll  (dk)  transformation  [21] .  This  transformation  allows 
for  a  variationally  stable  reduction  of  the  four-component  Dirac-type  to  the  familiar 
two-component  formalism  and  permits  the  self-consistent  treatment  of  relativistic 
effects.  This  methodology  which  employs  the  no-pair  projection  operator  formalism 
of  quantum  electrodynamics  has  been  used  previously  in  the  context  of  wave  func¬ 
tion-based  electronic  structure  methods  [22-24]. 

Here,  we  merely  report  the  results  of  the  dk  transformation  in  the  framework 
of  the  Kohn-Sham  theory,  correct  to  the  second  order  in  the  effective  one-particle 
potential  v  (=vn  +  vc  +  vxc).  One  arrives  at  a  set  of  two-component  Kohn-Sham 
like  equations  with  an  effective  one-particle  “Hamiltonian”  [15,16] 

AS  =  Ep  +  ApvAp  +  ApRpvRpAp  -  \(EPW2  +  W2E„  +  2 WEPW) .  (2) 

In  this  equation,  the  abbreviations 


Ep  =  cVp2  +  c 2 ,  .  Ap 


/ Ep  +  c2  R  =  cjv-P) 
V  2Ep  ’  p  Ep  +  c2 


(3) 


have  been  used  where  a  designates  the  vector  of  the  Pauli  spin  matrices,  and  p  the 
electronic  momentum  operator.  The  integral  operator  W  is  given  in  the  momentum 
representation  by 


W, 


p,p 


RpVp,p'  Vp,p'Rp' 
Ep  +  Ep> 


(4) 


This  DK  transformation  avoids  the  generation  of  highly  singular  operators  that  arise 
when  the  more  familiar  Foldy-Wouthuysen  transformation  is  used  [25,26].  As  a 
consequence  of  the  dk  transformation,  matrix  techniques  [24]  may  be  successfully 
employed  for  the  evaluation  of  the  various,  rather  complicated  operators.  However, 
only  matrix  elements  of  the  operators  (p  •  vp)  and  (p  X  vp)  have  to  be  evaluated 
in  addition  to  those  already  required  in  the  standard  nonrelativistic  version  of  the 
method. 

For  the  investigations  to  be  discussed  in  this  article,  a  scalar-relativistic  variant 
of  the  above  methodology  has  been  employed  which  is  obtained  when  one  formally 
replaces  the  Pauli  spin  matrices  by  unit  matrices.  This  procedure  is  equivalent  to 
the  neglect  of  the  spin-orbit  interaction.  Furthermore,  only  the  dominating  nuclear 
potential  has  so  far  been  taken  into  account  in  the  DK  transformation*  but  not  the 
costly  electronic  contributions  to  the  potential.  This  level  of  theory  has  been  termed 
“vn2”  in  Ref.  [16].  Recent  applications,  besides  the  studies  of  gold  compounds, 
comprise  a  wide  range  of  other  heavy  element  compounds  including  Ce@C28  [  27 ,28  ] 
and  Ni38Pt6(CO)48  [29]. 

Before  presenting  the  results  for  gold  compounds,  it  is  appropriate  to  touch  on 
some  computational  details.  The  orbital  and  fitting  basis  sets  employed  for  all  in¬ 
vestigations  but  in  the  fourth  section  as  well  as  the  geometries  used  have  been 


598 


HABERLEN,  CHUNG,  AND  ROSCH 


described  previously  ([16],  [30],  and  [31]).  For  a  rough  orientation,  we  nevertheless 
mention  the  size  of  the  uncontracted  and  of  the  contracted  basis  sets  in  shorthand 
notation:  Au  ( 2 Is,  1 7p,  1  ld,7f)/[l ls,10p,7d,3f];  B,  C,  and  N  as  central  atoms 
( 9s,6p,  1  d ) / [ 7s, 5p, Id];  C  in  the  CH3  and  P(CH3)3  ligands  (9s,5p,ld)/[7s,4p,ld]; 
P  ( 12s,9p,ld)/[8s,6p,ld];  and  H  (6s, lp)/[4s, lp] .  Atomic  contractions  have  been 
used  throughout. 

For  the  naked  Au55  clusters  a  somewhat  smaller  orbital  basis  set  was  designed 
for  gold,  since  the  larger  clusters  with  their  increasing  average  coordination  numbers 
lead  to  approximate  linear  dependencies  caused  by  the  diffuse  exponents.  Therefore 
the  original  Au  (21s,  17p,  1  ld,7f)  basis  set  [16]  was  reduced  to  ( 19s,15p,10d,6f)  by 
dropping  the  most  diffuse  exponents.  The  resulting  basis  set  was  contracted  using 
atomic  orbitals  from  a  spin-restricted  calculation.  Test  calculations  on  the  octahedral 
cluster  Au6  and  the  icosahedral  cluster  AuJ3  produced  changes  of  less  than  1  pm 
in  the  bond  distances,  less  than  3  kJ/mol  in  the  binding  energy  per  atom  and  less 
than  5%  in  the  frequency  of  the  totally  symmetric  breathing  mode.  With  this  basis 
set  the  4345  electrons  of  Au55  are  described  by  3960  contracted  Gaussian-type 
orbitals. 

Both  Au55  clusters,  in  Ih  and  in  Oh  symmetry,  consist  of  two  shells  of  gold  atoms 
arranged  around  a  central  atom  (see  Fig.  1 ).  In  the  octahedral  cluster  all  gold-gold 
distances  were  constrained  to  be  the  same  and  only  the  total  symmetric  breathing 
mode  was  optimized.  The  icosahedral  cluster  features  two  different  bond  lengths, 
a  radial  “intershell”  distance  <irand  a  tangential  “intrashell”  distance  dt.  Their  ratio 
dr\dt  is  fixed  to  0.95105  [32].  Thus,  in  the  icosahedral  case  only  one  degree  of 
freedom  has  been  optimized,  too.  All  lcgto-df  calculations  on  the  AU55  clusters 
have  been  performed  with  the  spin-unrestricted  formalism  although  the  energetic 


Au55(Ih) 


Auss(Oh) 


Figure  1.  The  structures  of  the  icosahedral  (Ih)  and  cuboctahedral  (Oh)  Au55  clusters. 
The  central  atom  and  the  first  shell  are  indicated  by  shading. 
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differences  between  corresponding  levels  of  majority  and  minority  spin  were  found 
to  be  very  small  (less  than  0.1  eV),  as  is  to  be  expected  for  a  nonmagnetic  metal. 

The  jellium  model  calculations  for  Au55  were  performed  with  a  spherical  ho¬ 
mogeneous  background  charge  density  of  radius  R  =  629  pm  and  with  a  total  of 
55  electrons.  The  radius  R  corresponds  to  the  average  radius  of  the  icosahedral 
Au55  cluster  as  obtained  from  the  relativistic  lcgto-df  method  by  averaging  over 
the  radial  distances  of  the  second  shell  atomic  positions  and  adding  half  of  the 
tangential  nearest  neighbor  distance. 

The  exchange-correlation  potential  used  throughout  this  work  (in  LCGTO-df  as 
well  as  in  jellium  calculations)  was  employed  as  parameterized  by  Vosko,  Wilk, 
and  Nusair  (vwn)  [33]  based  on  the  essentially  exact  homogeneous  electron  gas 
data  of  Ceperley  and  Alder  [34]. 

Benchmark  Results  on  the  Gold  Dimer  Au2 

The  gold  dimer  is  a  popular  benchmark  compound  for  relativistic  electronic 
structure  methods  since  the  relativistic  effects  of  its  spectroscopic  constants  are 
among  the  largest  known.  The  scalar-relativistic  variant  of  the  lcgto-df  method 
yields  a  relativistic  bond  length  contraction  of  23  pm,  a  corresponding  increase  in 
the  vibrational  frequency  by  60  cm”1,  and  a  relativistic  increase  of  the  binding 
energy  by  about  60  kJ  /mol.  As  can  be  seen  from  Table  I  this  leads  to  excellent 
agreement  with  the  experimental  values  in  case  of  the  bond  length  and  the  vibrational 
frequency  whereas  the  binding  energy  is  too  large  compared  to  experiment.  Such 
overbinding  is  quite  frequently  encountered  in  local  density-functional  calcula¬ 
tions  [17,35]. 

The  present  LCGTO-DF  results  compare  quite  well  to  those  of  other  relativistic 
density  functional  methods.  Two  variants  of  the  Hartree-Fock-Slater  method  are 
listed  in  Table  I,  one  that  takes  relativistic  effects  into  account  by  first  order  per¬ 
turbation  theory  (hfs-fopt)  [36]  and  a  fully  relativistic  four-component  method 
(dfs-dvm)  [37].  Proper  treatment  of  electron  correlation  is  essential  if  one  aims 
for  an  accurate  description  of  Au2  in  the  framework  of  wave  function-based  ab 
initio  methods.  This  is  clearly  demonstrated  by  the  Hartree-Fock  (hf)  all-electron 
calculations  that  treat  relativistic  effects  by  first  order  perturbation  theory  [38]. 
Uncorrelated  (hf-fopt)  and  correlated  (hf-fopt-cpf)  calculations  yield  a  rel¬ 
ativistic  effect  on  the  bond  length  of  about  4  pm,  almost  no  effect  on  the  vibrational 
frequency,  and  a  strong  increase  of  the  binding  energy  by  92  kJ/mol,  but  even  the 
correlated  values  do  not  match  the  experimental  values  very  well.  The  authors 
attribute  this  result  to  the  basis  set  used  which  was  too  small  to  allow  for  an  efficient 
correlation  of  the  Au  5d  electrons.  The  role  of  the  Au  4f  electrons  for  the  electron 
correlation  was  pointed  out  by  Schwerdtfeger  who  found  an  3  pm  increase  of  the 
gold-gold  distance  by  neglecting  this  effect  [39]. 

The  different  pseudopotential  methods  quoted  in  Table  I  (rep-ci  [40,41],  arpp- 
cepa  [42],  and  arpp-QCi(t)  [39])  reproduce  the  experimental  values  more  or 
less  equally  well,  the  differences  being  due  to  varying  parameterizations  of  the 
pseudopotentials  and  to  the  different  treatments  of  the  electron  correlation.  At  this 
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Table  I.  Comparison  of  the  spectroscopic  constants  of  Au2  (bond  length  re,  vibrational  frequency  ue, 
binding  energy  De )  as  calculated  by  various  nonrelativistic  (nr)  and  relativistic  (rel)  electronic  structure 

methods. 


Method 

re 

[cm" 

De  [kJ/mol] 

nr 

rel 

nr 

rel 

nr 

rel 

HFS— FOPT8 

290 

244 

93 

201 

113 

243 

HFS/DFS-DV  Mb 

272 

251 

149 

191 

210 

305 

HF— FOPT0 

289 

267 

108 

156 

37 

80 

HF-FOPT— CPF41 

283 

263 

105 

151 

124 

172 

REP— CIe 

283 

247 

105 

165 

— 

218 

ARPP— CEPAf 

293 

254 

99 

170 

31 

179 

arpp-qci(t)8 

279 

254 

120 

178 

127 

205 

LCGTO-DF11 

270 

247 

135 

195 

215 

278 

Experiment1 

247 

191 

222 

a  Hartree-Fock-Slater  method,  relativistic  elfects  treated  by  first-order  perturbation  theory  (fopt) 
[36]. 

b  Nonrelativistic  Hartree-Fock-Slater  and  fully  relativistic  Dirac-Fock-Slater  method  [37]. 
c  Hartree-Fock  method  with  fopt  [38]. 

d  Same  as  footnote  (c),  but  electron  correlation  treated  by  the  coupled  pair  functional  method  [38]. 
e  Relativistic  effective-core  potential  configuration  interaction  method  [40,  41]. 
f  Spin-orbit  averaged  relativistic  pseudo-potential  method  with  coupled  electron  pair  approximation 
(cepa-1),  nonrelativistic  results  without  cepa-1  [42]. 

8  Same  as  footnote  (f),  but  with  quadratic  configuration  interaction  (single  and  double  substitutions 
as  well  as  some  triple  contributions)  [39]. 
h  Scalar-relativistic  variant  of  the  lcgto-df  method  [16]. 

1  Reference  [54]. 


time  the  arpp-qci(t)  calculation  [39]  represents  the  most  complete  calculation 
with  respect  to  the  inclusion  of  correlation  effects  in  a  wave  function-based  ab  initio 
method.  The  experimental  values  of  the  vibrational  frequency  and  of  the  binding 
energy  are  reproduced  very  well,  the  bond  length  is  calculated  too  long  by  7  pm. 
The  neglect  of  core- valence  correlation  yields  a  further  increase  of  the  bond  length 
by  12  pm  [39] .  It  is  clearly  an  advantage  of  the  present  lcgto-df  approach  that 
the  correlation  of  all  electrons  is  taken  into  account  in  the  framework  of  density- 
functional  theory. 

In  summary,  the  presented  relativistic  lcgto-df  method  performs  very  advan¬ 
tageously  compared  to  other  methods,  especially  if  one  takes  into  account  its  com¬ 
putational  efficiency  (50  s  of  CPU  time  on  one  processor  of  a  cray-y/mp  computer 
for  the  AU2  molecule ) . 


The  Electronic  Structure  of  Au55 

The  compound  Au55(PPh3)i2Cl6  (1)  [9]  has  been  well  characterized  by  a  large 
variety  of  spectroscopic  methods  [10].  Compound  1  has  attracted  much  interest 
in  the  recent  years  since  it  is  one  of  the  largest  stoichiometrically  well-characterized 
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gold  cluster  compounds.  Also,  one  envisages  possible  novel  applications  such  as 
the  creation  of  quantum  dot  structures  [  43] .  A  problem  of  particular  interest  is 
the  question  whether  the  Au55  core  is  large  enough  to  exhibit  metallic  behavior. 
To  supplement  the  many  experimental  investigations  of  compound  1  we  present 
nonrelativistic  and  relativistic  LCGTO-DF  calculations  on  unligated  Au55  clusters. 
As  compound  1  features  a  cuboctahedral  metal  core  [9]  the  calculations  for  Au55 
have  been  performed  employing  Oh  symmetry.  We  also  performed  calculations  on 
Au55  in  Ih  symmetry  since  for  unligated  pure  metal  clusters  the  icosahedral  structure 
often  dominates  [44]. 

First,  we  briefly  summarize  the  structural  results  of  the  calculations.  The  geometry 
optimization  furnished  an  Au-Au  distance  of  280  pm  for  Au55(Oh),  about  3%  less 
than  the  bulk  gold-gold  distance  of  288  pm  [45].  This  result  agrees  well  with  an 
experimental  estimate  of  the  average  gold-gold  distance  in  the  cluster  1  of  roughly 
4%  less  than  the  bulk  value  [10] .  The  calculated  average  Au-Au  distance  in  Au5s(Ih), 
281  pm,  is  nearly  the  same  as  in  Au55(Oh)  although  in  the  icosahedral  case  two 
different  values  for  the  next-neighbor  distance  occur,  84  radial  distances  of  272  pm 
and  1 50  tangential  distances  of  286  pm. 

Which  of  the  two  isomers  was  found  to  be  more  stable?  The  calculated  average 
binding  energies  per  atom,  3.59  eV  for  the  icosahedron  and  3.55  eV  for  the  cub- 
octahedron,  are  very  close  to  each  other  with  a  slight  preference  for  the  icosahedron. 
The  ligands  may  very  well  change  this  structural  advantage  so  that  the  present 
findings  in  no  way  exclude  a  cuboctahedral  structure  of  1 .  Both  values  for  the 
binding  energy  seem  to  be  already  rather  close  to  the  experimental  bulk  cohesive 
energy  of  3.80  eV  [45].  However,  if  one  attempts  a  proper  extrapolation  to  the 
bulk  based  on  a  series  of  naked  gold  clusters  [46]  one  deduces  an  overestimation 
of  the  bulk  energy  by  about  1.0  eV  in  the  local  density  approximation. 

The  jellium  model  works  well  for  alkali  clusters  [47,48].  Since  gold  is  an  also 
monovalent  metal  one  may  expect  this  model  to  provide  a  successful  description 
of  gold  clusters  as  well.  Obviously  the  participation  of  the  Au  5d  shell  in  the  bonding 
of  gold  compounds  [  3 1 ,49,50  ]  will  pose  restrictions  to  the  applicability  of  the  jellium 
model  but,  as  we  will  see  in  the  following,  at  least  some  qualitative  aspects  may  be 
carried  over.  The  results  of  a  spherical  jellium  model  calculation  for  the  55  s-type 
valence  electrons  of  Au55  (see  the  second  section  for  details  of  the  model)  are 
presented  in  Figure  2(a).  One  obtains  the  classical  ordering  of  jellium  levels  ( Is, 
lp,  Id,  2s,  If,  2p,  lg,  2d,  .  .  .)  with  the  lg  level  as  homo.  This  level  contains  15 
electrons  and  hence  lacks  three  electrons  to  the  next  shell  closing  which  is  consistent 
with  the  next  magic  number  of  58  electrons. 

Relativistic  effects  lead  to  a  radial  contraction  and  energetic  lowering  of  the  Au 
6s  orbitals  while  they  are  destabilizing  and  spatially  extending  the  Au  5d  orbitals 
[13].  This  improves  the  possibility  of  hybridization  between  the  two  types  of  orbitals. 
Thus,  one  expects  a  closer  match  of  the  jellium  electronic  structure  to  that  of  a 
full-potential  first  principles  calculation  in  the  nonrelativistic  case  where  the  energetic 
and  spatial  separation  of  the  Au  6s  and  5d  orbitals  is  larger.  Furthermore,  the 
icosahedral  cluster  Au55  resembles  more  the  spherical  jellium  symmetry  whence 
we  will  start  the  comparison  with  the  nonrelativistic  lcgto-df  one-particle  spec- 
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spherical  Au55  LCGTO-DF 

A 


a)  b)  c)  d)  e) 


Figure  2.  The  electronic  structure  of  Au55  Kohn-Sham  one-particle  energies  from  rela¬ 
tivistic  (rel)  and  nonrelativistic  (nr)  all-electron  LCGTO-DF  calculations  are  compared  to 
results  of  the  spherical  jellium  model.  Filled  bars  indicate  occupied  levels,  empty  bars 
unoccupied  levels,  (a)  Spherical  jellium  model  calculation  with  55  electrons  and  R  =  629 
pm.  (b)  Nonrelativistic  all-electron  LCGTO-DF  calculation  in  icosahedral  geometry  at  equi¬ 
librium.  (c)  Same  as  (b),  but  for  the  geometry  obtained  from  the  relativistic  calculation, 
(d)  Relativistic  lcgto-df  calculation  in  icosahedral  geometry,  (e)  Same  as  (d),  but  in 
octahedral  symmetry  at  equilibrium. 


trum  of  Au55(Ih)  shown  in  Figure  2(b).  All  levels  with  pertinent  s-character  (more 
than  about  50%  s  +  p  character  according  to  a  Mulliken  population  analysis)  are 
drawn  as  bars,  the  5d  manifold  is  indicated  by  a  shaded  box. 

The  first  striking  feature  is  the  discrete  level  spectrum  around  the  HOMO  (66  gg 
occupied  by  5  electrons)  with  an  average  spacing  of  about  half  an  eV.  In  the  case 
of  a  true  metallic  behavior,  one  would  expect  a  much  narrower  level  spacing  ap¬ 
proaching  the  continuous  level  spectrum  of  the  bulk.  On  the  other  hand,  if  the 
high  Ih  symmetry  were  broken,  many  more  levels  would  result  in  the  energy  region 
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around  the  HOMO,  thus  reducing  the  level  spacing  significantly.  The  d-band  is  quite 
narrow,  3.9  eV,  and  located  at  the  lower  end  of  the  s-band. 

Formal  imposition  of  a  lower  symmetry  constraint  on  the  spherical  jellium  leads 
to  a  splitting  of  the  irreducible  representations  as  shown  in  Table  II  for  Ih  and  Oh 
symmetry.  With  the  help  of  this  correlation  table  it  is  possible  to  establish  a  one- 
to-one  correspondence  between  the  jellium  levels  [Fig.  2(a)]  and  those  lcgto-df 
Kohn-Sham  levels  which  exhibit  predominantly  sp-character  [Fig.  2(b)].  If  one 
takes  the  symmetry  reduction  into  account,  one  notes  that  the  jellium  model  cor¬ 
rectly  predicts  the  ordering  of  all  the  occupied  all-electron  levels  of  s-character.  The 
jellium  levels  whose  degeneracy  is  reduced  in  Ih  symmetry  (If,  lg,  lh,  .  .  .)  are 
split  by  about  0.5  eV  in  the  all-electron  calculation.  The  two  lowest  levels  of  s- 
character  fall  within  the  range  of  the  d-band  and  are  distributed  to  a  certain  degree 
over  several  levels  of  d-character,  a  consequence  of  the  weak  but  noticeable  s-d 
mixing  even  in  a  nonrelativistic  description  of  gold. 

As  a  next  step  on  the  way  to  the  one-electron  spectrum  of  a  relativistic  calculation, 
the  Au55(Ih)  cluster  is  compressed  from  its  nonrelativistic  equilibrium  structure 
( JAuAu  =  299  pm)  to  its  calculated  relativistic  equilibrium  geometry  (dA uAu  =  281 
pm),  but  still  treated  nonrelativistically.  The  results  [Fig.  2(c)]  may  be  described 
mainly  as  a  broadening  of  the  d-band  (from  3.9  to  5.4  eV)  and  a  lowering  of  the 
s-levels,  the  latter  ones  being  most  distinct  in  case  of  the  low-lying  bonding  levels. 
In  the  next  step  the  relativistic  treatment  was  switched  on.  The  resulting  level  spec¬ 
trum  is  shown  in  Figure  2(d).  The  width  of  the  d-band  increases  further  (from  5.4 
to  6.3  eV),  mainly  due  to  rearrangements  at  the  upper  end  of  the  5d  manifold. 
The  highest  lying  Au  5d  levels  now  have  energies  above  -7  eV.  On  the  other  hand, 
the  Au  s-manifold  is  lowered  in  energy.  As  a  consequence,  the  two  lowest  lying  s- 
type  levels  now  drop  out  of  the  range  of  the  d-band  whereas  some  of  the  higher 
lying  s-levels  sink  into  the  upper  part  of  the  d-band.  Since  relativistic  effects  strongly 
enhance  s-d  mixing  in  gold  as  explained  above  some  of  the  s-levels  situated  within 
the  d-band  (e.g.,  jellium  If  -►  gu  +  t2u  in  Ih)  can  no  longer  be  discriminated  as 
their  contribution  is  spread  out  over  a  large  number  of  d-levels. 

In  the  last  column  of  Figure  2,  the  effect  of  further  reducing  the  symmetry  of  the 
AU55  cluster  from  Ih  to  Oh  is  demonstrated.  Some  of  the  Aussflh)  levels  split  (Table 


Table  II.  Correlation  table  for  the  irreducible  representations  of  the  infinite  rotation 
group  0(3)  and  the  Ih  and  Oh  point  groups. 


Angular 

momentum 

0(3) 

symmetry 

Ih  symmetry 

Oh  symmetry 

0 

s 

A 

aig 

1 

P 

t.u 

tlu 

2 

d 

hg 

eg  +  t2g 

3 

f 

t2u  +  gu 

a2u  -h  tlu  4-  t2u 

4 

g 

hg  4"  gg 

aig  +  eg  +  tig  4-  t2g 

5 

h 

tlu  +  t2u  +  hu 

eu  +  thi  +  2  X  t2u 
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II).  Particularly  noteworthy  is  the  five-fold  occupied  gg  homo  in  Au55(Ih)  which 
splits  into  a  five-fold  occupied  tlg  homo  and  an  empty  aIg  lumo  in  Au55(Oh). 
However,  the  splitting  is  rather  small,  0. 1 5  eV.  The  width  of  the  d-band  essentially 
remains  unchanged  from  Ih  (6.3  eV)  to  Oh  (6.1  eV).  Both  values  agree  quite  well 
with  the  experimentally  determined  width  of  the  d-band  in  bulk  gold,  5.7  ±  0.3  eV 
[51],  indicating  near  bulk-like  behavior  for  this  property.  A  further  point  worth 
noting  is  the  position  of  the  most  bonding  valence  level  of  s-type  which  is  lower  in 
energy  by  about  0.3  eV  in  the  icosahedral  than  in  the  cuboctahedral  Au55  cluster. 
This  finding  is  consistent  with  the  slightly  stronger  average  binding  energy  of  the 
former  cluster. 


The  Effect  of  Phosphine  Substituents  in  MeAuPR3 

Triphenylphosphine  PPh3  is  a  very  common  ligand  in  gold  cluster  compounds, 
in  particular  due  to  the  extensive  steric  shielding  it  affords.  On  the  other  hand,  this 
compound  causes  a  heavy  burden  in  any  electronic  structure  calculation.  Thus,  in 
electronic  structure  calculations,  triphenylphosphine  is  most  often  modeled  by  sim¬ 
ple  phosphines  PH3.  However,  it  is  well  known  that  the  phosphine  substituents  are 
not  innocent  in  any  ligand  metal  bonding.  Therefore  it  seems  worthwhile  to  in¬ 
vestigate  to  which  extent  it  is  justified  in  electronic  structure  calculations  to  model 
the  triphenylphosphine  ligands  PPh3  of  existing  compounds  by  simple  phosphines 
PH3 .  Compounds  of  the  type  MeAuPR3  afford  the  simplest  systems  for  studying 
this  effect  as  they  are  tractable  by  the  relativistic  lcgto-df  method  up  to  Me- 
AuPPh3.  Furthermore,  experimental  data  are  available  for  the  two  compounds 
MeAuPMe3  and  MeAuPPh3 . 

Good  agreement  is  achieved  between  calculated  results  and  available  experimental 
values  of  observables  (see  Table  III).  Thus,  the  observed  trends  should  be  mean¬ 
ingful.  The  obvious  discrepancy  in  the  gold-carbon  bond  length  of  MeAuPPh3 
between  experiment  and  calculation  has  been  discussed  elsewhere  [30] .  The  Au-P 
bond  distance  in  MeAuPR3  (see  Table  III)  does  not  show  a  monotonic  trend  along 
the  series  R  =  H,  Me,  Ph:  the  methyl-substituted  compound  marks,  in  a  sense,  an 
extremum.  As  a  consequence,  the  PH3  ligand  may  be  taken  as  a  satisfactory  model 
ligand  for  PPh3  with  respect  to  structural  aspects.  The  values  of  the  dipole  moment, 
the  first  ionization  potential,  and  the  binding  energy  show  a  clear  convergence 
along  the  series  of  substituents  R  =  H,  Me,  Ph.  For  these  observables,  the  agreement 
between  PH3  and  the  target  PPh3  is  rather  approximate  whereas  PMe3  provides  an 
almost  quantitative  model  for  PPh3.  The  monotonic  decrease  in  the  ionization 
potential  from  9. 10  eV  for  MeAuPH3  to  8. 13  eV  for  MeAuPPh3  corresponds  to  an 
increase  in  the  electron  donating  capability  of  the  ligands  [  52  ]  which  is  also  reflected 
in  the  increase  of  the  dipole  moment. 

As  an  aid  in  understanding  the  stabilization  of  gold  cluster  compounds  by  phos¬ 
phine  ligands,  it  is  illuminating  to  compare  the  dissociation  energies  of  AuPR3  and 
AuPR3  (see  Table  III).  The  binding  energy  of  PR3  to  a  gold  cation  is  more  than 
three  times  larger  than  to  a  neutral  gold  atom.  The  value  of  the  binding  energy 
increases  along  the  series  R  =  H,  Me,  Ph,  in  the  neutral  systems  by  60  kJ/mol,  in 
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Table  III.  Comparison  of  selected  properties  (bond  distances  dAuc  and  dAuP,  dipole  moment  n,  first 
ionization  potential  ip,  and  various  binding  energies  BE)  of  the  compound  MeAuPR3  for  the  substituents 
R  =  H,  Me,  Ph.  Available  experimental  values  are  given  in  parentheses. 


Property 

R  =  H 

R  =  Me 

R  =  Ph 

dA uC  (MeAuPR3)  [pm] 

200 

203  (203a) 

202  (2 1 2b) 

dAuP  (MeAuPR3)  [pm] 

229 

225  (228a) 

228  (227b) 

M(MeAuPR3)  [D] 

3.84 

5.57  (5.5C) 

5.80  (5.6d) 

lp(MeAuPR3)  [eV] 

9.10 

8.34  (8.24c) 

8.13 

BE(MeAuPR3)f  [kJ/mol] 

556 

610 

620 

BEfAuPRa)8  [kJ/mol] 

124 

176 

184 

BE(AuPR3)h  [kJ/mol] 

401 

553 

580 

be(HPR3/  [kJ/mol] 

762  (782k) 

906  (935k) 

924 

a  Reference  [55]. 
b  Reference  [56]. 
c  Reference  [57]. 
d  Reference  [58]. 
c  Reference  [59]. 

f  Reference  energy:  fragments  Me,  Au,  and  PR3 . 
8  Reference  energy:  fragments  Au  and  PR3. 
h  Reference  energy:  fragments  Au+  and  PR3. 

’  Reference  energy:  fragments  H+  and  PR3. 
k  Reference  [60]. 


the  cationic  case  by  1 80  kJ  /mol.  Since  no  experimental  data  seem  to  be  available 
for  the  “gold  ion  affinities”  of  phosphines,  these  results  might  be  compared  to  the 
proton  affinities  of  PR3  moieties  where  good  agreement  between  experimental  and 
calculated  values  is  found  (Table  III).  The  absolute  values  of  the  energy  changes 
are  almost  by  a  factor  of  two  larger  in  the  case  of  a  proton  as  compared  to  Au+, 
but  the  change  from  PH3  to  PPh3  with  about  160  kJ/mol  is  comparable  to  that 
calculated  for  gold. 

Returning  to  the  question  raised  at  the  beginning  of  this  section  we  may  sum¬ 
marize  this  study  in  the  following  strategy.  Simple  phosphine  ligands  may  be  used 
during  the  geometry  optimization  of  an  electronic  structure  calculation,  but,  if 
desirable  and  feasible,  a  last  calculation  at  the  final  geometry  should  be  carried  out 
employing  trimethylphosphine  ligands  to  improve  the  energetic  aspects. 

The  Stability  of  Element-Centered  Octahedral  Gold  Cluster  Cations 

Finally,  we  turn  to  the  recently  synthesized  octahedral  gold  cluster  cations  which 
feature  unusually  coordinated  main-group  elements,  namely  the  carbon  centered 
cluster  [(LAu)6C]2+  [4]  where  L  =  PPh3.  Strong  efforts  to  synthesize  its  isoelectronic 
neighbors  [(LAu)6B]+  and  [(LAu)6N]3+  [3,53]  have  so  far  not  met  with  success. 
It  seems  as  if  the  carbon-centered  octahedral  cluster  cation  exhibits  a  particular 
stability.  Therefore,  we  set  out  to  investigate  the  isoelectronic  series  of  cations 
[(LAu)6Xwr+  with  X,  =  B,  X2  =  C,  and  X3  =  N  in  a  quantum  chemical  study. 
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In  the  following,  we  present  results  that  point  toward  an  important  role  of  the 
ligands  in  stabilizing  these  cluster  cations  [31] .  Following  the  computational  study 
established  in  the  preceding  section,  we  optimized  the  geometry  of  the  clusters  using 
model  phosphine  ligands  (L  =  PH3)  and  employed  trimethylphosphine  ligands 
(L  =  PMe3 )  at  the  obtained  equilibrium  geometry  to  improve  the  values  for  the 
binding  energies. 

The  stabilizing  effect  of  the  central  atom  is  reflected  in  the  results  compiled  in 
the  first  part  of  Table  IV.  In  the  unligated  clusters  [Au6Xm]"J+  the  binding  energy 
of  the  central  atom  decreases  from  boron  to  nitrogen  in  agreement  with  orbital 
overlap  arguments  advanced  earlier  [31,50].  Ligands  clearly  influence  the  bonding 
of  the  central  atom  to  the  gold  cluster  cage;  for  PMe3  ligands  this  binding  energy 
increases  by  1 10  kJ/mol  for  boron,  by  297  kJ/mol  for  carbon,  and  by  460  kJ/mol 
for  nitrogen.  Thus,  carbon  acts  as  the  most  stabilizing  central  atom  as  indicated  by 
underlining  in  Table  IV.  On  the  other  hand,  it  is  possible  to  directly  interpret  the 
stabilizing  influence  of  the  ligands  by  means  of  the  ligand  shell  binding  energy  (see 
Table  IV,  lower  part)  which  rises  significantly  along  the  series  of  central  atoms 
Xm  =  B  -►  C  -►  N.  Recalling  that  the  binding  energy  of  PR3  to  a  gold  cation  is 
substantially  larger  than  that  to  the  neutral  gold  atom  (Table  III),  one  discovers  a 
correlation  between  the  cluster  stabilization  by  the  ligand  shell  and  the  increasing 
charge  of  the  cluster  cations.  Furthermore,  one  notes  that  the  difference  in  the 
ligand  shell  binding  energy  between  PH3  and  PMe3  increases  considerably  from 
boron  (AE  =  202  kJ/mol)  to  nitrogen  (A E  =  560  kJ/mol),  emphasizing  once 
more  the  necessity  of  using  trimethylphosphine  model  ligands  when  studying  ener¬ 
getic  aspects. 

After  having  established  the  stabilizing  effect  of  the  ligands,  we  will  discuss  the 
stability  of  the  empty  cage  itself  and  how  it  is  affected  by  its  net  charge.  Table  V 
shows  the  total  binding  energies  of  a  series  of  four  empty  cluster  models  of  increasing 
complexity  for  various  total  charges  of  the  cluster  m ;  in  each  row  of  the  table  the 
most  stable  cluster  is  indicated  by  underlining.  When  proceeding  along  the  first 
four  rows  of  Table  V  one  notices  that  in  each  row  the  net  charge  of  the  most  stable 


Table  IV.  Binding  energy  (in  kJ/mol)  of  the  central  atom  (X!  =  B,  X2  =  C,  X3  =  N)  inside  the  Au6 
cage  and  of  the  ligand  shell  to  the  gold  cage  for  differently  substituted  phosphine  ligands. 


Reaction 

B 

C 

N 

iau  6r+  +  xm  ->[Au6xwr+ 

-790 

-666 

-212 

[(H3PAu)6]m+  +  Xm  -  [(H3PAu)6Xm]m+ 

-903 

-949 

-657 

[(Me3PAu)6r+  +  Xm  ->  [(Me3PAu)6Xw]m+ 

-900 

-963 

-672 

[Au6Xwr+  +  6  PH3  -  [(H3PAu)6Xmf”+ 

-1197 

-1783 

-2482 

[Au6Xmr+  +  6  PMe3  -*■  [(Me3PAu)6Xm]m+ 

-1399 

-2152 

-3042 
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cluster  clearly  increases.  To  quantify  this  effect  to  a  certain  degree  we  also  display 
the  (fractional)  charge  m  for  which  the  total  energy  assumes  its  minimum  as  de¬ 
termined  by  interpolating  the  values  along  a  row. 

For  an  octahedral  array  of  point  charges  only,  the  Coulomb  repulsion  obviously 
increases  when  proceeding  along  the  row  (m  =  0.0).  For  the  naked  gold  clusters 
this  electrostatic  repulsion  is  overcompensated  by  the  covalent  gold-gold  interaction 
up  to  cluster  charges  as  high  as  +3;  the  singly  charged  cluster  (m  =  1 .0)  is  the  most 
stable  one.  Attaching  phosphine  ligands  to  the  cluster  further  shifts  the  position  of 
maximum  stability  toward  the  dication  (m  =  2.1).  Here  again  the  ability  of  the 
ligands  to  provide  increasing  stabilization  for  clusters  with  a  higher  net  charge 
shows  up.  The  trimethylphosphine  ligands  enhance  this  effect,  furnishing  maximum 
stability  for  the  cluster  [(Me3PAu)6]3+  (m  -  2.7). 

Since  the  binding  energy  of  the  central  atom  is  smallest  for  nitrogen  (see  Table 
IV)  the  stability  maximum  of  the  cluster  cations  is  shifted  back  to  a  lower  net 
cluster  charge  when  the  central  atom  is  taken  into  account.  In  the  case  of  a  ligand 
free  cluster  model  [Au6Xm]m+  the  boron-centered  cluster  was  found  to  be  most 
stable  ( m  =  1.0),  whereas  the  carbon-centered  cluster  turns  out  to  be  the  most 
stable  one  in  the  isoelectronic  series  ( m  =  1 .8  for  L  =  PH3 ,  m  =  2. 1  for  L  =  PMe3 ), 
once  the  ligands  are  taken  into  account. 

Thus,  three  main  energy  trends  may  be  identified  when  analyzing  the  isoelectronic 
series  of  cluster  cations  [(LAu)6Xm]m+  for  the  central  atoms  boron,  carbon,  and 
nitrogen:  (1)  a  decreasing  affinity  of  the  central  atom  to  the  cluster  cage,  (2)  an 


Table  V.  Total  binding  energies3  (in  kJ/mol)  of  various  octahedral  empty  and  element-centered  model 

clusters. 


Charge  m  = 

0 

+  1 

+2 

+3 

+4 

m 

Empty  cluster  models 

ter" 

0 

202 

802 

1818 

3232 

0.0 

[Au6r+ 

-1289 

-1504 

-1266 

-593 

523 

1.0 

[(H3PAu)6]m+ 

-2083 

-2588 

-2766 

-2631 

-2143 

2.1 

[(Me3PAu)6jm+ 

-2159 

-2793 

-3121 

-3176 

-2928 

2.7 

Element-centered  clusters 

X  =  B 

X  =  C 

X 

il 

2 

[Au6Xmr+ 

-2294 

-1932 

-805 

1.0 

[(H3PAu)6Xw]m+ 

-3491 

-3715 

-3287 

1.8 

[(Me3PAu)6Xm]m+ 

-3693 

-4084 

-3848 

2.1 

The  maximum  of  the  binding  energy  along  each  row  of  varying  charge  m  is  indicated  by  underlining. 
Also  given  is  the  value  m  as  determined  by  a  least  squares  fit  to  a  parabola  for  which  the  energy  assumes 
its  minimum  in  a  given  row  of  the  table. 

a  Total  binding  energy  with  respect  to  the  constituents  Xm,  m  X  Au+,  (6  -  m)  X  Au,  and  6  X  PR3 
(R  =  H,  Me). 

b  Array  of  point  charges  (charge  q  =  +m/6)  in  the  geometry  of  [Au6]2+  {dqq  =  270  pm). 
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increasing  electrostatic  repulsion  due  to  the  rising  cluster  charge,  and  ( 3 )  a  strong 
counteracting  effect  brought  about  by  the  improved  stabilization  due  to  the  ligand 
shell.  The  net  effect  of  all  these  trends  results  in  a  stability  maximum  for  the  carbon- 
centered  cluster  although  the  energy  differences  to  the  boron-  and  nitrogen-centered 
clusters  are  small. 


Summary  and  Conclusions 

We  have  presented  a  broad  range  of  applications  of  the  relativistic  lcgto-df 
method  to  naked  and  ligated  gold  clusters.  All-electron  calculations  on  icosahedral 
and  cuboctahedral  Au55  clusters  demonstrate  the  efficiency  of  the  method.  They 
provided  new  insight  into  the  electronic  structure  of  these  systems  and  showed  the 
spherical  jellium  model  to  be  adequate  for  a  qualitative  analysis  of  the  manifold 
of  s-type  one-electron  valence  levels. 

The  computational  efficiency  of  the  method  was  also  important  for  allowing  an 
investigation  on  the  series  of  model  compounds  MeAuPR3  with  different  phosphorus 
substituents  R.  The  simple  phosphine  ligands  PH3 ,  often  used  in  quantum  chemical 
studies  instead  of  the  experimental  triphenylphosphine  ligands,  were  found  to  be 
satisfactory  models  for  structural  investigations  of  gold  compounds  whereas  PMe3 
ligands  have  to  be  employed  to  achieve  an  accurate  description  of  binding  energies. 
The  influence  of  the  phosphine  ligands  turned  out  to  be  essential  for  rationalizing 
the  high  stability  of  the  carbon-centered  octahedral  gold  cluster  cation  and,  in  par¬ 
ticular,  the  preferred  role  of  carbon  in  the  isoelectronic  series  [(Ph3PAu)6Xw]m+ 
with  Xw  =  B,  C,  N.  The  main  stabilizing  reason  was  found  in  the  high  affinity  of 
the  phosphine  ligand  shell  to  positively  charged  clusters  that  counteract  the  trend 
of  decreasing  stability  of  the  naked  cluster  core  along  the  series  B  ->  C  N. 

In  summary,  the  relativistic  Lcgto-df  method  was  shown  to  be  a  useful  tool 
for  studying  the  electronic  structure  of  large  gold  cluster  compounds  and  will  allow 
to  tackle  many  interesting  problems  of  heavy  element  systems. 
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Abstract 

A  first-principles  quantum  molecular  dynamics  scheme  based  on  density-functional  theory  is  used  to 
investigate  the  finite-temperature  properties  of  metallic  hydrogen  at  density  2.7  g  cm-3  (rs  =  in 
particular  to  determine  melting  temperature,  velocity  autocorrelation  functions  below  and  above  melting, 
and  the  temperature  dependence  of  proton  self-diffusion  in  the  liquid  phase.  A  strong  deviation  from 
the  prediction  of  the  one-component  plasma  model  is  found  in  the  value  of  the  melting  temperature. 
Results  of  simulations  of  the  molecular  phase  are  also  presented  and  found  to  be  in  agreement  with 
previous  work  on  dense  molecular  hydrogen.  The  scheme  is  based,  for  the  modeling  of  the  electrons,  on 
time-dependent  Schrodinger  equations,  coupled  to  classical  equations  of  motion  for  the  protons.  All 
numerical  results  were  obtained  using  a  parallel  computer,  and  an  outline  of  the  computer  program 
implementation  is  presented.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Recently,  a  quantum  molecular  dynamics  scheme  for  condensed-matter  systems, 
based  on  time-dependent  density-functional  theory,  was  implemented  by  the  author 
[1,2],  and  principally  applied  to  the  simulation  of  solid  and  liquid  sodium,  although 
preliminary  work  on  hydrogen  was  also  presented  in  Ref.  [  1  ] .  In  this  scheme,  the 
electrons  are  modeled  quantum  mechanically  by  a  set  of  time-dependent  wave 
functions,  evolving  according  to  Schrodinger-like  equations,  whereas  the  ions  obey 
classical  equations  of  motion.  While  this  approach  is  similar  to  the  Car-Parrinello 
scheme  [  3  ] ,  the  unitarity  of  the  electronic  evolution  operator  automatically  insures 
that  orthonormality  of  the  wave  functions  is  maintained  at  all  times.  Explicit  or¬ 
thonormality  constraints  are  not  required  in  the  electronic  equations  of  motion, 
resulting  in  algorithmic  simplification. 

In  the  present  work,  we  extend  the  hydrogen  simulations  begun  in  Ref.  [1] ,  and 
explore  the  properties  of  metallic  hydrogen  (p  ^  1.2  g  cm”3),  especially  as  regards 
the  determination  of  the  melting  temperature  and  the  calculation  of  proton  self¬ 
diffusion  in  the  liquid.  With  this  focus,  our  work  is  complementary  to  that  reported 
by  Zerah,  Clerouin,  and  Pollock  [4] ,  where  a  scheme  based  on  the  Thomas-Fermi 
density  functional  was  used  to  investigate  dense  plasma  physics,  and  that  reported 
by  Hohl  et  al.  [  5  ] ,  which  extensively  studied  the  dense  molecular  phases  of  hydrogen, 
using  the  original  Car-Parrinello  method.  In  the  present  article,  we  also  present 
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simulation  results  of  the  molecular  phase,  verifying  the  validity  of  our  scheme  in 
that  regime  as  well. 

The  present  numerical  work  was  entirely  achieved  on  a  parallel  computer,  the 
Thinking  Machines  CM-5.  Given  the  growing  use  of  parallel  computers  for  con¬ 
densed  matter  physics  simulations  [6],  we  sketch  here  the  computer  program  im¬ 
plementation  of  our  scheme  in  a  high-level  language,  “High  Performance  Fortran,” 
and  report  performance  figures  for  varying  machine  sizes. 

Equations  of  Motion  of  the  Electron-Ion  System 

As  the  theoretical  basis  and  algorithmic  implementation  of  the  method  used  were 
presented  in  some  detail  in  Refs.  [1]  and  [2],  only  a  brief  outline  of  the  scheme 
will  be  given  here.  In  what  follows  we  use  atomic  units  (a.u.),  so  that  with  me 
denoting  the  electron  mass  and  e  its  charge,  length  is  given  in  units  of  the  Bohr 
radius  a0  =  h2/mee2  =  0.529  A,  energy  in  units  of  1  hartree  =  e2/ao  =  27.2  eV 
and  time  in  units  of  h  / 1  hartree  =  2.42  10~17  s. 

We  consider  a  system  of  N  electrons  interacting  with  N  classical  protons  of  mass 
M,  the  effect  of  each  of  which  is  modeled  by  a  nonsingular,  local  pseudopotential 
i^s(x).  The  ionic  positions  at  time  t  are  given  by  { R*(f) } ,  k  =  1,  2,  . . . ,  N,  and, 
with  double  occupancy  of  orbitals,  \[/j(x)J  ~  1,  2, . . . ,  N0  =  N/2  denote  the  time- 
dependent  electron  wave  functions.  The  complete  set  of  equations  of  motion  is 
given  by  (keeping  the  variables  e  =  me  =  1  explicit)  [2] 

d  1 

i  —  ij(\,  t)  =  -  - —  V2\J/j(x,  t)  +  vclr(x,  t,  [«])^(x,  /), 

ot  Zme 

j=  1 ,2,...,N0  =  N/2,  (1) 

d 2 

M-^Rk(t)  =  Fk(t),  k=  1,2,...,  N,  (2) 

dt 

where  is  the  total  force  on  the  kth  proton, 

F*w  =  v'ir,(o-57oT~  J  d'x  VpAx  ~  R*W)Vn(x’  (3) 

and  where  the  electronic  number  density  is  given  by 

n(x,t)  =  I'Z  \ij(x,t)\2.  (4) 

i- 1 

The  effective  potential  acting  on  the  electrons  is 

Ni 

vtff(x,  t,  [ n ])  =  2  Vps(x  -  R *(<))  +  v„(x,  t)  +  0)  (5) 

k=  1 

where  vH(x,  t)  is  the  electrostatic,  Hartree  potential,  and  where  the  exchange  and 
correlation  term  fixc(n)  is  the  local  derivative 
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Pxc(n)  =  (6) 

where  exc(n)  is  given  by  the  ground-state  local-density  approximation  (lda)  of 
density-functional  theory  [  7  ] . 


Hydrogen  Pseudopotential 


The  local  hydrogen  pseudopotential  is  of  the  form  of  Topp  and  Hopfield  [8], 


vPs(r)  = 


v0  cos (kr)  +  C, 
-Ur, 


0  <  r  <  rc, 
rc<  r, 


(7) 


where  all  four  parameters  v0,  k,  C,  and  rc  are  adjustable.  For  given  values  of  rc  and 
k,  u0  and  C  are  determined  by  requiring  continuity  of  v(r)  and  its  first  derivative 
across  the  boundary  r=rc.  For  a  given  value  of  rc,  k  is  then  determined  by  requiring 
that  the  energy  of  the  Is  state  in  the  potential  of  Eq.  (7)  has  the  correct  value  of  1 
Ry  =  -1/2  hartree.  This  is  done  by  repeatedly  solving  for  the  energy  eigenvalue  e 
of  the  radial  Schrodinger  equation  for  u(r)  =  rypXs(r ), 

1  d 2 

+  vps(i-)u(r)  =  eii(r),  (8) 


successively  adjusting  the  value  of  k  so  as  to  finally  obtain  e  =  — 1/2. 

The  choice  of  the  value  of  rc  is  guided  by  the  desire  to  simultaneously  minimize 
the  amplitude  of  v(r)  at  the  origin,  and  its  extent  in  Fourier  space,  while  insuring 
that  the  electronic  charge  density  is  accurately  represented  for  typical  nuclear  sep¬ 
arations.  Choosing  rc=  1 .4  as  a  reasonable  compromise,  one  finds  the  remaining 
parameters 

k  =  2.04137,  v0  =  -0.89298,  C=  -1.57157.  (9) 

While  the  fitting  procedure  outlined  above  is  constrained  only  by  the  energy  of  the 
atomic  ground  state,  it  turns  out  that  the  energies  of  the  excited  states  in  the  potential 
of  Eq.  (7 )  are  also  accurately  represented  [  8  ] .  Furthermore,  the  normalized  pseudo 
wave  function  resulting  from  Eq.  (8)  is  accurate  for  r  ^  rcf  2,  so  that  the  potential 
of  Eq.  (7)  is  of  the  “norm-conserving”  variety,  insuring  transportability  into  bulk 
hydrogen  [9]. 


Method  of  Solution 

The  equations  of  motion  for  the  protons,  Eqs.  (2),  are  solved  using  the  Verlet 
(or  “leapfrog”)  algorithm  [2] .  In  the  calculation  of  the  effective  electron  potential, 
Eq.  (5),  the  Hartree  term  U//(x,  t)  is  found  by  solving  Poisson’s  equation  with 
periodic  boundary  conditions,  using  a  three-dimensional  fast  Fourier  transform 
(FFT ).  The  most  demanding  part  of  the  computation  is,  however,  in  the  advance¬ 
ment  of  the  time-dependent  Schrodinger  equations  for  the  electrons,  Eq.  ( 1 ):  for 
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each  wave  function,  evolution  from  time-step  tn  to  tn+l  =  tn  +  A/,  n  =  0,  1,2,..., 
is  accomplished  by  using  a  “split-step”  decomposition  of  the  short-time  propagator, 

ij(\,  tn+')  =  exp|-i'y  Ue(r(x,  /"+l/2)j-exp(-(Ar(-V2/2mc)) 

X  exp|-/y  verr(x,  *"+1/2)j i^(x,  r")  (10) 

where  t)eff(x,  r”+1/2)  is  found  from  a  previous  half-step  extrapolation.  The  elfect  of 
the  operator  exp(— /A/(— V2/2me))  is  computed  using  three-dimensional  ffts,  while 
the  effect  of  exp(-/(  Ar/2)t;eff(x,  tn+l/2))  is  found  from  direct  multiplication  in 
real  space.  Because  Eq.  ( 10)  expresses  an  explicitly  unitary  transformation,  it  con¬ 
serves  to  round-off  accuracy  the  norm  and  orthogonality  of  all  of  the  wave  functions. 
The  scheme  is  otherwise  0(At3)  accurate. 

Parallel  Implementation 

The  algorithm  outlined  above  was  implemented  on  the  Thinking  Machines  CM- 
5,  a  distributed-memory,  parallel  computer  of  the  mimd  type  of  architecture  (mimd: 
Multiple  Instructions,  Multiple  Data).  The  program  was  written  in  CM  Fortran,  a 
subset  of  the  “High  Performance  Fortran”  (hpf)  language  [10].  Under  hpf,  many 
details  of  the  computer  architecture,  pertaining  to  its  multiplicity  of  processors  and 
distributed  memory,  are  largely  hidden  to  the  user,  who  is  presented  with  a  single, 
“global”  model  of  the  machine. 

For  instance,  the  entire  set  of  three-dimensional  electronic  wave  functions  i pj(x, 
t),j  =  1,2,...,  N/2,  is  represented  by  the  single  Fortran  variable  PSI,  referring  to 
a  four-dimensional,  complex-valued  array:  the  first  index  of  PSI  indicates  the  orbital 
j9  and  the  other  three  indices  refer  to  spatial  location  on  a  Cartesian  grid.  The  actual 
allocation  at  run  time  of  distributed  computer  memory  for  the  array,  and  the  division 
of  labor  among  processors  during  program  execution,  when  PSI  is  Fourier  trans¬ 
formed,  multiplied  by  other  arrays  and  so  forth,  is  implemented  by  the  hpf  compiler 
without  requiring  detailed  instructions  from  the  programmer.  The  programmer 
retains  control  over  data  allocation  through  the  use  of  general  compiler  directives, 
which  for  instance  can  specify  which  axes  of  the  array  must  physically  lie  in  the 
same  processor.  The  result  is  a  “scalable”  code,  which  can  run  on  any  machine 
size,  provided  enough  memory  is  available.  For  the  present  simulations,  the  same 
compiled  program  has  been  run  on  machines  with  32,  64  or  128  processors. 

Programming  a  parallel  computer  with  a  high-level  language  such  as  hpf  is  to 
be  contrasted  with  the  “message-passing”  approach,  in  which  the  programmer  ex¬ 
plicitly  implements  memory  allocation  and  division  of  labor  among  the  parallel 
processors,  in  effect  replacing  the  compiler  in  these  tasks.  The  two  approaches  are 
largely  complementary.  The  present  problem,  with  its  large,  homogeneous  set  of 
three-dimensional  wave  functions,  appears  well  suited  to  an  hpf  implementation. 
To  give  an  indication  of  performance,  a  Fortran  77  implementation  of  the  algorithm, 
previously  optimized  for  a  Cray  architecture  [1],  requires  4.6  s  per  time-step  on 
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one  processor  of  a  Cray  Y-mp  computer  for  a  54-atom  system.  The  corresponding 
times  achieved  by  the  HPF  implementation  are  3.4,  1.9,  2.0  and  1. 1  s  per  time-step 
on,  respectively,  32  and  64-processor  CM-5  computers  (Sparc  processors),  and  32 
and  64-processor  CM-5E  computers  (SuperSparc  processors). 

Simulation  Results 


Metallic  Hydrogen 

The  simulations  were  conducted  in  a  fashion  similar  to  those  presented  in  Ref. 
[2].  We  considered  a  sample  of  metallic  hydrogen,  composed  of  54  protons  and 
54  electrons,  enclosed  in  a  cubical  region  of  side  L  with  periodic  boundary  con¬ 
ditions,  with  the  protons  initially  set  at  the  sites  of  an  atomic  bcc  lattice.  The  density 
is  defined  in  terms  of  the  ion-sphere  radius, 


where  np  =  N/L3  is  the  average  proton  number  density.  The  spatial  meshes  used 
to  define  the  electronic  wave  functions  are  each  of  extent  nx  X  ny  X  nz  =  (32)3, 
with  energy  cutolf  of  136  hartree  for  rs  =  1.  Corresponding  to  the  initial  proton 
positions,  the  electronic  ground  state  was  determined  by  steepest-descent  mini¬ 
mization  of  the  electronic  energy.  Figure  1  is  a  representation  of  the  initial  config¬ 
uration  for  rs  =  1,  in  which  the  protons  (small  spheres),  are  encapsulated  by  the 
electron  density  isosurface,  calculated  for  a  density  half-way  between  minimum 
and  maximum.  The  great  regularity  of  the  electron  distribution  can  be  seen  in  the 
left-hand  comer  of  Figure  1,  where  we  show  a  contour  plot  of  a  cross  section  of 
the  density,  taken  through  the  first  plane  of  protons. 

The  initial  ground  state  energy  E0  as  a  function  of  ion-sphere  radius  rs  is  shown 
in  Figure  2(a)  for  the  range  of  0.6  <  rs  <  1.8  (12.5  >  p  >  0.46  g  cm-3),  and  the 
resulting  pressure  P,  as  determined  by  numerical  differentiation,  is  shown  in  Figure 
2(b).  The  region  of  existence  of  metallic  hydrogen  at  zero  temperature  is  estimated 
to  be  rs  ^  1.3  (Ref.  [11]),  so  that  in  Figs.  2(a),  (b)  only  the  range  rs  ^  1.3  is 
physically  meaningful.  In  what  follows,  we  consider  dynamical  simulations  for  the 
specific  value  rs  =  1  (p  =  2.7  g  cm-3,  P  =  5.45  Mbar),  well  inside  the  region  of 
existence  of  the  metallic  phase. 

The  dynamical  simulations  were  conducted  by  initially  assigning  a  Maxwellian 
distribution  of  velocities  to  the  protons,  and,  in  the  subsequent  evolution,  by  con¬ 
trolling  the  temperature  via  a  Nose  thermostat  [12].  The  time-step  was  At  = 
4.9  X  10-6  ps.  For  each  temperature  chosen,  an  initial  equilibration  phase  of 25,000 
time  steps  was  followed  by  runs  of  50,000  to  75,000  time  steps  (0.24-0.37  ps) 
during  which  statistics  were  gathered. 

In  Figure  3  a  snapshot  of  the  electron-proton  configuration  is  shown  for  T  = 
238  K,  after  an  evolution  time  of  0.16  ps.  There  is  considerable  proton  motion, 
and  the  resulting  electron  density  is  strongly  perturbed,  as  can  be  directly  seen  from 
the  isosurface  in  Figure  3,  or  from  the  contour  plot  displayed  in  the  upper  left- 
hand  corner  of  the  figure. 
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Figure  1 .  Initial  conditions  for  the  dynamical  simulations  of  metallic  hydrogen  (p  =  2.7 
g  cm-3,  rs  =  1,  54  protons  and  54  electrons,  bcc  atomic  crystal  lattice).  In  the  three- 
dimensional  figure,  the  proton  positions  are  labeled  by  small  spheres,  encapsulated  by  the 
electron  density  isosurface,  plotted  for  a  density  half-way  between  minimum  and  maximum. 
The  contour  plot  in  the  upper  left-hand  corner  refers  to  a  cross  section  of  the  electron 
density  taken  through  the  first  plane  of  ions. 


The  extent  of  relative  proton  motion  at  various  temperatures  is  quantified  in 
Figure  4,  which  displays  the  radial  distribution  functions  g(r).  Even  at  the  lowest 
temperature  considered,  238  K,  the  radial  distribution  function  is  broad,  consistent 
with  the  snapshot  of  Figure  3,  which  showed  large  displacements  of  the  protons. 
As  temperature  increases,  flattening  of  the  distribution  functions  proceeds  until  at 
the  largest  temperature  considered,  5000  K,  the  first  maximum  is  only  a  round¬ 
ish  bump. 

The  solid-liquid  transition  is  not  apparent  from  examining  Figure  4.  Rather,  we 
determine  the  melting  point  of  the  solid  by  locating  the  minimum  temperature  at 
which  the  protons  have  finite  self-diffusion,  in  other  words,  the  temperature  at 
which  the  thermally  agitated  protons  wander  without  returning  to  the  vicinity  of 
their  equilibrium  lattice  sites.  For  the  set  of  runs  presented  here,  while  for  T  =  600 
K  there  is  no  self-diffusion,  finite  diffusion  does  occur  at  900  K,  indicating  that  the 
melting  temperature  Tm  is  in  the  range  600  K  <  Tm  <  900  K.  It  is  of  some  interest 
to  compare  this  prediction  with  that  of  the  one-component  plasma  (OCP)  model 
of  the  same  system,  which  assumes  a  completely  homogeneous  electron  background 
[13].  The  OCP  melting  temperature  is  given  by 


617 


METALLIC  AND  MOLECULAR  HYDROGEN 


Figure  2.  Zero-temperature  properties  of  the  atomic  crystal  ( 54  protons  and  54  electrons), 
for  variable  rs:  (a)  total  ground-state  energy,  (b)  ground-state  pressure.  The  atomic  crystal 
is  expected  to  be  stable  only  for  rs  £  1.3,  reverting  to  a  molecular  phase  at  larger  values 

of  rs. 


Tocp  = 
1  m 


1 

IV/ 


(12) 


where,  neglecting  finite  simulation-size  effects,  -  178  (Ref.  [14]).  For  rs—  1, 
one  obtains  T°cp  =  1773  K,  a  value  much  larger  than  our  prediction  of  Tm  <  900 
K,  and  this  discrepancy  shows  the  importance  of  the  fully  quantum-mechanical 
modeling  of  the  electrons.  It  is  our  intention  to  systematically  map  the  metallic 
hydrogen  melting  curve  in  the  near  future,  so  as  to  determine  Tm  =  Tm(rs),  for  a 
finite  range  of  rs. 

In  Figures  5(a)  and  (b)  are  shown  the  velocity  autocorrelation  functions  Z(t ), 
defined  by 
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Figure  3.  Snapshot  of  the  proton  positions  (small  spheres  in  the  three-dimensional  figure) 
and  of  the  electron  density  isosurface,  for  a  dynamical  simulation  at  temperature  238  K, 
after  an  evolution  ofO.  16  ps  starting  from  the  initial  configuration  of  Figure  1.  The  contour 
plot  in  the  upper  left-hand  corner  refers  to  a  cross  section  of  the  electron  density  taken 
through  the  same  plane  as  in  Figure  I . 


Z(0  =  !<▼((>)•  t(0>. 


(13) 


for  the  temperatures  of  238  K  (solid  phase)  and  900  K  (liquid  phase).  At  the  lower 
temperature,  Figure  5(a),  the  velocity  autocorrelation  function  displays  a  long- 
lived  oscillatory  behavior,  extending  to  about  0.2  ps,  with  the  mix  of  frequencies 
that  is  characteristic  of  a  solid  [15].  In  the  liquid  phase,  Figure  5(b),  the  decrease 
of  Z(t)  is  far  more  rapid  (note  the  change  in  horizontal  scale  as  well),  indicating 
a  much  faster  loss  of  correlation  as  the  protons  freely  diffuse. 

A  quantitative  measure  of  the  frequency  content  of  the  ion  motion  is  given  by 
the  power  spectrum  Z( «),  defined  as  the  Fourier  transform  of  Z(t), 


Z(a> 


dteiulZ(t). 


(14) 


Graphs  of  Z(co)  are  displayed  in  Figure  6  for  T  =  238,  600,  900,  and  2000  K.  For 
T  =  238  K,  Figure  6(a)  (solid  phase),  the  power  spectrum  displays  a  set  of  discrete 
peaks  due  to  the  thermal  excitation  of  the  finite  number  of  phonon  which  exist  in 
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Figure  4.  Proton-proton  radial  distribution  functions  g(r)  for  rs  =  1,  at  the  temperatures 
indicated  in  the  plot  [the  first  maximum  of  g(r)  monotonically  decreases  with  increasing 

temperature]. 


the  crystal  lattice.  At  T  =  600  K,  Figure  6(b)  (solid  phase),  the  broadening  of  the 
peaks  due  to  inharmonicity  and  phonon-phonon  coupling  has  largely  washed-out 
the  discrete  mode  structure.  For  T  =  900  K,  Figure  6(c)  (liquid  phase),  a  new 
feature  appears  with  Z( 0)  +  0:  this  is  because  the  proton  self-diffusion  coeffi¬ 
cient  D , 

J'°°  1  „ 

dtZ(t)  =  -Z(0),  (15) 

0  z- 


is  nonzero.  Finally,  for  T  =  2000  K,  Figure  6(d),  the  power  spectrum  has  become 
almost  completely  diffusive. 

In  Figure  7  we  display  the  proton  self-diffusion  coefficient  as  a  function  of  tem¬ 
perature,  as  determined  by  Eq.  (15).  For  the  range  of  temperatures  studied,  and 
rs  —  1 ,  the  diffusion  coefficient  is  approximated  by  the  empirical  relation 


D{T)  = 


(16) 


where  T0,  a  temperature,  and  /0,  a  length,  are  adjustable  parameters.  In  Figure  7, 
T0  =  855  K  and /0  =  0.18  a.u. 


Molecular  Hydrogen 

In  Ref.  [1]  results  were  presented  for  a  simulation  under  conditions  (rs=  1.39, 
T  =  2000  K)  under  which  the  molecular  phase  is  expected  to  be  stable.  Starting 
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(a)  T  =  238  K  (solid  phase)  showing  the  long-lived  oscillatory  behavior  of  the  velocity 
autocorrelation  function;  (b)  T  =  900  K  (liquid  phase)  showing  the  much  faster  loss  of 
correlation  (note  change  in  horizontal  scale). 


from  the  same  atomic  bcc  crystal  as  considered  above,  the  protons  were  seen  to 
spontaneously  pair  into  molecules.  This  behavior  has  since  been  confirmed  by  the 
extensive  simulations  of  Hohl  et  al.  [  5  ] ,  using  the  Car-Parrinello  method  [  3  ] . 

We  report  here  on  a  simulation  for  vs  =  1.86  in  which  the  molecular  phase  is 
assumed  a  priori.  In  Figure  8  the  radial  distribution  function  is  shown  for  a  system 
of  64  protons  and  electrons,  in  which  the  protons  are  initially  paired  with  the  same 
separation  as  occurs  in  the  isolated  H2  molecule.  The  system  is  left  to  self-consistently 
evolve  at  temperature  2000  K.  The  sharp  peak  in  the  radial  distribution  function, 
which  occurs  at  rs  =  1.45,  attests  to  the  continued  existence  of  the  molecules,  while 
the  much  broader  peak  to  the  right  is  due  to  the  distance  between  protons  in  different 
molecules.  The  molecular  system  is  in  the  liquid  state,  with  unbounded  diffusion. 
Thus,  the  power  spectrum  shown  in  Figure  9  contains  both  a  broad,  low  frequency 
peak,  due  to  the  diffusive  motion  of  the  molecules,  and  an  outlying  peak  at  the 
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3 


(a)  u  (s  x) 


(b)  u>  (s  x) 


Figure  6.  Proton  velocity  power  spectra;  (a)  T=  238  K(  solid  phase)  showing  the  distinct 
bcc  lattice  phonon  frequencies;  (b)  T  =  600  K  (solid  phase)  showing  the  merging  of  the 
phonon  peaks  due  to  anharmonicity;  (c)  T  =  900  K  (liquid  phase)  showing  finite  proton 
self-diffusion,  Z(0)  *  0;  (d)  T  =  2000  K  (liquid  phase)  showing  an  almost  all-diffusive 

power  spectrum. 


“vibron”  frequency  w  «  7.8  X  10 14  s_1.  This  latter  frequency  is  approximately 
equal  to  the  vibrational  frequency  of  the  isolated  hydrogen  molecule  (a>e  =  8.28  X 
10 14  s"1 ,  Ref  [16]),  but  also  includes  numerical  errors  due  to  the  finite  accuracy 
of  the  density-functional  scheme  [  1  ] ,  as  well  as  the  physical  effects  of  anharmonicity 
and  many-body  interactions. 


Conclusions 

A  first-principles  scheme  based  on  a  time-dependent,  quantum-mechanical 
modeling  of  the  electrons,  coupled  to  classical  equations  of  motion  for  the  protons, 
was  implemented  on  a  parallel  computer,  and  applied  to  the  study  of  dense  hydrogen. 
We  have  estimated  the  melting  temperature  of  metallic  hydrogen  at  density  2.7  g 
cm'3  (rs  =  1 )  to  be  approximately  850  K,  and  we  have  investigated  its  dynamical 
properties  both  below  and  above  the  melting  point.  A  large  deviation  from  the 
prediction  of  the  one-component  plasma  model  ( T°CP  =  1773  K)  was  thus  found 
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T( K) 

Figure  7.  Proton  self-diffusion  coefficient  in  metallic  hydrogen  (p  =  2.7  g  cm-3,  rs  =  1 ), 
plotted  as  a  function  of  temperature.  The  dark  line  is  the  empirical  fit,  Eq.  ( 16). 


in  the  value  of  the  melting  temperature.  The  temperature  dependence  of  the  self¬ 
diffusion  coefficient  in  the  liquid  metal  was  then  examined,  and  in  the  range  studied, 
found  to  obey  a  simple  empirical  relation. 

Following  the  preliminary  work  of  Ref.  [1],  a  simulation  of  dense  hydrogen  in 
the  molecular  phase  was  also  conducted,  at  density  0.16  g  cm-3  (rs  =  1.86)  and 


Figure  8.  Radial  distribution  function  for  the  molecular  phase  at  2000  K  (p  =  0.16  g 
cm"*3,  rs  =  1.86),  obtained  for  a  system  of  64  protons  and  64  electrons. 
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Figure  9.  Velocity  power  spectrum  for  the  molecular  phase  at  2000  K  (p  -  0. 16  g  cm  3, 
rs  -  1.86).  The  “vibron”  peak  at  the  right  occurs  at  a  =  7.8  X  10 14  s'1. 


temperature  2000  K.  Results  were  comparable  to  those  obtained  by  Hohl  et  al. 
[5],  with  the  first  principles  scheme  predicting  a  molecular  liquid,  and  correct 
molecular  bond  lengths  and  vibrational  frequencies. 

A  systematic  investigation  of  the  density  dependence  of  the  melting  temperature 
of  metallic  hydrogen  and  of  its  transport  properties  in  the  liquid  phase  is  currently 
under  way. 
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Dependent  Exchange-Correlation  Approximation 
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Abstract 

We  test  an  exchange-correlation  functional  with  explicit  dependence  on  kinetic-energy  density  as  well 
as  the  density,  its  gradient,  and  its  Laplacian,  on  the  Gaussian-2  thermochemical  data  base.  With  a  small 
degree  of  exact-exchange  mixing,  we  find  average  errors  with  respect  to  experiment  of  order  2  kcal/mol, 
0.15  eV,  and  2  kcal/mol,  respectively,  for  atomization  energies,  ionization  potentials,  and  proton  affinities. 
©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

We  are  currently  engaged  in  a  systematic  survey  of  density-functional  exchange- 
correlation  approximations  in  thermochemical  applications  [1-3].  Our  survey  has 
so  far  included  the  local  spin-density  approximation  (lsda),  corrections  to  the 
lsda  depending  on  spin-density  gradients  (the  GGA,  or  generalized  gradient  ap¬ 
proximation),  and  mixtures  of  the  latter  with  exact  exchange.  These  various  ap¬ 
proximations  have  been  tested  on  the  Gaussian-2  (G2)  thermochemical  data  set 
of  Pople  and  coworkers  [4]  containing  accurate  experimental  data  on  56  atomization 
energies,  42  atomic  and  molecular  ionization  potentials,  and  8  proton  affinities  of 
first  and  second-row  systems.  Very  encouraging  results  have  been  obtained.  The 
GGA /exact-exchange  mixture  of  Ref.  [3],  for  example,  achieves  absolute  average 
errors  with  respect  to  experiment  of  only  2.4  kcal/mol,  0.14  eV,  and  1.2  kcal/mol 
for  the  G2  atomization  energies,  ionization  potentials,  and  proton  affinities, 
respectively. 

In  the  present  work,  we  similarly  assess  an  exchange-correlation  approximation 
with  explicit  dependence  on  kinetic-energy  density  as  well  as  the  density,  its  gradient, 
and  its  Laplacian.  The  exchange  part  of  the  present  approximation  was  introduced 
several  years  ago  in  Ref.  [5],  and  the  correlation  part  in  Ref.  [6].  Despite  the  age 
of  these  functionals,  they  have  not  until  now  been  tested  in  thermochemical  ap¬ 
plications  due  to  their  complexity  in  comparison  with  the  GGA.  Given  the  success 
of  the  GGA  tests  of  Refs.  [1-3],  however,  exploration  of  this  more  complicated 
class  of  exchange-correlation  approximations  is  called  for.  In  the  following  sections, 
we  briefly  review  the  work  of  Refs.  [5]  and  [6],  and  then  we  report  the  results  of 
the  present  thermochemical  tests. 


International  Journal  of  Quantum  Chemistry:  Quantum  Chemistry  Symposium  28,  625-632  (1994) 

©  1994  John  Wiley  &  Sons,  Inc.  CCC  0360-8832/94/010625-08 


626 


BECKE 


Exchange 

The  reader  should  consult  Ref.  [  5  ]  for  a  detailed  account  of  the  present  exchange 
approximation.  Here,  we  review  only  the  underlying  physics. 

The  exchange  energy  EXa  of  electrons  of  a  spin  is  given  by 


Iff  r2)  j3r  ^2 

2  J  J  ft? 


representing  the  interaction  of  the  spin-density  pa  with  an  exchange  charge  (or 
exchange  or  Fermi  “hole”)  pXa  defined  by 

Px„(r i,r2)=  2  ^*(ri)^,v(r2)  j  pA r,).  (2) 

The  following  constraints  on  pXa  are  easily  verified: 

Px<t(Yi,  T])  =  pa(T\)  (3) 

J  pxAii,  r2)d}T2  =  1  (4) 

and  are  valid  at  any  reference  point  ij  in  any  system.  Also,  note  that  pXa  is  always 
positive,  and  that  only  its  spherical  average  around  each  reference  point  is  required 
in  the  energy  integral  of  Eq.  ( 1 ) . 

Furthermore,  Taylor  expansion  of  the  spherically  averaged  pXa  around  each  ref¬ 
erence  point  reveals  its  quadratic  behavior  for  small  values  of  the  interelectronic 
separation: 


PxA r,  s)  =  f>„  +  -  (V2p„  -  2 D„)s2  + 


D.  =  r.  -  X-  (Vp,)2/p.  (6) 

with  arguments  (r,  5)  denoting  spherical  average  on  a  shell  of  radius  5  about  the 
reference  point  r,  and  with  the  positive  definite  kinetic-energy  density  rff  defined 
by 

T0  =  2  IV1M2.  (7) 


Eq.  (5 )  is  exact  at  any  reference  point  in  any  system  with  zero  current  density  (see 
Ref.  [7]  for  the  generalization  to  nonzero  currents). 

We  introduced  in  Ref.  [  5  ]  a  relatively  simple  model,  inspired  by  pXa  in  the  special 
case  of  the  hydrogenic  atom,  able  to  satisfy  the  conditions  of  Eqs.  (3)— (5)  at  all 
reference  points  in  any  system  with  no  empirical  parameters.  The  resulting  exchange 
functional  is  unfortunately  not  expressible  in  closed  analytical  form,  and  we  refer 
the  reader  to  Ref.  [  5  ]  for  discussion  of  its  evaluation.  In  atomic  tests,  our  model 
gives  significantly  better  exchange  energies,  by  about  an  order  of  magnitude,  than 
the  exchange  lsda  [  5  ] . 
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Our  model  does  not,  however,  exactly  treat  the  uniform  electron  gas,  resulting 
in  systematic  underestimation  of  exchange  energies  in  the  heavy-atom  limit  (see 
Ref.  [5  ]).  We  therefore  introduced  an  ad  hoc  parameter  y  designed  to  enforce  the 
uniform  gas  limit,  at  the  expense  of  slightly  altering  the  quadratic  behavior  in  Eq. 
( 5 ) .  Though  this  manipulation  somewhat  improves  the  exchange  energies  of  heavy 
atoms,  we  do  not  employ  it  here.  In  the  present  work,  we  use  the  unaltered  form 
(i.e.,  y  =  1 )  of  our  model. 


Correlation 


As  with  exchange,  we  refer  the  reader  to  a  previous  article  ( Ref.  [  6  ] )  for  a  full 
account  of  our  correlation  approximation. 

Our  correlation  approximation  is  based  on  well-known  interelectronic  cusp  con¬ 
ditions  on  the  correlation  hole,  and  the  zero-charge  sum  rule  as  opposed  to  the 
unit-charge  sum  rule,  Eq.  (4),  for  exchange.  Since  opposite  and  parallel-spin  electron 
pairs  are  governed  by  different  cusp  conditions,  we  obtain  separate  functionals  for 
opposite-spin  and  parallel-spin  correlation  energies: 


PaPfiZ  afi 


E°cv  =  -0.8  J 
E°c  =  -0.01  J  f>„Dczi 


1  - 


ln(  1  +  Zaf>) 


zA'+t). 


d3  r 
d3  r 


(8) 

(9) 


where  the  parallel-spin  functional  Eq.  (9)  again  involves  the  kinetic-energy  density 
t a  through  the  function  Dff  of  Eq.  (6),  and  where  E°cV  and  E c ,  respectively, 
involve  the  correlation  “lengths”  or  “ranges”  za0  and  zff<r.  These  are  defined  (see 
Ref.  [6])  as  the  interelectronic  separations  at  which  the  opposite  and  the  parallel- 
spin  correlation  holes  cross  zero  (as  they  must,  due  to  the  zero-charge  sum  rule). 

For  the  correlation  ranges  zaf3  and  zff<r,  we  apply  reasonable  physical  intuition. 
Since  each  electron  is  screened,  in  effect,  by  its  Fermi  hole  pXa,  we  assume  that  the 
correlation  range  for  the  a  a’  pair  is  related  to  a  sum  of  the  Fermi  hole  radii  Rp 
and  Rp  \ 

zffff'  -  c„ARf  +  Rf)  (10) 


where  caa>  is  a  constant  of  proportionality  with  only  two  independent  values:  cap  = 
Cpa  for  opposite-spin  pairs,  and  caa  =  cm  for  parallel-spin  pairs.  Moreover,  we  assume 
that  the  Fermi  hole  “radius”  Rp  is  given  by  the  inverse  of  the  coulomb  potential 
UXa  arising  from  pXa: 

R°f=UxK  and  UxA r,)  =  f T-±>  (11) 

^  *\2 

where  UXe  is  computed  from  the  present  r-dependent  exchange  functional  in  the 
manner  of  Ref.  [5].  Herein  lies  a  minor  deviation  from  our  original  work  of  Ref. 
[ 6  ] .  In  Ref.  [6 ] ,  we  used  an  old  and  now-outdated  exchange  gga  [ 8  ]  to  determine 
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UXa  at  each  reference  point,  whereas  we  now  employ  the  exchange  model  of  the 
preceding  section. 

The  correlation  range  parameters  may  be  determined  by  fitting  to  exact  correlation 
energies  of  atoms,  and  for  this  purpose  we  use  the  recent  compilations  of  Davidson 
and  coworkers  [9].  The  value  ca(3  =  0.63  has  been  determined  by  the  correlation 
energy  of  the  helium  atom,  and  a  least  squares  fit  to  the  remaining  atoms  in  the 
first  and  second  rows  (minimizing  the  relative  deviation  from  experiment)  gives 
caa  =  0.88.  The  resulting  correlation  energies  are  presented  in  Table  I,  and  the 
overall  rms  relative  deviation  from  experiment  is  4.9%. 

Thermochemical  Tests 

The  explicitly  r-dependent  functionals  described  above  have  been  tested  on  the 
atoms  and  molecules  of  the  G2  thermochemical  data  set  of  Pople  and  coworkers 
[4]  using  the  fully  numerical  NUMOL  density-functional  program  [10].  For  now, 
our  computations  are  post-LSDA :  that  is,  performed  using  lsda  orbitals  and  den¬ 
sities.  Complete  self-consistency  will  require  considerable  effort  to  implement,  be¬ 
cause  r  dependence  introduces  awkward  complications  in  the  orbital  equation  [11]. 
These  will  be  discussed  elsewhere.  Note  that  the  post-LSDA  nature  of  this  work 
precludes  the  computation  of  electron  affinities,  since  the  lsda  does  not  bind  anions. 
We  therefore  restrict  our  tests  to  the  G2  atomization  energies,  ionization  potentials, 
and  proton  affinities  only. 


Table  I.  Atomic  correlation  energies  (a.u.). 


Exact8 

PWb 

H 

0.000 

0.000 

He 

-0.042 

-0.042 

Li 

-0.045 

-0.053 

Be 

-0.094 

-0.090 

B 

-0.125 

-0.126 

C 

-0.156 

-0.160 

N 

-0.188 

-0.192 

O 

-0.258 

-0.259 

F 

-0.325 

-0.315 

Ne 

-0.390 

-0.365 

Na 

-0.396 

-0.389 

Mg 

-0.438 

-0.431 

A1 

-0.470 

-0.470 

Si 

-0.505 

-0.508 

P 

-0.540 

-0.543 

S 

-0.605 

-0.615 

Cl 

-0.666 

-0.675 

Ar 

-0.722 

-0.730 

a  Exact:  Ref.  [9]. 

b  PW:  present  work,  using  Hartree-Fock  or¬ 
bitals  and  densities  [12], 
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H2 

LiH 

BeH 

CH 

ch2(3b.) 

ch2(‘a,) 

ch3 

ch4 

NH 

NH2 

nh3 

OH 

H20 

HF 

Li2 

LiF 

C2H2 

c2h4 

c2h6 

CN 

HCN 

CO 

HCO 

H2CO 

ch3oh 

n2 

n2h4 

NO 


Table  II.  Atomization  energies  Z)0  (kcal/mol). 


Expt.a 

PWb 

Eq.  (12)c 

Expt.a 

PWb 

Eq.  (12)c 

103.5 

101.5 

104.3 

o2 

118.0 

128.7 

121.5 

56.0 

55.8 

57.1 

h2o2 

252.3 

251.0 

248.7 

46.9 

53.7 

54.5 

f2 

36.9 

43.3 

35.4 

79.9 

77.5 

79.2 

C02 

381.9 

391.9 

384.1 

179.6 

178.6 

181.4 

SiH2(‘A1) 

144.4 

140.5 

145.3 

170.6 

165.1 

168.9 

SiH2(3Bj) 

123.4 

119.5 

123.5 

289.2 

286.0 

291.7 

SiH3 

214.0 

205.8 

212.8 

392.5 

386.6 

394.6 

SiH4 

302.8 

292.2 

302.2 

79.0 

77.8 

79.0 

ph2 

144.7 

141.7 

146.8 

170.0 

167.9 

170.8 

ph3 

227.4 

219.3 

226.8 

276.7 

271.4 

276.3 

h2s 

173.2 

168.5 

173.0 

101.3 

101.6 

102.1 

HC1 

102.2 

100.2 

102.1 

219.3 

217.5 

218.9 

Na2 

16.6 

16.4 

16.1 

135.2 

136.6 

135.9 

Si2 

74;0 

74.5 

75.0 

24.0 

19.4 

20.4 

P2 

116.1 

113.3 

114.2 

137.6 

143.6 

140.2 

s2 

100.7 

106.3 

105.2 

388.9 

388.2 

389.9 

C12 

57.2 

57.2 

56.3 

531.9 

526.6 

534.0 

NaCl 

97.5 

93.1 

94.1 

666.3 

653.2 

666.6 

SiO 

190.5 

193.0 

188.9 

176.6 

181.6 

175.2 

CS 

169.5 

169.9 

167.3 

301.8 

303.2 

301.7 

SO 

123.5 

131.5 

126.7 

256.2 

258.3 

254.5 

CIO 

63.3 

70.3 

65.1 

270.3 

276.0 

273.2 

C1F 

60.3 

65.0 

60.7 

357.2 

357.5 

357.8 

Si2H6 

500.1 

478.8 

495.6 

480.8 

473.0 

479.4 

CH3C1 

371.0 

364.3 

371.0 

225.1 

223.7 

219.9 

ch3sh 

445.1 

435.6 

445.0 

405.4 

393.6 

400.4 

HOC1 

156.3 

156.6 

154.9 

150.1 

153.5 

148.1 

so2 

254.0 

259.8 

251.1 

a  Expt:  from  Refs.  [4]  and  [13]. 
b  PW:  present  work  with  ca&  =  0.63  and  caa  =  0.88. 
cEq.  (12):  with  cx  =  0.154,  caP  =  0.66  and  caa  =  0.88; 


In  Tables  II,  III,  and  IV,  we  list  atomization  energies,  ionization  potentials,  and 
proton  affinities,  respectively,  of  the  G2  data  set.  Column  “PW”  of  each  table 
contains  data  for  the  exchange-correlation  approximation  of  the  present  work.  The 
average  absolute  deviations  from  experiment  are  4.6  kcal/mol  for  the  atomization 
energies,  0.15  eV  for  the  ionization  potentials,  and  2.8  kcal/mol  for  the  proton 
affinities.  These  errors  are  similar  to  those  of  the  generalized  gradient  approximation 
(GGa)  reported  in  Ref.  [2]  (i.e.,  5.7  kcal/mol,  0.15  eV,  and  1.5  kcal/mol). 

The  data  in  the  third  columns  of  Tables  II,  III,  and  IV  are  obtained  by  replacing 
a  small  fraction  of  the  present  exchange  approximation  with  exact  (essentially 
Hartree-Fock)  exchange  as  follows: 


Exc  =  ET del  +  cx(ETct  ~  ETde])  +  ET  +  ET  +  E? . 


(12) 
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Table  III.  Ionization  potentials  (eV). 


Expt.a 

PWb 

Eq.  (12)c 

H 

13.60 

13.62 

13.61 

He 

24.59 

24.97 

25.05 

Li 

5.39 

5.42 

5.42 

Be 

9.32 

8.96 

9.03 

B 

8.30 

8.62 

8.63 

C 

11.26 

11.58 

11.57 

N 

14.54 

14.89 

14.86 

O 

13.61 

13.94 

13.91 

F 

17.42 

17.77 

17.71 

Ne 

21.56 

21.97 

21.88 

Na 

5.14 

5.36 

5.31 

Mg 

7.65 

7.59 

7.61 

A1 

5.98 

5.80 

5.84 

Si 

8.15 

8.01 

8.05 

P 

10.49 

10.40 

10.44 

s 

10.36 

10.33 

10.43 

Cl 

12.97 

12.97 

13.05 

Ar 

15.76 

15.80 

15.86 

ch4 

12.62 

12.56 

12.62 

nh3 

10.18 

10.25 

10.23 

OH 

13.01 

13.18 

13.16 

H20 

12.62 

12.71 

12.68 

HF 

16.04 

16.23 

16.17 

SiH4 

11.00 

10.77 

10.90 

PH 

10.15 

10.12 

10.15 

ph2 

9.82 

9.80 

9.84 

ph3 

9.87 

9.79 

9.83 

SH 

10.37 

10.33 

10.41 

SH2(2B,) 

10.47 

10.36 

10.42 

SH2(2Aj) 

12.78 

12.59 

12.66 

HC1 

12.75 

12.71 

12.77 

c2H2 

11.40 

11.34 

11.34 

c2h4 

10.51 

10.48 

10.46 

CO 

14.01 

14.02 

14.18 

N2(22g) 

15.58 

15.55 

15.81 

N2(2nu) 

16.70 

16.73 

16.77 

o2 

12.07 

12.46 

12.55 

p2 

10.53 

10.63 

10.83 

S2 

9.36 

9.38 

9.49 

C12 

11.50 

11.17 

11.32 

C1F 

12.66 

12.45 

12.58 

cs 

11.33 

11.26 

11.36 

aExpt:  from  Refs.  [4]  and  [13]. 
b  PW:  present  work  with  cafi  =  0.63  and  caa  =  0.88. 
c  Eq.  (12):  with  cx  =  0.154,  cal3  =  0.66  and  caa  =  0.88. 
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Table  IV.  Proton  affinities  (kcal/mol). 


Expt.a 

PWb 

Eq.  (12)c 

H2 

100.8 

97.0 

97.8 

c2h2 

152.3 

151.0 

153.0 

nh3 

202.5 

199.4 

201.6 

h2o 

165.1 

160.2 

162.3 

SiH4 

154.0 

152.6 

152.6 

ph3 

187.1 

183.5 

185.5 

h2s 

168.8 

166.8 

167.6 

HC1 

133.6 

131.5 

132.1 

aExpt:  from  Refs.  [4]  and  [13]. 
b  PW:  present  work  with  ca(i  =  0.63  and  caa  =  0.88. 
c  Eq.  (12):  with  cx  =  0.154,  cafs  =  0.66  and  caa  =  0.88. 


Small  admixture  of  exact  exchange  is  motivated  by  the  coupling-strength  integration 
formula  of  Kohn-Sham  theory,  and  was  proposed  in  Ref.  [  3  ]  to  which  we  refer 
the  reader  for  further  discussion.  For  the  present  r-dependent  exchange-correlation 
model,  Eq.  (12)  is  the  analog  of  Eq.  (2)  in  Ref.  [3]  for  the  GGA.  We  find  that 
exact-exchange  mixing  improves  thermochemical  results  insignificantly  unless  the 
correlation  range  parameter  caf}  has  slightly  larger  value  than  its  atomic  helium 
value.  The  outcome  of  a  two-parameter  fit  to  the  G2  data,  with  ca(j  —  0.66  and  an 
exact-exchange  mixing  fraction  of  cx  =  0.154,  is  recorded  in  Tables  II,  III,  and  IV. 
Average  error  in  the  atomization  energies  is  reduced  to  a  remarkably  small  1.8 
kcal/mol,  with  errors  in  the  ionization  potentials  and  proton  affinities  of  0.14  eV 
and  1 .6  kcal  /mol,  respectively.  The  average  errors  of  the  analogous  GGA  plus  exact- 
exchange  theory  of  Ref.  [3]  are  2.4  kcal/mol,  0.14  eV,  and  1.2  kcal/mol. 


Conclusions 

The  functionals  of  this  work  give  slightly  better  atomization  energies  than  their 
GGA  counterparts.  They  are  based,  however,  on  a  totally  different  philosophy.  The 
GGA  relies  on  the  uniform  electron  gas  as  starting  point,  and  adds  gradient-dependent 
corrections  to  account  for  local  inhomogeneities.  As  such  the  GGA  does  not  treat 
the  simplest  conceivable  chemical  system,  the  hydrogenic  atom,  exactly.  Conversely, 
the  functionals  of  the  present  work  are  inspired  by  the  hydrogenic  atom  as  a  pro¬ 
totypical  system,  but  do  not  exactly  treat  the  uniform  electron  gas.  Interestingly, 
these  two  rather  different  approaches  give  results  of  comparable  quality  in  these 
molecular  tests.  We  are  now  investigating  their  relative  merits  in  transition-metal 
chemistry. 

In  summary,  contemporary  exchange-correlation  approximations,  whether  of 
the  GGA  variety  or  the  present  r-dependent  variety,  typically  yield  average  errors 
with  respect  to  experiment  of  5  kcal/mol,  0.15  eV,  and  2  kcal/mol,  respectively, 
for  the  atomization  energies,  ionization  potentials,  and  proton  affinities  of  the  G2 
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data  base  [4],  With  exact-exchange  mixing  of  order  15-20%,  as  espoused  in  Ref. 
[3]  the  average  error  of  the  atomization  energies  may  be  reduced  to  order  2 
kcal/mol. 
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Abstract 

The  Density-Functional  method,  with  Linear  Combination  of  Atomic  Orbitals,  has  been  applied  to 
eight  crystals:  the  lattice  equilibrium  parameters,  and  the  lattice  formation  energies  have  been  calculated 
at  the  Hartree-Fock  level  (hf),  at  the  hybrid  Hartree-Fock  Density-Functional  level  (dft/hf),  and  at 
the  Kohn-Sham  Density-Functional  level  (dft).  The  band  structures  and  the  electronic  charge  distri¬ 
butions  calculated  at  the  DFT  and  HF  levels  are  compared.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Computational  methods  connected  with  the  Density-Functional  Theory  have 
for  many  years  been  applied  to  crystalline  solids,  using  various  basis  sets  to  expand 
the  one-electron  Bloch  orbitals,  such  as  plane  waves  [1],  muffin-tin  orbitals  [2], 
and  augmented  plane  waves  [3] .  Recently  dft  methods  have  been  applied  exten¬ 
sively  to  molecular  systems  [4-12].  With  the  use  of  the  Linear  Combination  of 
Atomic  Orbitals  (lcao)  method,  the  dft  has  been  successfully  compared  with  the 
Hartree-Fock  method  and  with  the  standard  molecular  correlation  methods,  like 
Configurations  Interaction,  the  perturbative  and  the  Coupled  Cluster  methods.  Only 
a  few  examples  of  the  periodic  LCAO  dft  method  exist  in  the  literature  [  13].  In 
the  present  work,  the  Hartree-Fock  lcao  method  for  periodic  systems  [14]  has 
been  modified  to  deal  with  dft  methods.  The  comparison  is  now  possible  between 
dft  and  hf  related  methods  for  condensed  systems,  in  a  way  strictly  analogous  to 
molecular  systems.  The  CRYSTAL  [  15  ]  suite  of  programs  has  therefore  been  gen¬ 
eralized  in  order  to  apply  dft  lcao  methods  as  well  as  the  hf  lcao  technique  to 
bulk  crystals,  slabs,  as  models  of  surfaces,  polymers,  and  molecules. 

At  a  first  stage  [  16,17]  the  correlation  energy  functionals  have  been  taken  into 
consideration:  the  Colle-Salvetti  [  18]  and  Perdew  [19]  correlation  functionals  have 
been  applied  to  the  Hartree-Fock  electronic  density,  in  the  so  called  a  posteriori 
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DFT  correlation  method.  Then  the  exchange-correlation  energy  functionals  have 
been  applied  a  posteriori  to  the  hf  density  [20]:  this  method  has  been  defined  as 
the  hybrid  dft-hf  method.  The  hybrid  method  has  been  applied  to  a  set  of  20 
covalent  and  ionic  crystals.  In  the  present  work  the  Density-Functional  Theory,  in 
the  Local-Density  Approximation  (lda),  has  been  introduced  to  the  Self-Consistent 
Field  (scf)  process,  resolving  at  each  stage  the  Kohn-Sham  (KS)  equations,  instead 
of  the  Hartree-Fock  equations.  It  is  thus  possible  to  compare  not  only  the  hf  and 
DFT  energies  (and  the  structural  related  quantities),  but  also  the  eigenvalue  spectra 
and  the  observables  related  with  the  first  order  density  matrix,  such  as  the  electronic 
charge  distribution  in  the  direct  or  in  the  momentum  space. 

With  the  computer  program  developed  in  the  present  work  it  is  therefore  possible 
to  study  the  electronic  structure  at  the  Hartree-Fock  or  Kohn-Sham  level,  by  using 
the  same  computational  conditions,  like  the  basis  set  or  the  limits  and  approximation 
of  the  infinite  sums  of  interactions.  With  the  use  of  gaussian  basis  sets,  obtained 
by  a  few  modifications  of  the  molecular  basis  sets,  the  periodic  and  the  molecular 
systems  can  be  treated  in  the  same  way,  and  the  genuine  solid  state  effects,  that 
arise  from  the  application  of  the  dft  and  the  hf  methods,  can  be  explored. 

Methods  and  Computational  Details 

In  this  work  the  Hartree-Fock  electron  density  pHF  and  the  Kohn-Sham  density 
Pks  have  been  calculated.  In  the  following,  phf/ks  indicates  both  HF  and  KS  densities: 

J*  occ.  bands[£] 

dk  2  I<Plhf/ks(?)|2. 

Brill.  Zone  ^ 

The  crystalline  orbitals  <^hf/ks(?)  are  a  solution  of  the  one-particle  equations: 

^FIF/KS^LHF/KsC  ?)  =  £?<PLHF/Ks(  ?)  • 

The  Hartree-Fock  mono-electronic  operator  is  defined  as  follows: 

^hf  =  t  +  v  +  J[p(r)]  +  KHF[p(?,T)], 

where  t,  v,  J,  and  K  are,  respectively,  the  kinetic,  the  external-potential,  the  Coulomb, 
and  the  exchange  operators.  The  infinite  lattice  sums,  that  are  implicit  in  the  direct 
lattice  implementation  of  the  LCAO  periodic  hf  method  are  limited  and  approxi¬ 
mated  as  described  in  Ref.  [  14] . 

The  Kohn-Sham  mono-electronic  operator  is  defined  as  follows: 

h^  =  t  +  v  +  J[p(7)]  +  vxc{7). 

The  exchange-correlation  potential  vx~c  is  the  functional  derivative  of  the  exchange- 
correlation  Density-Functional  Energy: 

E~-  I 

UNIT 

CELL 
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Only  few  examples  of  application  of  dft  correlation  and  Hartree-Fock  exchange 
can  be  found  in  the  literature  [21].  The  main  reason  is  that  the  HF  density  of 
molecules  is  affected  by  the  lack  of  static  correlation,  while  the  local  ( or  gradient ) 
dft  exchange  densities  are  less  affected  by  this  problem  [22]:  on  the  other  hand, 
correlation  dft  takes  only  dynamical  effects  into  account.  For  covalent  and  ionic 
solids  the  static  correlation  is  probably  less  important:  *  so  the  hf  electronic  density 
could  be  a  good  basis  for  the  application  of  correlation  dft  .  In  the  present  work 
both  the  approaches  have  been  followed:  exchange  and  correlation  have  been  treated 
by  dft  (using  vxc  potential)  and  also  calculations  with  HF  “exact”  exchange  and 
DFT  correlation  have  been  performed.  In  this  way  the  effects  of  the  DFT  exchange 
and  correlation  on  the  electronic  density  and  on  the  band  structure  (better  defined 
as  the  one-particle  eigenvalue  spectrum)  have  been  separately  studied.  Thus,  a 
correlation-only  Kohn-Sham  operator  /zhf-ks  correi  has  been  defined: 

/1tHF-KS  correi  =  t  +  V  +  J[p(r)]  +  KhfIpC?  ,?')]  +  VC(J). 

This  method  will  be  defined  in  the  following  as  HF-LDA. 

In  DFT  lcao  methods,  the  vx~c  potential  is  represented  in  a  basis  set  of  gaussian 
atomic  orbitals  X^r  -g),  where  g  labels  the  unit  where  the  /xth  orbital  is  located. 
The  integrals: 

J  d7x„(7)  vx~c(7)  Xv(7  -  ?). 

SPACE 

must  be  calculated  numerically,  owing  to  the  complex  analytical  expression  of  the 
exchange-correlation  potentials  vx  c ,  even  at  a  local  level.  The  analytical  method 
for  calculating  dft  matrix  elements,  recently  proposed  by  Almlof  [23],  is  com¬ 
putationally  too  expensive  to  be  used  in  lcao  calculations  of  periodic  systems. 
Instead  of  performing  the  numerical  integration  directly  over  each  pair  of  atomic 
orbitals  (for  each  unit  cell  g),  in  the  present  work  the  exchange-correlation  potential 
has  been  expanded  in  the  following  way: 

Px~c(?)  £  2  CjG0). 
j 

The  fitting  functions  Gj  are  linear  combinations  of  gaussian-type  functions,  total 
symmetric  with  respect  to  translations  and  rotations:  the  exchange-correlation  po¬ 
tential  vx~c{  r )  is  a  basis  of  the  total  symmetric  irreducible  representation  of  the 
space  group.  The  dft  potential  integrals  are  consequently: 

N 

v  fi\v,g  ~  2  ’ 

i 

J  drxjr)  Gj(r)  X„(7  -  ?). 

SPACE 


*  Work  is  in  progress  in  our  laboratory  to  analyze  the  correlation  of  ionic  solids  with  the  usual  “chemical 
concepts  of  static  and  dynamic  correlations. 
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The  integrals  are  calculated  only  once  analytically,  and  stored.  At  each  SCF 
iteration  the  best  fitting  coefficients  are  evaluated,  Cj,  by  solving  a  linear  least  square 
equation: 


Ac  =  b. 

A  is  the  overlap  matrix  between  the  fitting  functions: 

Au  =  J  d?G,(7)Gj(r) . 

UNIT 

CELL 

c  is  the  vector  of  the  unknown  coefficients  cp  b  is  the  vector  of  the  constant  coef¬ 
ficients  bj ,  that  depend  on  the  density,  and  are  calculated  numerically: 

bj  =  J  drGj(7)vx-c(r) . 

UNIT 

CELL 

The  last  numerical  integration  has  great  advantages  with  respect  to  the  direct  nu¬ 
merical  evaluation  of  the  integrals  it  is  restricted  to  the  unit  cell,  and  the 
integrand  is  total  symmetric.  In  this  way  the  numerical  integration  can  be  limited 
to  the  irreducible  set  of  points. 

The  fitting  functions  do  not  introduce  any  inaccuracy  in  the  evaluation  of  the 
dft  matrix  elements:  in  the  present  work  a  set  of  eight  even-tempered  gaussian 
functions  for  each  atom  have  been  employed:  the  exponential  coefficients  belong 
to  an  interval  that  goes  from  twice  the  highest  exponent  to  the  lower  exponent  of 
the  Atomic  Orbital  basis  set.  Further  fitting  functions  have  been  added  in  the  bond 
regions.  Only  fitting  functions  with  angular  s  symmetry  have  been  adopted:  p,  d , 
and  /  functions  do  not  give  any  contribution  to  the  fitting,  due  to  the  cubic  sym¬ 
metry:  g  functions  give  unimportant  contributions.  The  integrals  have  been 
calculated  numerically  for  silicon  to  test  the  effect  of  the  fitting  technique,  with  a 
high  computational  time  ( 10  times  the  CPU  time  of  the  fitting  method):  the  equi¬ 
librium  lattice  parameters  change  only  by  less  than  0.001  A,  and  total  energy  changes 
by  0.1  kcal/mol. 

The  numerical  integration  over  the  unit  cell  is  performed  by  an  extension  to  the 
periodic  systems  of  the  atomic  partition  method,  introduced  by  Becke  [24].  The 
fitting  parameters  bj  are  calculated  at  each  cycle  of  the  SCF  process  with  a  moderate 
numerical  accuracy  (about  1000  points  per  atom),  while  the  dft  energy  is  calculated 
at  the  end  of  the  SCF  process  with  high  accuracy  (about  10,000  points  per  atom). 
During  the  SCF  process  exponential  weights  are  used  [25],  while  the  energy  is 
calculated  by  means  of  a  partition  in  overlapping  Voronoi  polyhedra  [26].  The 
numerical  integration  technique,  as  well  as  the  numerical  accuracy,  will  be  object 
of  a  separate  analysis  [26  ] .  The  limits  of  the  lattice  sums  ensure  an  accuracy  of  0. 1 
kcal/mol  in  the  total  energy. 

In  the  present  work  eight  cubic  crystals  have  been  considered,  ranging  from 
perfect  co valency  (Diamond,  Si,  Ge)  to  perfect  ionicity  (MgO,  NaCl),  including 
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partially  ionic  bond  situations  (BN,  A1P,  GaAs).  For  covalent  and  partially  covalent 
systems,  the  6-2 1G*  basis  set  has  been  adopted  [27];  the  basis  sets  of  the  ionic 
systems  have  been  optimized  with  periodic  Hartree-Fock  calculations  [28]. 

The  lda  energy  correlation  functional  used  in  this  work  is  the  Perdew-Zunger 
[29]  parametrization  of  the  numerical  results  obtained  by  Ceperley  and  Alder  for 
the  electron  gas  [  30  ] :  the  new  parametrization  proposed  by  Perdew  and  Wang  [31] 
gives  equivalent  results  for  equilibrium  lattice  parameters  and  lattice  formation 
energies.  Gradient  dependent  functionals  are  defined  by  the  Generalized  Gradient 
Approximation  (GGA),  formulated  by  Becke  for  exchange  energy  (bec)  [32]  and 
by  Perdew  for  correlation  energy  ( P9 1 )  [31]. 

Results  and  Discussion 

In  Tables  I  and  II  the  lattice  parameters,  calculated  by  the  different  approxi¬ 
mations,  are  compared.  Table  I  contains  results  obtained  with  exchange-correlation 
energy  functionals,  while  the  data  of  Table  II  are  calculated  by  a  Hartree-Fock 
“exact  exchange”  energy  functional  plus  correlation-only  density  functional.  In 
each  table  the  experimental  and  the  HF  data  are  reported  for  comparison. 

HF  systematically  overestimates  the  lattice  parameters.  For  first  and  second  row 
covalent  crystals  ( C,  Si,  Ge,  BN,  A1P,  and  GaAs)  the  error  is  about  +0.05  angstroms 
(A),  that  corresponds  to  an  error  of  +0.02  A  in  the  bond  lengths.  The  error  increases 
to  +0.12  A  for  GaAs,  that  belongs  to  the  fourth  row  of  the  periodic  table. 

MgO  and  NaCl  are  almost  totally  ionic;  the  Mulliken  charges  of  the  cations  are 
1 .95  and  0.99  at  the  hf  level,  respectively.  At  lda  level  they  appear  slightly  less 
ionic:  the  charges  are  1.85  and  0.95,  respectively.  MgO  is  an  example  of  ionic 
crystals  that  have  a  correct  lattice  parameter  at  hf  level.  On  the  other  hand  hf 
gives  an  error  of  +0.22  A  for  the  lattice  parameter  of  NaCl.  The  very  different 


Table  I.  Lattice  parameters  of  crystals  (in  angstroms),  calculated  using  exchange  and  correlation  energy 

Density  Functionals. 


Expt.  [36] 

HF 

a  posteriori 

Kohn-Sham 

a  posteriori 

Kohn-Sham 

LDA-LDA/HF 

LDA-LDA/LDA-LDA 

BEC-P9 1  /HF 

BEC-P9 1/LDA-LDA 

Diamond 

3.57 

3.58 

3.55 

3.56 

3.6 

3.61 

Si 

5.43 

5.49 

5.43 

5.42 

5.48 

5.49 

Ge 

5.66 

5.81 

5.85 

5.85 

5.86 

5.87 

BN 

3.62 

3.62 

3.6 

3.6 

3.64 

3.65 

A1P 

5.45 

5.55 

5.45 

5.48 

5.53 

5.56 

GaAs 

5.65 

5.77 

5.77 

5.78 

5.78 

5.79 

MgO 

4.20 

4.21 

4.18 

4.19 

4.27 

4.27 

NaCl 

5.57 

5.79 

5.49 

5.41 

5.83 

5.78 

The  electronic  density  is  calculated  by  Hartree-Fock  method  ( a  posteriori  columns)  or  by  Kohn- 
Sham  lda,  using  exchange-correlation  dft  potential  (Kohn-Sham  columns).  The  local  (lda)  energy 
functional  and  the  gradient  corrected  Perdew  GGA  (P91)  are  utilized. 
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Table  II.  Lattice  parameters  of  crystals  (in  angstroms),  calculated  using  Hartree-Fock  “exact  exchange’ 
energy  and  correlation  energy  Density-Functional  (hf-dft  method). 


Expt. 

HF 

a  posteriori 

Kohn-Sham 

a  posteriori 

Kohn-Sham 

HF-LDA/HF 

HF-LDA/HF-LDA 

HF-P91/HF 

HF-P9 1  /HF-LDA 

Diamond 

3.57 

3.58 

3.53 

3.53 

3.51 

3.51 

Si 

5.43 

5.49 

5.42 

5.41 

5.38 

5.37 

Ge 

5.66 

5.81 

5.74 

5.74 

5.71 

5.71 

BN 

3.62 

3.62 

3.57 

3.57 

3.55 

3.55 

A1P 

5.45 

5.55 

5.47 

5.47 

5.42 

5.42 

GaAs 

5.65 

5.77 

5.69 

5.70 

5.67 

5.67 

MgO 

4.20 

4.21 

4.16 

4.16 

4.11 

4.11 

NaCl 

5.57 

5.79 

5.68 

5.67 

5.47 

5.47 

The  electronic  density  is  calculated  by  HF  method  ( a  posteriori  columns)  or  by  Kohn-Sham  lda, 
using  HF  exchange  potential  and  correlation  dft  potential  (Kohn-Sham  columns).  The  local  (lda) 
energy  functional  and  the  gradient  corrected  Perdew  gga  (P91)  are  utilized. 


behavior  of  the  two  ionic  crystals  is  due  to  the  different  importance  of  the  electron 
correlation  in  the  anions:  the  electronic  structure  of  the  Cl”  ion  is  highly  influenced 
by  the  electronic  correlation,  while  the  O2-  ion  is  acceptably  described  at  scf  level. 
All  the  dft  models  give  a  parallel  effect  on  the  lattice  parameters  of  MgO  and 
NaCl.  The  local  correlation-only  dft  (hf-lda)  decreases  the  lattice  parameters, 
giving  errors  of —0.04  and  +0. 1 1  A,  respectively.  The  gradient  corrected  correlation 
dft  (HF-P91 )  reduces  the  lattice  parameters,  giving  errors  of -0.10  and  -0.20  A. 
The  local  exchange  (lda-lda)  increases  the  lattice  parameters,  with  respect  to  HF 
exchange,  with  errors  of  -0.02  and  -0.08  A.  The  gradient  exchange  (BEC-P91 ) 
expands  the  lattices,  giving  errors  of  +0.07  and  +0.25  A.  Ionic  crystals,  for  which 
electronic  correlation  is  very  important,  confirm  being  the  most  difficult  to  treat 
by  dft  and  hf  methods.  Work  is  in  progress  to  estimate  the  effect  of  other  correlation 
techniques  (such  as  Configurations  Interaction,  Perturbation  Theory,  and  Coupled 
Cluster  Method)  on  equilibrium  parameter  of  NaCl. 

As  it  is  clear  from  the  data  in  Tables  I  and  II,  the  geometries  of  the  crystals  do 
not  depend  on  the  electronic  density:  on  the  other  hand  the  geometries  are  strongly 
dependent  on  the  method  used  to  calculate  the  energy.  This  confirms  the  results 
of  the  application  of  DFT  to  molecular  systems:  the  quality  of  the  hybrid  models 
[11]  and  Kohn-Sham  methods  [4,12]  show  that  the  hf  density,  when  used  to 
calculate  the  dft  energy,  performs  equally  or  even  better  than  the  ks  density. 

The  difference  between  hf  and  KS  density,  even  not  detectable  in  the  calculation 
of  energy  and  of  energy-related  quantities,  can  be  analyzed,  for  more  deeply  un¬ 
derstanding  the  physical  consequences  of  the  two  methods. 

Figure  1(b)  shows  the  difference  between  the  Kohn-Sham  density,  in  the  Local- 
Density  approximation,  and  the  Hartree-Fock  density,  for  silicon,  lda  gives  less 
electron  density  close  to  the  core  region  (r  <  0.5  A)  and  in  the  bond  region:  this 
is  due  to  the  less  effective  Fermi  hole  that  the  lda  method  builds  around  each 
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Figure  1 .  Comparison  between  lda  and  hf  electronic  charge  distribution  (electron  den¬ 
sity)  of  silicon.  Continuous  lines  correspond  to  positive  values,  dotted  lines  correspond  to 
negative  values,  dot-dashed  lines  correspond  to  zero  value,  (a)  Difference  between  HF 
density  the  crystal  and  overlapping  free  atom  densities.  Two  contiguous  lines  correspond 
to  a  difference  of  0.002  atomic  units  (electrons  per  cubic  bohrs),  (b)  difference  between 
lda  density  and  hf  density.  Two  contiguous  lines  correspond  to  a  difference  of 0.002  a.u.; 
(c)  difference  between  hf-lda  (Hartree-Fock  exchange  and  lda  correlation)  and  HF 
density.  Two  contiguous  lines  correspond  to  a  difference  of  0.00 1  a.u. 


electron  in  the  high-density  regions,  with  respect  to  the  HF  method.  The  electrons 
are  then  pushed  out  into  regions  away  from  the  cores,  in  opposition  to  the  Si  Si 
bonds,  where  the  density  is  lower.  Figure  2(a)  (for  MgO)  confirms  these  features, 
in  the  regions  where  electronic  density  is  higher,  that  is  the  core  of  the  Mg  cation 
and  the  valence  of  the  O2  anion,  the  lda  concentrates  fewer  electrons  on  average 
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Figure  2.  Comparison  between  lda  and  hf  electron  densities  of  magnesium  oxide.  The 
symbols  are  as  in  Figure  1.  The  02“  anion  is  in  the  center  of  the  figure,  (a)  difference 
between  lda  density  and  HF  density.  Two  contiguous  lines  correspond  to  a  difference  of 
0.001  a.u.;  (b)  difference  between  hf-lda  (Hartree-Fock  exchange  and  lda  correlation) 
and  HF  density.  Two  contiguous  lines  correspond  to  a  difference  of  0.0005  a.u. 


than  the  hf  method:  the  electrons  are  then  pushed  to  the  regions  where  the  density 
is  lower,  that  is,  the  valence  region  of  the  cation.  The  difference  between  the  hf 
density  of  silicon  and  the  overlapping  densities  of  silicon-free  atoms  is  reported  in 
Figure  1(a),  for  better  understanding  the  scale  of  the  differences  between  hf  and 
ks  densities.  The  chemical  effects  of  Si — Si  bond  formation  are  more  than  one 
order  of  magnitude  higher  than  the  differences  between  the  lda  and  the  KS  density. 

The  lda  correlation  effects  are  separately  analyzed  in  Figures  1(c)  and  2(b),  in 
which  the  HF  density  is  subtracted  from  the  density  obtained  by  an  SCF  process 
with  the  HF  “exact  exchange”  potential  and  the  lda  correlation  potential.  The 
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0.50. 


Figure  3.  Band  structure  of  silicon.  The  energy  scale  is  in  Hartree.  The  orbital  zero  energy 
is  arbitrary,  but  common  to  the  three  models,  (a)  HF  band  structure;  (b)  hf-lda  band 
structure  (HF  exchange  and  lda  correlation),  (c)  lda  band  structure  (lda  exchange- 

correlation). 


effects  of  correlation  are  one  order  of  magnitude  weaker  than  the  exchange- 
correlation  effects,  and  are  confined  in  the  cores  of  the  heavier  atoms,  like  silicon 
[Fig.  1(c)]  and  Mg  [Fig.  2(b)].  The  correlation  effects  in  the  core  of  the  oxygen 
are  not  detectable  on  the  same  scale,  confirming  that  the  O2-  anion  is  well  described 
at  the  hf  level. 

The  band  structures  of  Silicon  and  MgO  are  reported  in  Figures  3  and  4,  following 
a  path  that  connects  the  higher  symmetry  points  of  the  First  Brillouin  zone  of  the 
reciprocal  space.  Figures  3(a)  and  4(a)  contain  the  hf  band  structures:  3(b)  and 
4(b)  the  lda  bands.  The  lda  correlation  stabilizes  the  one-particle  orbitals  by  1.1 


Figure  4.  Band  structure  of  Magnesium  Oxide.  Energy  scale  is  in  Hartree.  (a)  HF  band 
structure,  (b)  hf-lda  band  structure  (HF  exchange  and  lda  correlation),  (c)  lda  band 
structure  (lda  exchange-correlation). 
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Table  III.  Formation  energies  of  crystals  (in  Kcal/mol),  calculated  using  exchange  and  correlation 

energy  Density  Functionals. 


Expt.  [37] 

HF 

a  posteriori 

Kohn-Sham 

a  posteriori 

Kohn-Sham 

LDA-LDA/HF 

LDA-LDA/LDA-LDA 

BEC-P91/HF 

BEC-P9 1  /LDA-LDA 

Diamond 

-339. 

-249. 

-409. 

-420. 

-344. 

-351. 

Si 

-216. 

-141. 

-235. 

-248. 

-195. 

-203. 

Ge 

-180. 

-105. 

-197. 

-205. 

-164. 

-171. 

BN 

-314. 

-222. 

-367. 

-380. 

-308. 

-316. 

A1P 

-192. 

-125. 

-222. 

-231. 

-186. 

-192. 

GaAs 

-155. 

-82. 

-172. 

-178. 

-140. 

-146. 

MgO 

-726. 

-673. 

-745. 

-765. 

-705. 

-716. 

NaCl 

-189. 

-182. 

-196. 

-212. 

-179. 

-185. 

The  electronic  density  is  calculated  by  HF  method  ( a  posteriori  columns)  or  by  Kohn-Sham  lda, 
using  exchange-correlation  dft  potential  (Kohn-Sham  columns).  The  local  (lda)  energy  functional  and 
the  gradient  corrected  Perdew  gga  (P9 1 )  are  utilized. 


eV  for  Si  and  by  1.5  eV  for  MgO,  with  respect  to  hf,  while  the  gaps  remain  un¬ 
changed  at  values  7.0  and  16.1  eV,  respectively.  The  lda  exchange-correlation, 
that  does  not  contain  the  Exchange  Self-Energy  singularity  [  33  ] ,  decreases  the  gaps 
to  0.5  and  3.2  eV.  The  band  structures  calculated  in  the  present  work  are  in  agree¬ 
ment  with  the  lda  band  structures  reported  in  the  literature  [34,35]. 

Tables  III  and  IV  report  lattice  formation  energies.  As  for  the  geometries  (Tables 
I  and  II)  there  are  no  great  difference  between  the  data  calculated  by  the  a  posteriori 
and  the  Kohn-Sham  method. 


Table  IV.  Formation  energies  of  crystals  (in  Kcal/mol),  calculated  using  Hartree-Fock  “exact  exchange” 
energy  and  correlation  energy  Density  Functional. 


Expt. 

HF 

a  posteriori 

Kohn-Sham 

a  posteriori 

Kohn-Sham 

HF-LDA/HF 

HF-LDA/HF-LDA 

HF-P91/HF 

HF-P9 1  /HF-LDA 

Diamond 

-339. 

-249. 

-307. 

-306. 

-348. 

-348. 

Si 

-216. 

-141. 

-190. 

-186. 

-221. 

-218. 

Ge 

-180. 

-105. 

-152. 

-151. 

-177. 

-176. 

BN 

-314. 

-222. 

-282. 

-282. 

-312. 

-312. 

A1P 

-192. 

-125. 

-175. 

-174. 

-200. 

-199. 

GaAs 

-155. 

-82. 

p 

-130. 

-150. 

-150. 

MgO 

-726. 

-673. 

-721. 

-721. 

- 727 . 

-726. 

NaCl 

-189. 

-182. 

-186. 

-186. 

-191. 

-190. 

The  electronic  density  is  calculated  by  HF  method  ( a  posteriori  columns)  or  by  Kohn-Sham  lda, 
using  hf  exchange  potential  and  correlation  dft  potential  (Kohn-Sham  columns).  The  local  energy 
(lda)  functional  and  the  gradient  corrected  Perdew  gga  (P9 1 )  are  utilized. 
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The  hf  method  underestimates  the  lattice  formation  energies,  by  about  30%, 
while  the  lda  method  overestimates  the  binding  energies  in  all  cases.  The  gradient 
corrected  dft,  both  at  the  correlation  only  and  at  the  exchange-correlation  level, 
gives  accurate  thermo-chemical  data,  compatibly  with  the  quality  of  the  basis  set, 
that  is  double  zeta  plus  polarization. 

Conclusions 

The  Kohn-Sham  Density-Functional  structural  results  (equilibrium  lattice  pa¬ 
rameters,  and  formation  energies)  confirm  the  trend  observed  for  the  corresponding 
data  calculated  by  the  hybrid  dft/hf  method.  The  method  used  to  compute  total 
energy  is  much  more  relevant  than  the  method  used  to  calculate  electronic  density. 
Differences  between  the  equilibrium  geometries  calculated  using  the  HF  or  the  KS 
electronic  densities  are  only  detectable  for  ionic  crystals  like  NaCl,  where  the  cor¬ 
relation  effects  are  very  important. 
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Abstract 

The  interaction  of  the  C02  molecule  with  a  single  palladium  atom  has  been  studied  by  means  of  a 
linear  combination  of  Gaussian  type  orbitals-density  functional  (lcgto-df)  method,  as  implemented 
in  the  code  deMon.  Local  and  nonlocal  functionals  were  used.  The  core  electrons  of  the  palladium  atom 
were  represented  by  a  model  core  potential,  which  includes  relativistic  effects.  Several  coordination 
modes  of  the  C02  moiety  to  the  Pd  atom  were  studied.  They  were  fully  optimized  by  allowing  both 
electronic  and  structural  relaxation.  The  calculations  indicate  a  dihapto  CO  (r?2-CO)  coordination  mode 
(or  mixed  Pd — C/Pd — O  bonding),  of  Q  symmetry,  as  the  one  of  lowest  energy.  There  is  a  strong  charge 
transfer  from  Pd  to  C02,  which  produces  a  large  structural  change  in  C02:  It  goes  from  a  linear  to  a 
bent  geometry.  The  calculated  frequencies  of  C02  in  Pd— C02  are  689,  1212,  and  2018  cm'1  for  the 
bending,  symmetric,  and  asymmetric  vibrational  modes,  respectively.  These  results  agree  reasonably 
with  the  observed  frequencies  of  C02  adsorbed  on  H20/Pd(l  10)  and  Pd(l  1 1)/Na  surfaces.  The  i?2-CO 
structure  is  a  likely  candidate  for  the  adsorption  of  C02  on  palladium  surfaces,  since  the  asymmetric 
mode  is  observed.  This  mode  is  dipole  allowed  in  Cs  symmetry  but  dipole  forbidden  in  C2v.  The  monohapto 
C  (V-C)  coordination  mode,  of  C2v  symmetry,  was  found  to  be  a  transition  state,  2.2  and  8.4  kcal/mol, 
for  the  local  and  nonlocal  levels,  respectively,  above  the  (t?2-CO)  minimum.  The  t?2-0— O  mode  is  not 
attainable  for  Pd — C02;  calculations  started  with  this  configuration  converge  to  the  tj2-CO  mode,  the 
ground  state.  This  is  a  quite  different  picture  to  that  found  for  Ni — C02 ,  where  the  moiety  is  chemisorbed 
in  the  dihapto  172- O — O  coordination  mode.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

The  experimental  and  theoretical  study  of  transition  metal-ligand  (tm-l)  inter¬ 
actions  has  been  of  high  interest  during  the  last  years  [1,2].  In  particular,  the  study 
of  the  adsorption  of  C02  on  TM  surfaces  is  potentially  important,  since  C02  is  a 
reaction  product  in  the  oxidation  of  CO,  the  catalytic  hydrogenation  of  C02  is 
quite  important  in  organic  chemistry,  and  C02  can  serve  as  an  unexpensive  reagent 
in  some  reduction  processes  [3].  Although  there  are  many  experimental  studies  of 
the  adsorption  of  the  carbon  monoxide,  CO,  on  metal  surfaces  [4,5,6]  the  adsorption 
of  C02  has  received  less  attention.  Recently,  experimental  and  theoretical  chemists 
have  been  interested  in  the  study  of  the  bonding  of  C02  to  tm  clusters  and  surfaces. 
They  have  tried  to  determine  its  coordination  behavior,  so  as  to  be  able  to  consider 
the  bonding  and  the  reaction  pathways  [7]  (ground  and  transition  states). 
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Even  though  that  CO2  is  highly  unreactive,  it  is  immediately  activated  in  the 
presence  of  a  tm  surface.  There  have  been  proposed  two  ways  for  the  fixation  of 
C02  by  a  metal  surface:  physisorbed  linear  C02  species  and  chemisorbed  (negatively 
charged)  bent  CO25  species.  The  latter  form  was  predicted  theoretically  [8],  and  it 
has  been  observed  on  clean  surfaces  of  Ni(l  10)  [9,10]  as  well  as  on  surfaces  which 
have  been  modified  via  alkali  adsorption  [1 1,12,13]. 

There  is  IR  spectroscopic  evidence  which  supports  the  existence  of  several 
M — C02  complexes.  For  instance,  1  -to- 1  M — CO2  complexes  have  been  made  with 
the  following  metals  [14]:  Ni,  Li,  Na,  K,  and  Cs.  Also,  some  research  has  been  done 
for  Ti,  V,  Fe,  Co,  Ni,  and  Cu  [15, 16].  In  the  case  of  Pd,  some  complexes  have  been 
identified  in  which  the  Pd  atom  is  coordinated  to  C02  and  to  other  ligands  [17]. 
But  the  1-to-l  Pd — C02  complexe  has  not  been  reported.  Recently,  Brosseau  et  al. 
[18],  by  means  of  HREELS  spectroscopy,  have  shown  conclusively  that  the  C02 
molecule  is  chemisorbed  (in  the  presence  of  H20)  onto  the  Pd(l  10)  surface  as  a 
negative  moiety  CO25.  A  bent  C2v  geometry  was  suggested  for  the  chemisorbed 
CO26,  the  bending  and  the  stretching  vibrational  modes  were  assigned  at  770  and 
1200  cm-1,  respectively.  Later  on,  the  same  authors  [19]  redid  the  experiments  and 
found  another  peak,  at  1631  cm-1;  the  previous  observations  were  moved,  from 
770  and  1200  cm-1,  to  786  and  1 199  cm-1.  The  bending  mode  at  786  cm-J  and 
the  symmetric  stretch  at  1199  cm-1  are  allowed  for  the  Cs  and  the  C2V  symmetry, 
while  the  asymmetric  mode  at  1631  cm-1  is  dipole  forbidden  for  C2v.  This  suggests 
Cs  symmetry  for  C026  chemisorbed  on  Pd(l  10)/H2O. 

Clearly,  theoretical  calculations  would  be  of  interest  to  confirm  or  deny,  and 
further  describe  the  mode  of  bonding  of  CO2  to  Pd,  and  to  other  TM  systems  [1]. 
There  is  a  theoretical  study  of  C02  on  Ni  surfaces  which  addresses  the  bonding  of 
the  moiety  [8],  but  there  have,  to  our  knowledge,  been  no  calculations  for  CO2 
bonded  to  Pd  atoms  or  surfaces.  In  this  contribution  we  present  a  theoretical  analysis 
for  pd — C02,  which  has  been  performed  by  means  of  a  density  functional  method. 
Our  goal  is  a  determination  of  the  preferred  coordination  modes,  as  well  as  the 
estimation  of  chemisorption  energies,  bond  lengths,  and  vibrational  frequencies. 
Although  Pd — C02  is  a  minimal  model,  it  will  be  shown  that  some  of  our  theoretical 
estimates  correlate  with  the  experimental  results  obtained  for  the  adsorption  of 
C02  on  infinite  palladium  surfaces.  Our  results  provide  insight  into  the  Pd — C02 
bonding  and  provide  support  to  some  previous  assignments  of  the  vibrational  modes, 
which  have  been  made  on  experimental  grounds.  As  we  shall  see,  a  quite  different 
picture  is  emerged  from  our  calculations  for  Pd — C02  vs.  Ni — C02 . 

Computational  Details 

We  have  used  the  code  deMon  [20,21,22],  which  implements  a  linear  combination 
of  Gaussian-type  orbitals-density  functional  method.  Calculations  were  done  at  the 
local  spin  density  approximation  (lsda)  level,  using  the  Vosko,  Wilk,  and  Nusair 
(VWN)  [23]  functional.  Nonlocal  density  gradient-type  corrections  were  included 
self-consistently,  using  the  functional  of  Perdew  and  Wang  (1986)  for  exchange 
[24],  and  Perdew  (1986)  for  correlation  [25,26].  These  calculations  will  be  labeled 
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simply  as  nl  (for  nonlocal).  The  (5211/41 1/1)  orbital  basis  sets  for  carbon  and 
oxygen,  optimized  for  dft  calculations  [28]  were  used.  These  bases  are  of  6-3 1G** 
quality.  The  core  electrons  of  the  Pd  atom  were  approximated  by  a  model  core 
potential  (MCP).  The  scalar  relativistic  effects  are  incorporated  into  the  mcp  [29]. 
The  4p64d95s]  electrons  were  included  explicitly  in  the  calculation,  for  which  a 
palladium(+16)(221 1/21 1 1/21)  orbital  basis  set  was  constructed.  The  auxiliary  basis 
sets,  used  in  the  fitting  of  the  charge  density  (CD)  and  the  exchange-correlation  (xc) 
potential  have  the  pattern  (5,2;5,2)  for  both  carbon  and  oxygen,  and  palla¬ 
dium^  16)(3,4;3, 4)  for  the  Pd  atom.  In  this  notation  (ku  k2 ;  h ,  &),  kx(h)  is  the 
number  of  s-type  gaussians  in  the  CD  (XC)  basis  and  k2(h)  is  the  number  of  s-, 
p-,  and  d-type  guassians  constrained  to  have  the  same  exponent  in  the  CD  (xc) 
basis.  The  charge  density  was  fitted  analytically,  while  the  xc  potential  was  fitted 
numerically  on  a  fine  grid  [27]  comprised  of  32  radial  shells  and  26  angular  points 
per  shell,  giving  rise  to  a  total  of  832  points  per  atom  [30]. 1  At  the  end  of  each  scf 
procedure,  the  exchange-correlation  contribution  to  the  energy  gradients  were  cal¬ 
culated  by  numerical  integration  on  an  augmented  set  of  grid  points  consisting  of 
the  same  32  radial  shells  with  either  50,  1 10,  or  194  angular  grid  points  [27]. 

Without  imposed  symmetry  constraints,  and  by  means  of  the  Broyden-Fletcher- 
Goldfarb-Shanno  (bfgs)  algorithm  [31],  the  geometries  were  optimized  by  mini¬ 
mizing  the  norm  of  the  gradient  with  a  threshold  of  10"6  a.u.  The  bfgs  algorithm 
searches  the  PES  through  its  first  derivatives,  evaluated  analytically,  and  through 
approximate  second  derivatives  (the  Hessian  matrix)  updated  numerically.  All  Re' s 
and  bond  angles  were  simultaneously  refined.  A  vibrational  analysis  was  done  for 
the  lowest  energy  structures.  Here,  the  harmonic  approximation  was  used  with  two- 
point  numerical  differentiation  of  gradients  [27].  Additionally,  the  infrared  inten¬ 
sities  were  calculated  by  two-point  differencing  of  the  dipole  moments.  In  our  model 
core  potential  spin-polarized  calculations,  the  standard  Mulliken  population  analyses 
were  done  to  obtain  the  atomic  charges. 

Results:  Coordination  Modes  of  Pd — C02 

We  have  done  an  exhaustive  search  for  the  possible  coordination  modes  of  C02 
to  a  single  palladium  atom.  In  Figure  1  are  shown  the  four  possibilities.  They  are: 
(a)  carbon-oxygen  (r/2-C O),  with  Cs  symmetry,  (b)  carbon  (^-C),  with  C2v  symmetry, 
(c)  oxygen  and  (d)  oxygen-oxygen  (?/2-00),  with  C2v  symmetry.  The  first 

and  the  last  are  dihapto  modes,  that  is,  the  metal  atom  is  bonded  to  two  ligand 
atoms,  whereas  b)  and  c)  are  monohapto  coordination  modes.  To  determine  which 
one  is  the  most  stable,  we  have  done  a  full  optimization  for  each  of  the  corresponding 
geometries.  Afterwards,  a  vibrational  analysis  was  made  for  the  converged  structures. 
The  results  are  shown  in  Table  I.  We  discuss  the  main  findings  below. 

The  tj2-CO  Coordination  Mode 

The  dihapto  t;2-CO  coordination  mode  was  found  as  the  one  of  lowest  energy. 
Here,  there  is  a  strong  charge  transfer  from  Pd  to  C02  (about  0.7  electrons;  see 

1  The  radial  grid  used  in  deMon  was  inspired  by  [30]  and  differing  only  in  the  fact  that  it  uses  the 
Gaussian-Legendre  quadrature  scheme  instead  of  Gauss-Chebyshev. 
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Figure  1 .  The  four  possible  coordination  modes  of  C02  to  Pd:  (a)  the  t?2-CO  mode,  (b) 
the  ^-C  mode,  (c)  the  v ‘-O,  and  (d)  the  t?2-0— O  mode.  The  input  and  the  converged 

structures  are  indicated. 

Table  II),  which  produces  a  large  structural  change  in  C02:  It  goes  from  a  linear  to 
a  bent  geometry  (our  vibrational  analysis,  see  below,  is  consistent  with  this  feature). 
In  this  772-CO  mode  (or  mixed  Pd— C/Pd— O  bond),  of  Q  symmetry,  there  is  an 
elongation  (with  respect  to  free  C02)  of  the  C=0  bond  involving  the  oxygen  atom 
nearest  to  the  metal  center  (see  Fig.  1).  The  two  C=0  bond  lengths,  in  Pd — C02, 
are  no  longer  equal;  they  are  1.20  and  1.24  A.  The  Mulliken  population  analysis 
(see  Table  2)  shows  that  the  metal  atom  is  positive  (+0.7),  the  carbon  atom  is 
positive  (+0.1),  the  oxygen  atom  nearest  to  the  Pd  center  is  negative  (-0.5),  and 
the  other  oxygen  atom  is  also  negative,  but  to  a  lesser  extent  (-0.3  electrons). 
Hence,  there  are  two  types  of  electrostatic  interactions  in  the  i?2-CO  mode,  repulsive 
between  the  metal  center  and  the  carbon  atom,  and  attractive  between  the  oxygen 
and  the  palladium  atoms,  the  latter  contributing  to  the  total  energy  lowering  of  the 
rj2-CO  mode.  These  two  types  of  electrostatic  interaction  are  shown  in  Figure  2. 

The  vibrational  analysis  for  the  t?2-CO  geometry  yields  no  imaginary  frequency, 
(see  Table  I)  showing  that  this  structure  is  a  local  minimum.  The  calculated  fre¬ 
quencies  associated  with  metal-carbon  and  metal-oxygen  modes  are  203,  373,  and 
561  cm-1.  In  particular,  373  cm-1  corresponds  to  the  symmetric  elongation  of  the 
Pd — C  and  Pd — O  bonds.  This  mode  favors  the  occurrence  of  a  mixed  carbon- 
oxygen  bond  in  Pd— C02.  The  bending,  symmetric,  and  asymmetric  vibrational 
modes  of  C02  in  Pd — C02  have  calculated  frequencies  of  689,  1212,  and  2018 
cm-1,  respectively. 

The  experimental  frequencies  of  C02  on  a  H2O/Pd(110)  surface  [19]  are  786, 
1 199,  and  163 1  cm"1,  respectively.  This  suggests  a  Cs  symmetry  because  the  asym- 
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Table  I.  Bond  lengths  Rei  bond  angles,  relative  energies,  A E,  and  vibrational  frequencies,  for  the 

C02  to  Pd  coordination  modes. 


v2  (Pd  — CO) 

V  (Pd-C) 

V  (Pd— O) 

Re( A) 

Pd— C 

1.99 

1.96 

c— O, 

1.24 

1.22 

1.18 

c— 02 

1.20 

1.22 

1.17 

Pd  — O, 

2.07 

Bond  angle 

O,—  C  —  Pd 

79.7 

104.6 

02— C— Pd 

129.9 

104.6 

o,— c— o2 

150.4 

150.8 

179.0 

Pd  — o,  — C 

146.0 

A E  (kcal/mol) 

0.0  (0.0)a 

+2.2  (+8.4)a 

+  13.8 

*/(cm_1)d 

Exp.e 

203 

-112 

97 

373 

311 

333 

561 

586 

538 

6895b 

7075b 

543 

786 

1212^ 

1245^ 

1321 

1199 

2012*4 

1974*4 

2398 

1631 

a  Nonlocal  results. 

b  Bending, c  symmetric,  and  d  asymmetric  stretch  of  C02. 
e  C02  on  Pd(l  10)/H2O,  from  Ref.  [19]. 

metric  mode  (1631  cm"1)  is  dipole-forbidden  in  C2v  symmetry  but  dipole-allowed 
in  Cs.  Two  of  the  observed  bands  agree  quite  well  with  our  calculations  for  the 
minimal  Pd— C02  model,  but  there  is  a  large  discrepancy  for  the  asymmetric  mode 
(calculated:  2018  cm-1,  experimental:  1631  cm"1).  However,  this  mode  is  quite 
sensitive  to  the  details  of  the  surface.  For  example,  Wohlrab  et  al.  [1 1]  have  deter¬ 
mined  the  frequencies  of  C02  on  a  Pd(l  1 1)/Na  surface;  the  observed  values,  744, 


Table  II.  Electron  populations  on  each  atom  (a  16-valence  electron  mcp  is  used). 


PdC02 
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Pd, 


C 
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Figure  2.  The  attractive  and  repulsive  electrostatic  interactions  for  the  r)2-C O  and  the 
7)i -C  coordination  modes. 


1210,  and  1530  cm-1,  show  a  close  correspondence  to  those  of  Brosseau  et  al.  and 
also  indicate  Cs  symmetry  as  the  likely  coordination  mode  of  Pd — C02.  However, 
the  asymmetric  mode  is  the  most  sensitive,  showing  a  large  difference  (about  100 
cm-1)  between  the  two  surfaces.  We  have  also  found  such  a  sensitivity  in  calculations 
involving  larger  clusters  (to  be  published  elsewhere).  The  asymmetric  mode  is  shifted 
to  lower  values  as  the  size  of  the  cluster  is  increased.  For  example,  in  Pd4 — C02 
that  frequency  is  equal  to  1843  cm-1,  which  is  closer  to  the  experimental  findings 
than  the  calculated  value,  2018  cm-1,  for  Pd — C02.  Calculations  for  an  infinite 
C02/Pd(l  10)  surface,  using  periodic  boundary  conditions  [32]  in  the  code  deMon, 
are  in  progress  in  our  laboratory;  this  will  complement  our  studies  using  small  Pd„ 
clusters. 

The  calculated  chemisorption  energy  of  C02  is  1.10  eV  for  the  lsda,  and  0.88 
eV  at  the  nl  level.  As  is  well  known,  the  lsda  substantially  overestimates  the 
binding  energies  in  tm-L  systems  [33],  the  introduction  of  NL  corrections  improves 
dramatically  the  calculations  of  the  binding  properties.  However,  an  accurate  es¬ 
timation  of  the  binding  energy  of  C02  on  palladium  surfaces  would  require  (in 
addition  to  nl  corrections)  the  use  of  larger  Pd„  clusters  or  an  infinite  Pd  surface. 
Both  approaches  are  in  progress  in  our  laboratory.  (For  a  Pd4 — C02  system,  the 
chemisorption  energies  are  2.67  eV  for  lsda  and  2.36  eV  at  the  NL  level). 

The  7]J-C  Coordination  Mode 

The  7}l-C  mode  (monocoordinated  C02  through  the  carbon  atom)  was  located 
2.2  kcal/mol,  for  the  lsda,  and  8.4  kcal/mol,  at  the  nl  level,  above  the  rj2-C O 
minimum.  (The  NL  approach,  a  higher  level  of  theory,  enhances  the  stability  of  the 
r)2-CO  structure).  Since  there  is  C2v  symmetry  in  the  tj1-C  mode,  the  two  C — O 
bonds  have  the  same  bond  length,  equal  to  1 .22  A;  it  is  interesting  to  observe  that 
this  value  is  the  average  of  the  two  unequal  C — O  bond  lengths  in  the  i?2-CO  structure 
(1.20  and  1.24  A).  Here,  there  is  also  a  charge  transfer  from  Pd  to  C02,  about  0.5 
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electrons  (see  Table  II).  This  charge  transfer  is  smaller  than  that  found  in  rf-CO. 
From  the  total  charge  distribution  it  is  observed  that  the  Pd  and  C  centers  are 
positive,  whereas  the  oxygen  atom  is  negative  (see  Table  II).  Hence,  there  is  a 
repulsive  electrostatic  interaction  between  the  metal  center  and  the  carbon  atom 
which  contributes  to  the  reduced  stability. 

Our  vibrational  analysis  shows  that  the  V-C  structure  is  not  a  minimum  but 
rather  a  transition  state,  since  there  is  an  imaginary  frequency  (associated  with  the 
Pd — C  and  Pd — O  vibrational  modes;  see  Table  1).  In  this  structure,  values  of  707, 
1245,  and  1974  cm-1  are  assigned  to  the  bending,  symmetric,  and  asymmetric 
stretching  vibrations  of  C02  in  Pd — C02.  There  is  an  increase  in  the  frequencies 
for  the  bending  and  symmetric  modes,  and  a  decrease  in  the  asymmetric  mode 
with  respect  to  our  calculated  values  for  the  rj2-CO  structure. 

The  rf-O  Mode 

The  oxygen,  V-O,  coordination  mode  was  located  much  higher  in  energy,  13.8 
kcal/mol  for  the  lsda.  It  is  interesting  to  observe  that,  in  this  structure,  the  C02 
molecule  remains  almost  linear,  slightly  perturbed  by  the  Pd  atom  and  there  is  only 
a  small  charge  transfer  from  the  metal  center  to  the  C02  moiety,  about  0.3  electrons. 
The  bonds  lengths  are  almost  the  same  as  in  the  free  linear  C02  molecule:  1.18 
and  1.17  A. 

Our  vibrational  analysis  for  the  rjl-0  mode  is  consistent  with  the  occurrence  of 
a  linear  C02  molecule  in  this  coordination  mode  (though  there  should  be  some 
differences  due  to  the  elongation  of  one  C  =  0  bond).  The  experimental  frequencies 
for  free  C02  are:  667,  1340,  and  2349  cm-1;  our  calculated  frequencies  are:  543, 
1321  and  2398  cm-1  for  C02  in  Pd — C02  in  this  coordination  mode.  Note  that 
the  1321  and  2398  cm-1  theoretical  estimates  are  quite  close  to  their  experimental 
counterparts,  1 340  and  2349  cm"1  for  the  bare  C02  molecule  but  the  bending  mode 
is  put  about  100  cm"1  below  the  bare  C02  value. 

The  r]2-0 — O  Mode 

It  was  not  possible  to  converge  the  t;2-0 — O  coordination  mode  for  Pd — C02, 
where  the  moiety  should  be  coordinated  by  the  two  oxygen  centers.  The  two  oxygen 
atoms  in  r\2-0 — O  starting  geometries  always  moved  away  from  the  Pd  center,  the 
result  converging  to  the  ??2( C — O)  mode,  which  is  the  one  of  lowest  energy,  as 
shown  above.  This  implies  that  in  the  r)2(0 — O)  structure  there  is  a  repulsive  in¬ 
teraction  between  the  Pd  and  the  two  oxygen  atoms.  We  have  examined  this  mode 
of  binding,  more  closely,  through  the  following  theoretical  experiment.  Since  the 
formate  ion  HCOO"5  and  C026  have  similar  effects  and  are  often  compared  [9], 
we  have  modeled  the  transition  of  the  formate  ion  into  the  C02  6  moiety  by  gradually 
taking  away  the  hydrogen  atom.  We  have  applied  this  process  to  both  NiHC02  and 
PdHC02 .  For  NiHC02 ,  our  calculations  agree  with  experimental  results  for  bulk 
surfaces  in  that  the  two  oxygen  atoms  are  down  [8]  (see  Fig.  3);  that  is,  in  NiC02 
the  preferred  coordination  mode  is  rj2( O — O)  [8].  In  contrast,  we  have  found  that 
while  pulling  away  the  hydrogen  atom  of  the  formate  moiety,  the  oxygen  atoms 
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Ni 

a  b  a  b 

Figure  3.  Evolution,  through  a  geometry  optimization,  of  the  rf-O — O  coordination 
mode  for  NiHC02  and  PdHC02.  The  input  and  the  converged  structures  are  indicated. 


are  displaced  upward  in  the  case  of  palladium  (see  Fig.  3).  This  leads  us  to  exclude 
the  possibility  of  adsorption  of  C02  by  the  coordination  of  the  two  oxygens,  the 
n2-0 — O  mode,  for  Pd.  The  t/2(0 — O)  structure  is  still  a  possible  mode  of  coordi¬ 
nation  of  C02  to  an  Ni(l  10)  surface,  since  it  is  close  in  energy  to  the  ground  state 
r}2(C — O)  mode. 


Conclusions 

The  preferred  coordination  modes  of  the  C02  molecule  to  a  single  palladium 
atom  have  been  determined  by  means  of  the  Gaussian  dft  code  deMon.  The  lowest 
energy  coordination  mode  is  ??2-CO ,  where  there  is  a  mixed  Pd — C/Pd — O  bonding. 
A  strong  charge  transfer  from  the  metal  to  the  C02  moiety  was  found,  which  pro¬ 
duces  a  strong  structural  change  in  C02:  It  goes  from  a  linear  to  a  bent  geometry. 
This  picture  and  the  results  of  our  vibrational  analysis  for  the  ??2-CO  mode  are  in 
agreement  with  the  experimental  results  for  the  adsorption  of  C02  on  the  H20/ 
Pd(l  10)  and  Pd(l  10)/Na  surfaces.  Theory  and  experiment  agree  in  that  the  ?y2-CO 
mode,  of  Cs  symmetry,  is  the  most  likely  for  the  chemisorption  of  C02  on  palladium 
surfaces,  since  the  asymmetric  mode  (observed  at  1530  or  1631  cm-1,  calculated 
at  2018  cm-1)  is  dipole-allowed  for  Cs  symmetry  and  dipole-forbidden  for  C2v.  In 
fact,  the  rjl-C  mode,  of  C2v  symmetry,  was  found  to  be  a  transition  state,  2.2  kcal/ 
mol  (lsda)  or  8.4  kcal/mol  (nl),  above  the  minimum.  The  asymmetric  mode  for 
chemisorbed  C02  is  quite  sensitive  to  the  details  of  the  surface.  [In  further  calcu¬ 
lations,  to  be  published  elsewhere,  this  is  directly  related  to  the  size  of  the  cluster. 
Using  a  Pd4 — C02  system,  we  have  calculated  1843  cm-1  for  the  asymmetric  mode, 
which  is  closer  to  the  experimental  observations.  Calculations  for  bigger  Pd„ — C02 
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systems  and  for  an  infinite  C02/Pd(l  10)  surface  are  in  progress  in  our  laboratory. 
These  will  give  more  insight  and  will  complement  the  present  study  of  the  minimal 
Pd— C02  model.]  The  ??2-0— O  mode  is  not  attainable  in  Pd— C02,  it  converges 
to  the  i?2-CO  mode,  the  ground  state.  This  is  a  quite  different  picture  to  that  found 
for  Ni — C02,  where  the  dihapto  r]2-0 — O  coordination  mode  is  nearly  degenerate 
with  the  j}2-CO  structure. 
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Abstract 

Structures  and  energies  have  been  calculated  for  HO,  H20,  02,  H02,  H202,  and  03  molecules  using 
the  nonlocal  functionals  pw86,  PW91,  B-P86,  and  B-lyp  with  the  goal  of  obtaining  their  atomization 
energies.  Results  were  compared  with  those  from  highly  correlated  methods  and  experiment.  It  was 
found  that  all  nonlocal  functionals  perform  similarly  to  or  better  than  correlated  methods  mp4  and  QCI 
(using  relatively  equivalent  basis  sets).  All  nonlocal  energies  were  self-consistently  calculated  using  the 
optimized  geometries  for  each  functional.  None  of  the  results  contain  any  empirical  correction  except 
those  inherent  to  some  of  the  functionals.  Increasing  the  size  of  the  basis  set  when  using  the  nonlocal 
functionals  does  not  lead  to  any  significant  improvement  of  the  energies  and  surprisingly  it  worsens  the 
results  for  one  of  the  functionals.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

We  are  interested  in  improving  the  accuracy  of  energies  calculated  using  nonlocal 
functionals.  The  main  goal  is  to  determine  what  corrections,  if  any,  are  needed  to 
compensate  for  the  errors  due  to  the  approximate  nature  of  the  exchange-correlation 
functionals,  and  for  the  errors  due  to  the  finite  character  of  the  basis  sets  used. 
Previous  works  on  this  matter  [1,2]  have  shown  that  corrections  were  possible  in 
some  cases,  especially  when  using  local  functionals.  In  the  present  work,  we  have 
focused  specifically  upon  the  smallest  neutral  molecular  systems  containing  only 
hydrogen  and  oxygen  atoms.  We  have  computed  geometries  and  atomization  ener¬ 
gies  using  state  of  the  art  nonlocal  functionals  and  compared  them  with  sophisticated 
ab  initio  results  and  with  precise  experimental  data.  The  evaluation  of  the  atom¬ 
ization  energies  is  one  of  the  most  challenging  tasks  for  ab  initio  methods.  Modem 
experimental  techniques  are  able  to  yield  atomization  energies  within  1  kcal/mol 
of  error  (chemical  accuracy). 

In  an  effort  to  develop  methods  able  to  yield  atomization  energies  of  chemical 
accuracy,  many  schemes  have  been  proposed  to  correct  ab  initio  energies,  including 
those  that  were  obtained  using  highly  correlated  methods.  Two  of  these  successful 
schemes  are  the  Gaussian  1  (G1 )  [3  ]  and  the  Gaussian  2  (G2)  [4]  procedures.  The 
G1  procedure  adds  to  the  molecular  atomization  energy  a  correction  of —3.85  kcal/ 
mol  times  the  difference  of  the  number  of  paired  electron  shells  between  the  molecule 
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and  the  constituent  atoms,  and  an  additional  0. 12  kcal/mol  per  each  singly  occupied- 
shell  difference.  Given  the  importance  of  the  above  two  differences,  we  define,  for 
any  molecule:  the  Double  Occupied  Difference  (dod)  as  the  difference  of  the  num¬ 
ber  of  doubly  occupied  MO’s  (Molecular  Orbitals)  in  the  molecule  minus  the  number 
of  doubly  occupied  AO’s  (Atomic  Orbitals)  in  the  constituent  atoms,  and  the  Single 
Occupied  Difference  (sod)  as  the  difference  between  the  number  of  singly  occupied 
AO’s  in  the  constituent  atoms  minus  the  number  of  singly  occupied  MO’s  in  the 
molecule.  For  a  small  molecule  like  water,  the  corrections  due  to  the  DOD  and  SOD 
yield  -7.2  kcal /mol,  which  is  many  times  bigger  than  the  chemical  accuracy.  The 
errors  in  the  atomization  energies  are  mostly  due  to  the  bonds  of  the  molecular 
systems.  The  bonds  have  a  complicated  character  that  always  require  the  use  of 
basis  sets  with  atomic  functions  of  high  angular  momentum  number.  Normally, 
high  angular  momentum  numbers  cannot  be  used  in  atomic  basis  sets  because  of 
technical  limitations  in  computational  resources. 


Methodology 

Density  functional  theory  (dft  ),  as  it  has  been  applied  in  this  work,  is  based  on 
the  Hohenberg-Kohn  theorem  [5]  and  the  Kohn-Sham  [6]  procedure,  dft  is 
strictly  speaking  a  first  principles  formalism,  and  it  can  be  developed  from  the 
nonrelativistic  Schrodinger  equation: 

77*  =  EV  ( 1 ) 

where  *  is  the  electronic  wave  function  (i.e.,  within  the  Born-Oppenheimer  ap¬ 
proximation  )  and  H  is  the  Hamiltonian  operator: 

H=vext+f+Vee  (2) 

where  uext  is  the  external  potential  due  to  the  nuclei,  T  is  the  kinetic  energy  operator, 
and  Vee  is  the  electron-electron  energy  operator.  Because  of  the  Hohenberg-Kohn 
theorem,  the  energy  of  the  system  can  be  written  as  a  functional  of  the  electron 
density  p  which  can  be  defined  as 

p(r)  =  JV<*|5(?-  ?t)\¥)  (3) 

where  5  is  the  Dirac  delta  function  and  N  is  the  total  number  of  electrons.  Because 
of  the  antisymmetric  character  of  the  wave  function,  r,  can  be  any  of  the  N  spatial 
variables  of  *.  To  express  the  total  energy  as  a  functional  of  p,  the  electron-electron 
operator  in  Eq.  (2)  is  multiplied  by  a  parameter  A  (0  <  A  <  l);asa  result,  a  new 
Hamiltonian  [  7  ]  is  implemented: 

Hx  =  f  +0X+  \Vee  (4) 

whose  eigenfunction  and  eigenvalue  are  *x  and  Ex,  respectively,  and  they  are  related 
by 


77x*x  =  £x*x  • 


(5) 
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in  Eq.  (4)  is  chosen  such  that  the  density  for  any  X  is  always  constraint  to  be  the 
same  as  the  density  for  X  =  1  [8,9],  Therefore,  for  X  =  0  we  have  a  system  of 
noninteracting  electrons  with  density  identical  to  the  density  of  the  real  system. 
The  total  energy  for  this  new  Hamiltonian  is 

Ex  =  <'J'x|7ix|tfx>  =  J  «x(7)p(7)rf?  +  (’J'xIT'  +  •  (6) 

By  taking  partial  derivative  with  respect  to  X,  invoking  the  Hellmann-Feynman 
theorem  and  integrating  between  0  and  1,  we  obtain 

E(p)  =  J  i'ext(7)p(7)  dr  +  Ts  +  <¥x|  K«|¥x>  d\  (7) 

where  Ts  is  the  kinetic  energy  of  the  noninteractive  system  of  electrons  (X  =  0). 
From  the  above  expression  the  exchange-correlation  functional,  Exc,  is  defined 

<*x|FJ*x>  d\  =  1  J  J  dr,  d72  +  Exc(p)  (8) 

where  the  double  integral  is  the  classical  electron-electron  energy  interaction  Fclass. 
Finally,  the  total  energy  of  the  interactive  or  real  system  ( X  =  1 )  is  exactly 

E[p]  =  J  vext(r)p(r)  dr  +  7^[p]  +  Eciass[p]  +  Exc[p]  (9) 

where  the  Exc[p]  will  have  to  be  approximated  because  it  can  only  be  evaluated  in 
special  cases  like  the  uniform  electron  gas  (jellium).  Once  an  exchange-correlation 
functional  has  been  chosen,  the  variational  principle  [1]  of  dft  is  invoked;  Con¬ 
sequently,  the  minimization  of  the  total  energy  functional,  Eq.  (9),  with  respect  to 
the  density  is  straightforward.  In  this  work  we  are  using  a  group  of  nonlocal  func¬ 
tionals  referred  to  as  generalized  gradient  approximations  (gga). 

We  have  used  the  programs  deMon  [10,11]  and  Gaussian  92 /dft  [12].  The 
calculations  with  the  functionals  Perdew-Wang  86  (pw86)  [13,14]  and  Perdew- 
Wang  9 1  ( PW  9 1 )  [  1 5- 1 7  ]  were  done  using  the  program  deMon,  and  the  calculations 
with  the  combination  of  functionals  Becke  exchange  [18]  and  correlation  pw86 
(B-PW86)  as  well  as  the  combination  Becke  exchange  and  Lee-Yang-Parr  corre¬ 
lation  (B-lyp)  [19]  were  done  using  the  program  Gaussian  92 /dft.  Two  basis 
sets  were  used  [20]  with  the  program  deMon:  Double-  f  valence  plus  polarization 
on  all  atoms  (dzvp2)  and  its  extension  to  a  triple- f  (tzvp2).  Two  basis  sets  were 
used  [12]  with  the  program  Gaussian  92 /dft:  the  standard  6-3 1G**  and  the 
largest  built-in  6-31 1++G(3df,3pd).  The  dzvp2  and  the  6-31G**  are  of  equal 
size  and  almost  equivalent,  and  they  will  also  be  designated  as  (D).  The  tzvp2 
and  the  full  6-31 1++G(3df,3pd)  will  be  designated  as  (T)  and  ( F ),  respectively. 
DFT  results  were  compared  with  those  obtained  using  the  ab  initio  method  Moller- 
Plesset  (mp4)  that  includes  single,  double,  triple,  and  quadruple  substitutions;  and 
with  those  obtained  using  the  quadratic  configuration  interaction  (QCI)  method 
that  includes  single,  double,  and  perturbative  triple  substitutions.  The  mp4  calcu¬ 
lations  were  done  using  the  6-3 1 1G*  *,  6-3 1 1 +G*  *,  and  6-3 1  lG(2df,p)  basis  sets. 
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The  QCI  calculations  were  performed  using  the  6-31 1G**  basis  set.  All  the  above 
ab  initio  procedures  are  part  of  the  Gaussian  1  (G1 )  procedure  [3]. 


Results 

Table  I  shows  the  atomic  energies  of  hydrogen  and  oxygen  atoms  together  with 
their  errors  with  respect  to  exact  values.  Both  pw  functionals  give  results  for  hydrogen 
in  excellent  agreement  with  the  exact  values,  with  the  old  pw  86  results  being  slightly 
better.  The  results  from  the  combination  B-P86  and  B-lyp  improve  when  a  full 
basis  set  is  used.  For  oxygen,  the  old  pw  86  does  not  perform  so  accurately;  however, 
the  new  PW  9 1  gives  a  large  improvement  in  the  total  energy.  The  combinations  B- 
P86  and  B-lyp  perform  similarly  for  oxygen  but  their  errors  increase  in  the  opposite 
direction  when  using  a  larger  basis  set.  This  suggests  that  with  some  intermediate 
size  basis  set,  the  error  for  the  oxygen  atom  can  be  made  zero;  however,  this  is  not 
necessarily  going  to  improve  the  study  of  the  chemistry  of  oxygen-containing  mol¬ 
ecules,  yet  it  is  going  to  make  the  selection  of  basis  sets  for  DFT  difficult.  In  this 
respect,  it  can  also  be  observed  in  Table  I  that  some  of  the  dft  energies  for  hydrogen 
are  lower  than  the  exact  energy  of  —0.5  hartrees.  This  pseudo  violation  of  the 
variational  principle  is  because  all  functionals  are  approximations;  therefore,  the 
calculated  energies  are  not  upper  bounds  to  the  exact  energies.  Another  related 
problem  that  can  also  be  observed  for  the  hydrogen  atom,  and  it  will  also  appear 
on  some  molecules  containing  hydrogen  atoms  is  that  the  energies  calculated  with 
the  tzvp2  are  bigger  than  the  energies  obtained  with  the  smaller  dzvp2  basis  set. 
The  reason  for  this  anomalous  behavior  is  the  auxiliary  hydrogen  basis  set  that  is 


Table  I.  Calculated  energies  of  H  and  O  atoms  and  errors  with  respect  to  the  exact  values. 


Hydrogen  H  error  Oxygen  O  error 

(hartrees)  (kcal/mol)  (hartrees)  (kcal/mol) 


pw86/dzvpp 

pw86/tzvpp 

PW91/DZVPP 

PW91/TZVPP 

B-P8  6/6-3 1G** 
B-lyp/6-31G** 

B-P86/6-31 1++G(3df,3pd) 
B-lyp/6-3  1 1  ++G(3df,3pd) 
mp4/6-31  1G** 

MP4/6-3 1 1 +G** 

MP4/6-31  lG(2df,p) 
qcisd(T)/6-31  1G** 

Exact 


— 0.500043 

-0.03 

-0.50001 

-0.01 

-0.49927 

0.46 

-0.49927 

0.46 

-0.49810 

1.19 

-0.49545 

2.86 

-0.50014 

-0.09 

-0.49772 

1.43 

-0.4998  lb 

0.12 

-0.4998  lb 

0.12 

-0.4998  lb 

0.12 

-0.4998  lb 

0.12 

-0.5 

— 75. 14784a 

-53.43 

— 75. 1 5608 

-58.60 

— 75.05879 

2.45 

-75.05980 

1.82 

-75.05028 

7.80 

-75.04696 

9.88 

-75.08096 

-11.46 

-75.08025 

-11.01 

— 74.93333b 

81.18 

— 74.93724b 

78.73 

— 74.96478b 

61.45 

— 74.93402b 

80.75 

— 74.9820c 

a  Reference  [2]. 
b  Reference  [3]. 
c  Estimated  in  Ref.  [3]. 
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unsuitable  to  be  used  in  conjunction  with  the  tzvp2  orbital  basis  set  and  the  pw 
functionals.  The  hydrogen  energy  obtained  from  the  ab  initio  methods  is  the  same 
as  the  one  obtained  with  the  Hartree-Fock  procedure  because  no  correlation  is 
involved  and  all  basis  sets  used  are  equivalent  for  the  hydrogen  atom.  The  error  of 
0.12  kcal/mol  corresponds  to  the  correction  of  —0.12  kcal/mol  used  in  the  G1 
procedure.  This  is  an  error  G1  procedure  corrects  whenever  a  single  electron  occupies 
a  molecular  orbital. 

For  oxygen,  the  ab  initio  errors  are  relatively  large,  but  this  is  not  of  major 
concern  because  the  core  electrons  are  not  correlated  on  either  the  mp4  or  qci 
calculations;  therefore,  when  calculating  atomization  or  any  other  energy  difference, 
most  of  these  errors  will  be  canceled  with  an  analogous  error  from  the  core  electrons 
in  the  molecule.  This  is  not  the  case  for  the  dft  energies. 

Table  II  shows  the  total  energies  for  the  molecules  using  various  levels  of  theory. 
Each  energy  corresponds  to  a  local  minimum  geometry  for  all  the  dft  procedures 
and  to  a  geometry  optimized  at  the  mp2  level  for  all  the  ab  initio  procedures.  The 
calculations  using  the  pw  86/dzvp2  were  done  using  an  earlier  version  of  the  pro¬ 
gram  deMon  with  slightly  different  functions  for  the  fitting  of  the  electronic  density. 

The  parameters  of  the  optimized  structures  for  various  nonlocal  functionals  and 
basis  sets  are  shown  in  Table  III.  Each  structure  corresponds  to  a  local  minima  self- 
consistently  evaluated.  We  have  also  tabulated  MP2/6-31G*  and  experimental  ge¬ 
ometries  for  comparison  purposes.  Although  the  energies  are  the  best  indicators  of 
the  quality  of  the  model  used,  the  geometries  furnish  additional  insight  on  the 
performance  of  the  various  functionals  and  basis  sets.  As  a  rule,  all  GGA  functionals 
overestimate  bond  lengths.  The  increase  in  size  of  the  basis  set  using  the  PW  86 
functional  does  not  lead  to  any  improvement  of  the  geometries;  the  bond  lengths 
become  slightly  worse,  angles  hardly  change,  and  the  dihedral  on  H202  gets  0.7 
degrees  worse.  The  pw91  geometries  are  slightly  better  when  increasing  the  basis 
set  size.  In  addition,  the  bond  lengths  are  improved  when  going  from  pw  86  to 
pw91,  but  the  bond  angles  get  worse.  There  is  a  perceptible  improvement  in  the 
dihedral  angle  of  H202  when  the  new  PW91  functional  is  used.  Comparing  only 
the  cases  where  a  double- f  valence  basis  set  is  used,  the  best  bond  lengths  are 
calculated  by  the  mp2  method  followed  by  those  calculated  using  the  pw  9 1  and  P- 
W86  functionals,  the  worst  bond  lengths  are  obtained  with  the  B-LYP  functional. 
The  best  bond  angles  are  obtained  with  the  PW  86  functional  followed  by  those 
obtained  with  the  B-P86,  mp2,  and  B-lyp  methods;  the  worst  bond  angles  are 
calculated  with  the  pw91  functional.  The  dihedral  of  H202  is  almost  perfectly 
reproduced  with  pw91,  and  with  slightly  less  quality  with  the  pw86  and  B-P86 
functionals.  The  worst  dihedral  is  calculated  with  the  MP2  procedure.  We  conclude 
that  no  definite  favored  method  can  be  advocated  from  analyzing  the  optimized 
geometries.  The  use  of  the  large  basis  set  6-31 1++G(3df,3pd),  with  B-P86  and  B- 
LYP  functionals,  reduces  the  errors  in  bond  lengths  and  angles  by  a  factor  of  two, 
but  the  dihedral  of  H202  is  spoiled  yielding  the  largest  errors.  Presently,  due  to  its 
high  demand  on  computational  resources,  the  full  basis  set  does  not  have  practical 
interest  for  larger  systems. 


Table  II.  Total  energies  (in  hartrees). 


SEMINARIO 


inininvSvSinwSirji/SiAv~)Tt 

M(NfN(N(NM(N(NMMMD 

N(N(NM(NtN(N(NMNtNfS 

I  I  I  I  I  I  i  I  I  I  I  I 


or^(NONOv->«NrocN}'vor^o 
Om-0000--1,M-'Oh 
—  O'  O  N  — 

^^O'OvOTifNfSt^'n'OfOin 


^Tt'O'^O'd'OOOONOOOOv 
OC'4'Ot--t'--“ 'ON'O— 1  OOOOO 
'tr''Ovo(N-''oa\(N  —  O't 
(J-MIN-ONhin-fNON- 
O-lO\O\O\00O\Ch'O^'CvO 


TffNoommr^o— 

MVO^mTft^'t'O'OVOOO 

'or~'^o»noor^  —  oo  oo  vo 
inmmmromr'loO’—O 


hM(N(NhhrtOhO^Ch 
ooooNmiAioovor^-OON'^to 
^(NM^ONOO^r^VO^OMVO 
nr'',  nrn-H^vo^r'OO— 'h- 

'O^d'O'O'O'O’O^o'O'O'O'vO 

r^r~-r-r-r-r-r--r-r'r-r-r- 

I  I  I  I  I  I  I  I  I  I  I  I 


^  vooivi-i'n'flinT)--- 
OOtN^Wh-MOOiTifNON^CM 
inmrnrfir,)(N^ir)OCiflrfiO\ 
(Nf^rfinN-iOVlOOaNW 
oc  oo  ^  ^  ^  in  in  >-0  >n 

i^inini^ini/iinini/iininio 

I  I  I  I  I  I  I  I  1  I  I  I 


TtiOOMOr'-«fSMM(NN 

cooocn  —  'sf-oo\'-or'-r-~r'~ro 
mONTi-'^t-vor^r-or-r-c^-oo 
oor~r--r'r-'or-'00^0'0'0 


UJs22  =  :o  +  o 

>  >  > 

N  N  N 


iD  O  -  - 
OO  OO  Oi  Ol 
£  £  £  £ 
Cu  CU  0,  d. 


so  cl^o  a-'O'-o^o  q 
oo  >-  oo  >-  I>  L;-  c/; 


Table  III.  Geometries  (distances  d  in  Angstroms,  angles  a,  and  dihedrals  w  in  degrees). 
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Table  IV.  Atomization  energies  including  zero  point  energies  (in  kcal/mol). 
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The  atomization  energies  are  shown  in  Table  IV.  For  purposes  of  comparing 
with  experiment,  the  zero  point  energies  (zpe)  from  the  G1  procedure  have  been 
added  in  all  cases.  We  also  include  in  Table  IV  the  energies  from  the  mp4  and  QCI 
models  used  in  the  G1  procedure.  The  errors  in  the  atomization  energies  with 
respect  to  experiment  are  shown  in  Table  V.  We  have  added  a  column  with  the 
average  of  the  absolute  errors.  The  average  errors  for  all  procedures,  DFT  and  ab 
initio,  are  very  far  from  the  chemical  accuracy.  This  is  very  unfortunate  because 
they  are  the  most  sophisticated  purely  ab  initio  (i.e.,  with  no  empirical  corrections) 
methods  existing  today.  The  ab  initio  procedures  shown  here,  mp4  and  QCi  using 
large  basis  sets,  are  hardly  applicable  to  systems  containing  more  than  four  or  five 
heavy  atoms  because  of  their  immense  requirement  of  computational  resources. 

The  best  functional  using  a  double- f  basis  set  for  this  small  set  of  molecules  is 
the  B-lyp  combination,  followed  by  the  pw86  and  PW91  functionals.  These  three 
functionals  yield  atomization  energies  that  are  comparable  to  or  better  than  those 
calculated  with  the  mp4  and  QCi  procedures  that  do  not  use  f-functions  but  larger 
basis  sets.  The  worst  atomization  energies  are  those  from  the  B-P86  combination; 
even  though,  the  separated  exchange  and  correlation  parts  work  as  expected  in  the 
other  two  cases  (B-lyp  and  pw86).  The  fact  that  the  energies  from  the  dft  cal¬ 
culations  are  at  least  comparable  with  those  from  the  mp4  and  QCi  procedures  that 
use  larger  basis  sets  is  very  promising  for  the  analysis  of  large  molecular  systems. 
The  dzvp2  or  the  6-3 1G*  *  basis  set  requires  of  only  15  basis  functions  per  heavy 
atom;  while  the  6-31 1G*  *,  which  is  the  minimal  basis  set  recommended  for  mp4 
or  QCi  calculations,  uses  18  functions.  The  6-311+G**  uses  22  and  the  6- 
3 1  lG(2df,p)  uses  30  basis  functions  per  oxygen  atom.  On  the  other  hand,  the  mp4 
and  QCI  methods  scale  like  N7  while  the  dft  methods  scale  like  N3  with  respect 
to  the  size  ( N  in  this  particular  case)  of  the  basis  set  or  the  system. 

The  increase  of  basis  set  size  directly  improves  the  atomization  energies  obtained 
from  the  mp4  calculations.  The  inclusion  of  f-functions  yields  a  marked  decrease 
of  errors.  This  is  not  the  case  with  the  dft  results.  The  upgrade  from  double- f  to 
triple- f  on  both  pw  cases  decreases  the  average  errors  by  about  0.5  kcal/mol.  In 
the  case  of  the  B-lyp  combination  the  improvement  is  only  0.9  kcal/mol;  however, 
the  size  of  the  basis  set  grows  from  15  to  39  basis  functions  per  heavy  atom.  In  the 
case  of  the  B-P86  combination  the  errors  increase  by  1.3  kcal/mol  becoming  the 
case  with  the  worst  atomization  energies  in  this  study.  It  is  recommended  that  the 
Perdew  correlation  functional  be  used  with  the  akin  Perdew  exchange  functional 
to  take  advantage  of  inherent  error  cancellations  [15-17]  between  them.  The  present 
results  show  that  a  basis  set  of  double- £  valence  with  a  simple  polarization  function 
on  all  atoms  is  enough  to  obtain  the  best  energies  that  can  be  achieved  with  this 
group  of  nonlocal  functionals.  DFT  methods  do  not  require  the  basis  sets  to  be  as 
large  as  the  ab  initio  methods  require.  Given  a  basis  set,  then  it  is  much  easier  to 
assemble  the  electronic  density  than  the  N-electron  wave  function.  The  complicated 
character  of  the  correlation  portion  of  the  energy  is  handled  by  the  correlation 
functional  in  dft;  the  more  sophisticated  this  functional  the  more  precise  the 
calculation  will  be.  Whereas,  in  an  ab  initio  calculation,  this  complicated  character 
of  the  correlation  portion  of  the  energy  is  handled  entirely  by  the  wave  function 


Table  V.  Errors  in  the  atomization  energies  (kcal/mol). 
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requiring  a  large  basis  set  with  high  angular  momentum  functions.  However,  it  is 
important  to  understand  that  the  definitions  of  correlation  energies  in  dft  and  ab 
initio  are  not  entirely  equivalent. 

The  DFT  methods  used  in  the  present  work  overestimate  atomization  energies 
while  ab  initio  correlated  methods  underestimate  them.  In  both  cases,  the  errors 
can  be  reduced  by  using  the  error  average  as  a  correction  to  the  energies.  Although 
the  ab  initio  mp4  and  QCi  errors  are  relatively  large,  straightforward  corrections 
can  be  made  to  these  values  because  the  errors  are  roughly  proportional  to  the 
number  of  electrons.  For  instance,  the  average  error  per  molecule  could  be  reduced 
from  16.3  kcal/mol  to  3.2  kcal/mol  if  the  atomization  QCi  energies  were  corrected 
by  a  1 . 1 875  kcal /mol  per  each  electron  in  the  molecule.  These  kinds  of  corrections 
are  possible  with  high-level  ab  initio  energies  [3,4]  as  well  as  with  local  dft  energies 
[1,2].  Besides  the  basis  set  additivity,  the  most  important  correction  in  the  G1 
procedure  is  the  high  level  correction  (HLC).  This  can  be  predicted  by  noticing  that 
the  errors  in  the  atomization  energies  are  proportional  to  the  dod.  Table  V  shows 
that  the  errors  increase  in  the  following  order:  first  HO  and  02  that  have  a  dod  of 
one,  followed  by  H20  and  H02  that  have  DOD  of  two,  and  finally  02H2  and  O3 
that  have  DOD  of  three.  The  3.9  kcal/mol  (more  exactly  3.85 )  error  in  the  hydrogen 
molecule  is  the  correction  used  in  the  G 1  per  each  fully  occupied  MO .  This  explains 
why  when  using  ab  initio  methods,  the  error  for  the  atomization  energy  of  the 
water  molecule  is  bigger  than  the  error  for  the  oxygen  molecule.  The  DFT  nonlocal 
energies  do  not  show  the  trend  in  errors  observed  in  the  ab  initio  results.  Although 
other  kinds  of  corrections  have  been  successfully  done  to  dft  energies  using  local 
functionals  [1,2] ,  the  nonlocal  results  are  not  suitable  for  such  corrections.  However, 
it  can  be  observed  that  the  nonlocal  energy  errors  are  strongly  correlated  to  the 
number  of  heavy  atoms  in  the  molecule  as  it  happens  with  two  local  functionals 
[  2  ] .  With  the  exception  of  the  B-lyp  functional,  all  the  other  nonlocal  functionals 
show  a  large  and  systematic  increase  in  errors  when  the  number  of  oxygens  increases 
from  one  to  three.  Therefore,  if  we  correct  the  pw  86  atomization  energies  by  7.46 
kcal  /mol  per  oxygen  atom  present  in  the  molecule,  the  average  absolute  error  is 
reduced  to  just  2.0  kcal/mol  from  the  original  12.5  kcal/mol.  In  both  cases,  dft 
and  ab  initio ,  the  errors  of  dissociation  energies  or  bond  energies  (when  the  fragments 
are  not  individual  atoms)  are  not  too  severe  because  the  errors  in  the  fragments 
will  cancel  the  error  in  the  molecule.  The  errors  are  proportional  to  the  number  of 
electrons  (ab  initio)  or  to  the  number  of  atoms  (dft);  therefore,  tolerances  from 
1  to  3  kcal/mol  in  bond  energies  are  always  anticipated  with  the  above  dft  methods 
using  modest  basis  sets.  This  dependence  of  the  error  on  the  number  of  heavy  atoms 
for  the  nonlocal  functionals  suggests  that  a  correction  of  nonuniversal  (depending 
on  the  external  potential)  character  is  required  in  most  of  the  functionals.  This 
would  be  the  case  if  the  complicated  effects  of  atoms  making  bonds  can  not  be 
properly  accounted  for  in  the  already  complicated  functionals  used  in  this  work. 
The  pw91  functional  is  the  only  free-parameter  functional  (i.e.,  deduced  purely 
from  first  principles  arguments).  However,  the  accuracy  of  results  without  correc¬ 
tions,  although  better  than  expensive  ab  initio  methods,  is  not  yet  of  the  chemical 
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accuracy  available  to  the  best  experiments;  therefore,  work  is  still  required  to  develop 
better  functionals. 
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Abstract 

Self-consistent  electronic  subband  structure  calculations  of  periodic  w-type  delta ( <5 ) -doped  layers  in 
Si  and  GaAs  are  performed  within  the  local  density-functional  approximation.  The  behavior  of  the 
energy  levels,  potential  profiles,  miniband  occupancies,  and  Fermi  level  position  with  the  variation  of 
the  dopant  concentration  ND  and  the  spacing  between  the  5-layers  ds  is  examined  for  Si  5-doping  in 
GaAs  and  Sb  5-doping  in  Si.  The  physical  properties  of  these  5-doping  structures  show  a  strong  dependence 
on  ds,  reflecting  a  transition  from  isolated  delta-wells  to  superlattices  as  ds  decreases.  The  crossover 
involving  the  change  from  a  two-dimensional  to  three-dimensional  behavior  is  discussed  for  both  kinds 
of  systems.  ©  1994  John  Wiley  &  Sons,  Inc. 


Introduction 

Improvements  in  epitaxial  growth  of  materials  have  allowed  for  a  better  control 
of  impurity  incorporation  in  semiconductors.  Nowadays,  there  has  been  an  in¬ 
creasing  interest  in  the  delta(5)-doped  systems,  thus  called  because  of  the  final 
target  of  achieving  impurities  distributed  in  one  atomic  layer.  In  these  planar  doping 
structures,  the  carriers  released  by  the  ionized  dopants  are  confined  in  the  V-shaped 
potential-well  induced  by  a  5-function-like  doped  sheet.  Such  two-dimensional  (2D) 
doping  sheets  have  been  used  as  parts  of  many  semiconductor  devices  and  will 
certainly  play  an  important  role  in  the  development  of  future  quantum  electronic 
devices  [1-4]. 

While  isolated  and  multiple  5-doped  layers  in  GaAs  have  been  the  subject  of 
intense  investigations  [  5- 1 0  ] ,  the  study  of  5-doping  in  Si  is  more  recent  due  mainly 
to  the  difficulties  in  achieving  well-controlled  impurity  incorporation  during  epitaxial 
growth  of  silicon  [11,12]. 

The  purpose  of  this  work  is  to  present  results  of  self-consistent  (SCF)  one-electron 
state  calculations  of  periodic  ft -type  5-doped  layers  in  GaAs  and  Si,  for  different 
doping  concentrations  ND  and  sheet  periods  ds.  Many-body  effects  such  as  exchange 
and  correlation  are  taken  into  account  within  the  local  density-functional  approx¬ 
imation  (lda).  It  is  shown  that  the  electronic  properties  produced  by  multiple  Si- 
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5-doping  in  GaAs  and  Sb-5-doping  in  Si  change  considerably  with  the  variation  of 
Nd  and  ds  and  a  transition  from  independent  isolated  5-wells  to  a  superlattice  is 
found  to  take  place. 


Theoretical  Model  and  Calculations 


We  consider  an  infinite  sequence  of  equally  spaced  n  -5-doping  layers  perpendic¬ 
ular  to  the  [001]  direction  of  a  GaAs  (or  Si)  host  crystal.  All  donors  are  assumed 
to  be  ionized  and  uniformly  distributed  within  a  range  of  10  A.  Since  low  temper¬ 
ature  is  usually  employed  to  reduce  diffusion  of  Si  and  Sb  out  of  the  doped  planes, 
no  significant  spread  of  the  donors  is  expected.  The  electrons  released  by  the  donors 
form  a  quasi  2D-electron  gas  which  screens  the  potential  of  the  positive  charges. 
The  subband  structure  of  the  systems  is  calculated  on  the  basis  of  the  effective  mass 
approximation.  Using  the  local-density  approximation  (lda),  the  electrons  will  be 
described  by  the  wave  function  eikxXeikyV  Fkz{z)  where  x  and  y  are  the  directions 
parallel  to  the  5-layers  and  Fkz(z)  is  the  solution  of  the  one-dimensional  Schrodinger 
equation 


"  — h 2  d2 
2m*  dz2 


V  (z) 


Fnkz{z)  =  En(kz)Fnkz(z ) 


(1) 


Here  V(z)  is  the  effective  potential  given  by  a  sum  of  the  Coulombic  potential 
Vc(z)  and  the  local  exchange-correlation  potential  VXc(z )>  n  is  the  subband  index, 
kz  is  the  electron  wave  vector  along  the  z-direction,  and  m*  is  the  electron  effective 
mass  in  GaAs.  In  the  case  of  silicon,  due  to  the  many-valley  structure  of  the  Si 
conduction  band  and  the  anisotropy  of  the  electron  effective  mass  in  a  single  valley, 
the  2D  electron  gas  within  the  5-region  is  described  by  two  kinds  of  free  electron 
states  with  different  effective  masses  m *  and  mf ,  corresponding,  respectively,  to 
the  longitudinal  and  transverse  axis  of  the  Si  constant-energy  ellipsoidal  surfaces 
[13] .  Two  types  of  minibands  are  to  be  distinguished  in  this  case:  one  arising  from 
conduction  band  valleys  with  mf  parallel  to  the  z-axis,  and  one  due  to  valleys  with 
mf  perpendicular  to  the  z-axis.  Details  of  the  model  can  be  found  elsewhere  [  9, 1 3  ] . 

Equation  ( 1 )  has  to  be  solved  self-consistently  for  the  electrons  in  connection 
with  the  Poisson  equation  which  determines  Kc(z).  Eq.  ( 1 )  is  numerically  integrated 
by  adopting  the  Wigner-Seitz-Slater  version  of  the  cellular  method  with  space 
filling  one-dimensional  cells  and  the  exact  point-match  approach  [14]  with  the 
boundary  conditions 


(2a) 

(2b) 


The  Eqs.  (2a)  and  (2b)  assure  the  continuities  of  the  envelope  functions  and  their 
normal  derivatives  at  the  cell  boundaries,  respectively. 
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The  Coulombic  potential  Vc(z),  due  to  the  electrostatic  electron-electron,  and 
electron-ionized  donors  interactions,  is  obtained  from  the  Poisson  equation 


d2Vc(z) 

dz2 


-  —  [n(z)  -  Nd(z) ], 
e 


(3) 


with  n(z)  being  the  electron  density  which  is  given  by 

n(z)  =  2  Nnk2{z)\Fnki(z)\2  (4) 

n,kz 

Assuming  zero  temperature,  the  distribution  function  becomes  6[EF  —  En(kz)], 
where  EF  is  the  Fermi  energy  and  6  is  the  Heaviside  step  function.  Then,  one  gets 

yyi  * 

Nnkz  =  —2[Ef-  En(kz)]6[EF  -  En(kz)]  .  (5) 

For  the  exchange-correlation  potential  VXc(z)  we  have  assumed  the  well-known 
analytic  expression  suggested  by  Hedin  and  Lundqvist  [15].  The  values  used  for 
the  effective  masses  m*i  mf  and  m *  are  0.068mo,  0.1905ra0,  and  0.9163mo,  re¬ 
spectively,  where  m0  is  the  electron  mass.  For  the  dielectric  constants  of  GaAs  and 
Si  we  adopted  the  values  12.5e0  and  12.1c0,  respectively  [16]. 


Results 

Figure  1  shows  typical  results  for  Si  6-doped  GaAs  layers,  for  ND  -  3  X  10 12 
cm-2  and  three  different  values  of  ds.  For  the  larger  value  of  ds  =  800  A  four  discrete 
subband  levels  are  found  occupied.  The  general  aspect  of  the  potential  and  charge 
density  profiles  does  not  change  in  a  significant  manner  with  the  decrease  in  ds.  It 
is  more  interesting  to  look  at  the  variations  of  the  potential  depth  (F0),  the  Fermi 
level  position  ( EF ),  and  miniband  widths,  which  contain  the  information  physically 
more  significant  in  practice.  As  can  be  seen  in  Figure  1,  the  potential  depth  V0 
decreases  as  ds  decreases.  The  general  trend  of  the  Fermi  level  EF  shows  that  it 
remains  practically  constant,  suffering  only  a  slight  increase  for  ds  <  70  A.  When 
the  6-wells  are  wide  apart  and  their  potentials  have  negligible  overlap,  the  system 
behaves  as  a  set  of  independent  isolated  wells;  as  ds  decreases,  their  coupling  is 
increased,  and  minibands  start  to  appear  characteristic  of  superlattice  behavior. 
Similar  results  are  obtained  for  Sb  6-doping  in  Si. 

In  Table  I  are  depicted  the  values  of  V0,  EF,  Eif  A Eit  and  ni}  corresponding, 
respectively,  to  the  potential  depth,  Fermi  level  position,  energy  of  the  bottom  of 
the  subbands,  their  energy  dispersion  (obtained  as  A E  =  \Et{kz  =  0)  —  Et{kz  = 
tt !ds)  | ),  and  the  subband  occupancy,  for  Si  6-doping  in  GaAs  with  ND  -  3  X  10 12 
cm-2  and  for  ds  varying  between  800  and  40  A.  As  expected,  higher  subbands  start 
to  be  populated  as  ds  increases.  This  behavior  is  accompanied  by  a  narrowing  of 
the  minibands,  indicating  that  confinement  is  enhanced  in  this  case. 

Table  II  shows  the  results  obtained  for  V0,  EF,  Et ,  A Et,  and  nt  for  Sb  6-doped 
layers  in  Si  for  a  typical  donor  concentration  of  ND~  1.3  X  1013  cm-2,  and  for  ds 
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Figure  1 .  Band  diagram  scheme  for  periodic  Si  5-doped  layers  in  GaAs  for  ND  =  3  X 
10 12  cm-2  and  a  spread  of  10  A.  In  (a)  ds  =  800  A,  in  (b)  ds  =  220  A,  and  in  (c)  ds  =  70 
A.  Potential  profiles  and  subband  energies  are  drawn  to  scale  as  obtained  from  SCF  cal¬ 
culations.  Ei  indicates  the  bottom  of  the  electron  energy  subbands.  The  Fermi  level  EF  is 
indicated  by  a  dashed  line. 


varying  between  300  and  50  A  which  covers  the  entire  range  of  periods  in  which 
the  transition  from  isolated  wells  to  a  superlattice  of  5-wells  is  observed.  It  is  worth 
noting  that  since  the  effective  masses  in  silicon  are  larger  than  in  GaAs,  for  the 
former  we  expect  the  electrons  will  be  less  delocalized.  This  is  indeed  seen  in  the 
results  shown  in  Tables  I  and  II.  Whereas  for  5-doping  in  Si  the  system  already 
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Table  I.  Characteristics  of  the  periodic  Si  5-doped  GaAs 
systems,  extracted  from  the  SCF  calculations  for  ND  =  3  X 
1012  cm"2.  V0  is  the  potential  depth,  EF  is  the  Fermi  energy, 
Ei  is  the  energy  of  the  bottom  of  the  subbands,  A  Et  is  the 
energy  dispersion  (see  text)  and  w,  is  the  subband  occupancy. 
The  values  of  V0,  EF,  Eh  and  Ais,  are  in  meV.  The  occupancies 
are  given  in  units  of  1012  cm”2. 


ds( A) 

800 

400 

220 

100 

40 

Vo 

119.7 

113.8 

98.9 

65.4 

29.3 

ef 

119.6 

119.6 

119.9 

122.7 

125.1 

E0 

48.1 

48.2 

47.5 

34.3 

19.5 

A£o 

— 

— 

1.6 

42.3 

336.4 

no 

2.03 

2.03 

2.06 

2.51 

3.00 

Ex 

96.0 

96.0 

91.0 

105.0 

— 

A  Ex 

— 

1.2 

20.1 

153.3 

— 

«1 

0.67 

0.68 

0.82 

0.49 

— 

E2 

111.1 

109.5 

115.7 

— 

— 

A  E2 

0.1 

7.9 

50.2 

— 

— 

n2 

0.25 

0.29 

0.12 

— 

— 

E 3 

117.9 

— 

— 

— 

— 

A  E3 

1.8 

— 

— 

— 

__ 

«3 

0.05 

— 

— 

— 

— 

behaves  as  a  set  of  independent  wells  for  ds  >  1 50  A,  in  the  case  of  GaAs  the  separate 
well  behavior  is  observed  only  for  ds  «  800  A.  We  should  mention  that  the  values 
of  ds,  for  which  the  transition  from  isolated  to  superlattice  behavior  takes  place, 
are  practically  independent  of  the  donor  concentration  ND. 

In  the  range  of  donor  concentrations  investigated  for  5-doping  in  Si  (8  X  10 12 
cm-2  <  Nd  <  3  X  10 14  cm-2),  the  transition  from  2D  to  3D  behavior  is  observed 
only  for  high  doping  densities,  i.e.,  ND  >  2  X  10 13  cm-2,  and  short  periods  ds  « 
50  A,  when  the  Fermi  level  is  found  to  lie  above  the  potential  barrier  between  the 
5-layers.  We  note,  however,  that  although  part  of  the  electron  gas  exhibits  a  3D 
character,  the  percentage  of  3D  electrons  is  very  low  ( « 1-2%  of  the  total  electron 
concentration).  Shown  in  Figure  2  is  a  plot  of  the  Fermi  level  position  EF ,  relative 
to  the  bottom  of  the  5-well,  as  a  function  of  ND  for  ds=  50  A.  We  observe  that  EF 
is  linear  for  ND  <  10 14  cm-2,  reflecting  the  2D  character,  while  for  larger  values  of 
Nd  the  behavior  is  typically  3D.  These  results  could  be  an  indication  that  even  for 
electron  densities  as  high  as  « 10 14  cm-2,  the  real  behavior  is  still  not  actually  free- 
electron  like.  We  mention,  however,  that  these  “delocalized”  electrons  could  cause 
a  remarkable  increase  of  the  mobility  along  the  [001]  direction.  Further  experiments 
would  be  required  in  order  to  confirm  these  predictions. 

Conclusions 

In  conclusion,  we  have  presented  theoretical  results  of  the  electronic  subband 
structure  of  periodic  ft -type  5-doped  layers  in  GaAs  and  Si,  within  the  local-density 
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Table  II.  Characteristics  of  the  periodic  Sb  5-doped  Si 
systems,  extracted  from  the  scf  calculations  for  ND=  1.3  X 
1013  cm-2.  The  quantities  V0,  EF,  E(,  and  A Et  are  the  same 
as  in  Table  I  and  are  given  in  meV.  The  subband  occupancies 
n,  are  in  units  of  1013  cm'2.  Transverse  levels  are  primed  while 
the  longitudinal  ones  are  unprimed  (see  text). 


ds{  A) 

300 

200 

150 

100 

50 

Vo 

99.1 

99.2 

99.2 

99.0 

92.4 

ef 

98.8 

98.8 

98.7 

97.7 

91.2 

Eo 

39.6 

39.6 

39.6 

39.7 

39.1 

A£o 

— 

— 

— 

— 

4.0 

«o 

0.51 

0.51 

0.51 

0.5 

0.45 

E'o 

62.9 

62.8 

62.5 

60.4 

48.6 

AE'0 

— 

0.10 

0.8 

6.3 

60.2 

n’o 

0.68 

0.68 

0.68 

0.71 

0.81 

Ex 

86.9 

87.0 

87.0 

86.5 

85.9 

A  Ex 

— 

— 

0.2 

2.5 

32.1 

0.10 

0.10 

0.10 

0.09 

0.05 

e2 

98.0 

97.9 

97.8 

— 

— 

A  E2 

0.3 

1.3 

3.6 

— 

— 

n2 

0.01 

3.6 

0.01 

— 

— 

approximation,  for  different  doping  periods  and  donor  concentrations.  The  physical 
properties  of  these  systems  are  strongly  dependent  on  period  and  donor  concen¬ 
tration,  showing  a  transition  from  isolated  wells  to  superlattices.  We  expect  that, 
particularly  in  the  case  of  6-doping  in  silicon  for  which  less  experimental  work  is 
available,  magnetotransport  measurements  can  verify  the  validity  of  our  calculation. 
Also,  experimental  results  are  needed  to  provide  a  guide  for  further  theoretical 
efforts. 


Figure  2.  Dependence  of  the  Fermi  level  position  £>,  relative  to  the  bottom  of  the  wells 
placed  at  the  zero  of  the  energy,  on  the  donor  sheet  concentration  ND  for  periodic  Sb  6- 
doped  layers  in  Si  with  ds  =  50  A. 
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Abstract 

The  work  functions  and  surface  energies  of  Al(  111)  films  ranging  from  one  to  seven  layers  thick  have 
been  calculated  using  the  linear  combinations  of  Gaussian  type  orbitals-fitting  function  (lcgto-ff) 
technique,  as  implemented  in  the  program  package  FILMS,  an  all-electron  full-potential  electronic  structure 
method.  Both  quantities  exhibit  significant  quantum  size  effect  (QSE),  in  basic  agreement  with  three 
previous  investigations  using  more  approximate  techniques.  However,  there  are  significant  quantitative 
differences  among  the  four  sets  of  results.  ©  1994  John  Wiley  &  Sons,  Inc.* 


Introduction 

It  has  been  nearly  two  decades  since  it  was  first  demonstrated  that  the  work 
function  of  a  jellium  ultra-thin  film  (utf)  exhibits  quantum  oscillations  as  a  function 
of  film  thickness  [1,2].  That  discovery  stimulated  a  number  of  theoretical  studies 
directed  toward  determining  the  size  and  extent  of  this  so-called  quantum  size  effect 
(qse)  in  the  work  functions  of  real  systems.  Between  1983  and  1986,  a  series  of 
three  systematic  investigations  of  thickness  dependencies  in  the  work  function  and 
surface  energy  of  an  fee  Al(l  11)  film  made  it  clear  that  a  free  standing  (unsupported) 
metallic  UTF  composed  of  discrete  atoms  also  could  exhibit  a  significant  QSE  [3- 
5].  Unfortunately,  it  was  soon  demonstrated  that  there  is  little  hope  of  actually 
observing  a  work  function  QSE  because  of  the  need  to  grow  a  metallic  film  on  an 
electronically  inert  substrate  in  a  controlled  layer-by-layer  fashion  [4]. 

Although  it  seems  unlikely  that  the  work  function  QSE  will  ever  be  observed 
experimentally,  there  has  been  a  continuing  interest  in  studying  the  QSE  in  various 
properties  of  free  standing  films  for  both  fundamental  and  pragmatic  reasons.  Fun¬ 
damentally,  one  may  hope  to  uncover  an  observable  static  QSE  in  other  properties. 
Practically,  perhaps  the  most  important  theoretical  tool  used  to  study  surfaces  is 
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the  UTF  approximation.  If  a  surface  can  be  modelled  adequately  by  a  reasonably 
thin  film,  the  properties  of  that  surface  can  be  calculated  using  modem  high-precision 
UTF  electronic  structure  techniques.  Obviously,  the  adequacy  of  the  utf  approxi¬ 
mation  will  depend  on  the  extent  of  the  QSE  in  the  relevant  properties  of  the  film. 
Thus,  it  is  imperative  to  determine  carefully  the  magnitude  of  any  QSE  in  a  given 
utf  prior  to  using  that  utf  as  a  model  for  a  surface. 

During  the  past  decade,  the  linear  combinations  of  Gaussian  type  orbitals-fitting 
function  (lcgto-ff)  technique,  as  embodied  in  the  computer  code  FILMS  [6], 
has  been  used  extensively  to  study  QSE  in  the  electronic  and  structural  properties 
of  numerous  UTF  systems;  see  Ref.  [7]  for  a  review.  Most  of  the  work  done  to  date 
with  FILMS  has  focussed  either  on  very  thin  films  (1  or  2  layers  thick)  or  very 
small  atoms  (H,  Li,  Be,  C)  because  of  the  large  demands  FILMS  makes  on  computer 
resources,  particularly  CPU  time.  Recently,  two  of  us  (UB  and  NR)  have  imple¬ 
mented  a  number  of  improvements  in  the  algorithms  used  in  FILMS,  thereby 
achieving  a  significant  reduction  in  the  needed  CPU  time  [8].  This  improved  version 
of  FILMS  was  then  used  to  calculate  the  properties  of  MgO  films  up  to  5  layers 
thick  [9].  Adsorption  systems,  such  as  hydrogen  on  Li(001)  and  ethylene  on  Ni(l  10), 
also  have  been  investigated  recently  [10]. 

In  the  present  investigation,  FILMS  has  been  used  to  study  the  work  function 
and  surface  energy  QSE  of  unrelaxed  Al(  111)  films  ranging  from  one  to  seven  layers 
thick.  Although  the  Al(lll)  film  was  the  first  atomic  utf  to  be  systematically 
examined  for  QSE  [3-5],  none  of  those  early  studies  employed  all-electron  full- 
potential  band  structure  techniques  comparable  to  the  current  state-of-the-art.  Fur¬ 
thermore,  there  are  significant  differences  among  the  three  existing  sets  of  results 
which  remain  to  be  resolved.  Finally,  the  present  work  is  intended  as  a  precursor 
to  a  more  demanding  investigation  of  alkali-metal  adsorption  on  the  Al(  111)  surface; 
providing  the  requisite  study  of  convergence  with  respect  to  thickness  mentioned 
earlier.  Thus,  considerable  effort  has  been  devoted  here  to  developing  compact 
orbital  and  fitting  function  basis  sets  to  be  used  in  future  calculations  involving  Al. 

The  organization  of  the  remainder  of  this  article  is  as  follows.  First,  the  early 
work  on  QSE  in  Al(l  1 1)  films  is  reviewed.  The  lcgto-ff  technique  and  the  basis 
sets  used  then  are  discussed.  The  current  results  are  presented  in  the  penultimate 
section.  A  few  concluding  remarks  are  given  in  the  final  section. 

Background 

In  1983,  Feibelman  [3]  conducted  the  first  systematic  investigation  of  QSE  in  the 
work  function  and  surface  energy  of  a  real  metal  film.  That  work  studied  1,  2,  3, 
4,  and  6  layer  thick  Al(  111)  films  using  a  linear  combinations  of  atomic  orbitals 
(lcao)  electronic  structure  method.  The  work  functions  were  calculated  both  for 
ideal  (bulk)  geometries  and  for  partially  relaxed  geometries,  while  the  surface  energies 
only  were  obtained  at  the  relaxed  geometries.  Those  lcao  calculations  employed 
the  Wigner  interpolation  form  of  the  local  density  approximation  (lda)  to  exchange- 
correlation  (XC)  [11],  used  a  frozen-core  approximation,  and  fitted  the  charge  density 
and  the  XC  integral  kernels  with  5-type  GTOs.  The  irreducible  part  of  the  Brillouin 
zone  (bz)  was  sampled  at  only  10  points. 
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The  work  functions  calculated  by  Feibelman  [3]  for  the  ideal  Al(l  1 1)  films  began 
with  a  maximum  value  of  4.2  eV  for  the  monolayer,  dropped  steadily  to  a  minimum 
of  3.7  eV  for  the  3-layer,  and  then  rose  steadily  to  4.0  eV  in  the  6-layer.  For  the 
relaxed  films,  which  exhibit  slightly  expanded  outer-layer  spacings,  the  work  func¬ 
tions  again  began  at  a  monolayer  value  of  4.2  eV,  dropped  to  3.8  eV  for  the  3- 
layer,  and  then  remained  nearly  constant  for  the  4-  and  6-layer  films.  The  surface 
energies  of  the  relaxed  films  began  with  a  monolayer  value  of  0.40  eV/atom,  dropped 
to  an  exceptionally  small  value  (0. 1 8  eV/ atom)  for  the  2-layer,  jumped  back  up  to 
0.39  eV/atom  for  the  3-layer,  and  then  steadily  rose  to  0.52  eV/atom  in  the  6-layer. 
To  this  date,  the  prediction  [3]  of  a  significant  QSE  in  Al(  111)  films  has  never  been 
challenged.  However,  there  was  considerable  concern  at  the  time  that  the  quanti¬ 
tative  results  might  have  been  affected  seriously  by  the  various  approximations 
made  in  the  LCAO  code,  especially  the  use  of  GTO  fitting  functions  [3,4]. 

To  check  and  improve  on  the  results  of  Ref.  [3],  Feibelman  and  Hamann  [4] 
repeated  the  calculations  on  the  ideal  Al(  111)  films  (adding  the  5-layer  film)  using 
a  surface  linearized  augmented-plane-wave  (slapw)  method.  Like  the  LCAO  results, 
the  slapw  Al(  111)  work  function  begins  with  its  maximum  value  for  the  monolayer 
(4.74  eV)  and  drops  to  its  minimum  at  the  3-layer  (4. 10  eV).  However,  the  slapw 
work  function  then  jumps  up  to  4.43  eV  for  the  4-layer  followed  by  a  steady  decline 
to  4.31  eV  in  the  6-layer.  For  all  of  the  films  studied,  the  slapw  work  functions 
were  larger  than  the  corresponding  LCAO  results  by  0.3  to  0.6  eV.  This  rather  large 
quantitative  difference  was  attributed  to  errors  in  the  LCAO  results  due  to  the  charge 
fitting  procedure.  The  basic  problem  was  that  the  charge  fitting  produced  a  spurious 
negative  electron  density  in  the  far  vacuum  region,  thereby  reducing  the  dipole 
barrier  and  the  work  function.  Such  an  error  in  the  charge  density  clearly  would 
also  affect  the  surface  energies  of  the  films.  Unfortunately,  the  slapw  code  did  not 
calculate  the  total  energies  of  the  films  and  thus  could  not  be  used  to  study  the 
surface  energy  QSE. 

The  third  systematic  investigation  of  the  Al(  111)  films  was  conducted  by  Batra 
et  al.  [5]  using  norm  conserving  pseudopotentials  and  the  xc  model  of  Perdew  and 
Zunger  [12].  Batra  et  al.  calculated  the  work  functions  and  surface  energies  for  1-, 
3-,  5-,  and  7-layer  Al(  111)  films  using  the  repeated  slab  approximation.  As  in  the 
previous  studies  [3,4],  a  relatively  sparse  bz  mesh  was  used;  only  49  points  in  the 
entire  BZ.  For  a  uniformly  distributed  mesh,  this  would  correspond  to  only  8  points 
in  the  irreducible  part  of  the  bz.  Batra  et  al.  concluded  on  the  basis  of  analytical 
force  calculations  that,  contrary  to  the  earlier  results  [3],  geometry  relaxation  effects 
should  be  small.  For  this  reason,  the  work  functions  and  surface  energies  were  only 
calculated  for  the  ideal  geometries. 

The  Al(  111)  work  function  calculated  by  Batra  et  al.  [5]  ranges  from  a  maximum 
of  4. 1  eV  for  the  monolayer  to  a  minimum  of  3.2  eV  for  the  3-layer,  and  then 
steadily  rises  to  3.7  eV  for  the  7-layer.  Although  this  behavior  appears  to  be  in 
better  qualitative  agreement  with  the  results  of  Ref.  [3]  than  with  those  of  Ref.  [4], 
there  is  no  fundamental  disagreement  between  Refs.  [4]  and  [5],  since  the  slapw 
calculations  did  not  consider  the  7-layer  film  while  the  pseudopotential  calculations 
did  not  include  the  4-  or  6-layer  films.  In  addition,  the  surface  energies  found  in 
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the  pseudopotential  calculations  [5]  only  vary  between  0.44  and  0.50  eV/atom;  a 
considerably  smaller  range  than  was  found  with  the  LCAO  method  [3],  even  if  one 
ignores  the  exceptionally  small  surface  energy  predicted  for  the  2-layer  in  the  latter 
study. 


Methodology 


The  LCGTO—FF  Technique 

The  LCGTO-FF  technique  is  an  all-electron,  full-potential,  electronic  structure 
method  which  is  characterized  by  its  use  of  three  independent  GTO  basis  sets  to 
expand  the  orbitals,  charge  density,  and  lda  xc  integral  kernels  (here  Hedin- 
Lundqvist  [13]).  The  purpose  of  the  charge  fit  is  to  reduce  the  total  number  of 
integrals  by  ensuring  that  only  3-center  integrals  appear  in  the  total  energy  and 
one-electron  equations,  instead  of  the  usual  4-center  integrals.  The  xc  fit  acts  as  a 
sophisticated  numerical  quadrature  scheme  capable  of  producing  accurate  results 
on  a  rather  coarse  numerical  integration  mesh. 

The  current  implementation  of  the  lcgto-ff  method  in  the  program  FILMS 
was  discussed  recently  in  detail  [6].  Since  that  time,  there  have  been  a  number  of 
important  algorithmic  improvements  [8].  First,  the  multiple  expansions  used  in 
the  past  to  speed  up  the  evaluation  of  the  orbital-fitting  function-orbital  integrals 
have  been  replaced  by  a  generalized  Ewald  procedure  utilizing  mixed  real  and 
reciprocal  space  integration.  Thus,  convergence  of  the  lattice  sums  is  achieved  by 
splitting  the  Coulomb  integrals  into  long-range  and  short-range  parts.  A  further 
speedup  is  attained  by  performing  the  Coulomb  integrals  for  all  needed  /  and  m 
values  simultaneously  for  each  combination  of  GTO  exponents.  In  addition  to  these 
major  algorithmic  modifications,  numerous  changes  have  been  made  throughout 
FILMS  to  allow  vectorization. 

For  all  of  these  calculations,  the  BZ  integrals  are  performed  via  the  linear  triangle 
integration  scheme,  using  19,  37,  or  61  points  in  the  irreducible  triangle  of  the  BZ. 
Those  points  were  distributed  and  grouped  in  a  manner  that  ensures  a  more  rapid 
convergence  of  the  bz  integrals  than  is  usually  achieved  by  linear  triangle  integration; 
see  the  Appendix  of  Ref.  [14].  The  SCF  cycle  was  iterated  until  the  total  energy 
changed  by  less  than  1  per  atom. 

The  GTO  Basis  Sets 

As  was  noted  above,  considerable  effort  was  devoted  to  developing  compact  orbital 
and  fitting  function  basis  sets  for  Al.  To  this  end,  we  began  by  performing  reference 
calculations  on  the  1-,  2-,  and  3-layer  films  using  very  rich  basis  sets  and  the  37 
point  BZ  scan.  As  discussed  elsewhere  [15],  different  basis  sets  must  be  used  for 
atoms  in  the  interior  and  exterior  layers  of  a  utf  due  to  differences  in  their  envi¬ 
ronments. 

For  the  exterior  layers,  the  rich  orbital  basis  set  was  constructed  from  Huzinaga’s 
[16]  Uslp  atomic  basis  set  as  follows.  First,  the  exponent  of  the  most  diffuse  p- 
type  GTO  was  shifted  to  0.12.  The  basis  set  was  then  augmented  by  a  single  pz- type 
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GTO  with  an  exponent  of  0.05.  Finally,  a  flkype  polarization  function  with  an 
exponent  of  0.30  was  added.  To  reduce  the  size  of  the  calculations  slightly,  the  four 
tightest  5-type  and  three  tightest  p- type  GTOs  were  contracted  using  the  coefficients 
of  the  15  and  2 p  lda  atomic  orbitals.  The  orbital  basis  for  the  interior  layers  was 
obtained  from  the  exterior  layer  basis  by  shifting  the  exponent  of  the  most  diffuse 
5-type  GTO  to  0.10  and  eliminating  the  most  diffuse  pz- type  gto. 

For  the  reference  calculations,  the  same  10s3d2pz  gto  basis  set  was  used  for  both 
the  charge  and  xc  fit  functions  in  the  exterior  layers.  For  the  interior  layers,  the 
exponent  of  the  most  diffuse  5-type  function  was  increased  slightly  and  the  pz- type 
functions  were  removed  [17].  Overall,  these  large  orbital  and  fitting  function  basis 
sets  should  provide  high  quality  reference  results. 

To  develop  compact  orbital  basis  sets  for  A1  from  these  rich  basis  sets,  the  primitive 
GTOs  were  contracted  into  a  6s3p\d  plus  1  pz  basis  set  for  the  exterior  layers  and  a 
6s3p\d  basis  for  the  interior  layers.  These  compact  orbital  basis  sets  are  given  in 
Table  I.  The  sizes  of  the  charge  and  xc  basis  sets  were  reduced  by  eliminating  a 
number  of  the  largest  exponents,  and  adjusting  the  remaining  exponents.  Ultimately, 


Table  I.  Gaussian  exponents  (a)  and  contraction  coefficients  (C)  of  the  compact  orbital  basis  sets 
for  the  exterior  and  interior  layers  of  the  Al(  111)  films.  Horizontal  lines  separate  the  contractions. 


Type 

Exterior 

Interior 

a 

c 

a 

c 

s 

0.2349 1082e  +  05 

0.0010120 

0.2349 1082e  +  05 

0.0010120 

0.35475323e  +  04 

0.0076655 

0.35476323e  +  04 

0.0076655 

0.82347 199e  +  03 

0.0375355 

0.82347 199e  +  03 

0.0375355 

0.23767938e  +  03 

0.1355686 

0.23767938e  +  03 

0.1355686 

0.78602059e  +  02 

0.2614458 

0.78602059e  +  02 

0.2614458 

0.78602059e  +  02 

0.0717569 

0.78602059e  +  02 

0.0717569 

0.29049906e  +  02 

0.4328815 

0.29049906e  +  02 

0.4328815 

0.1 16238 12e  +  02 

0.1973914 

0.1 16238 12e  +  02 

0.1973914 

0.34653431e  +  01 

1.0000000 

0.34653431e  +  01 

1.0000000 

0.12331429e  +  01 

1.0000000 

0.12331429e  +  01 

1.0000000 

0.20181005e  +  00 

1.0000000 

0.20181005e  +  00 

1.0000000 

0.07804797e  +  00 

1.0000000 

O.lOOOOOOOe  +  00 

1.0000000 

P 

0. 1 4 1 5 1036e  +  03 

0.0112485 

0. 14 1 5 1036e  +  03 

0.0112485 

0.332 17629e  +  02 

0.0759425 

0.33217629e  +  02 

0.0759425 

0. 103930 14e  +  02 

0.2569316 

0.1 03930 14e  +  02 

0.2569316 

0.35925839e  +  01 

0.4541624 

0.35925839e  +  01 

0.4541624 

0.12421020e  +  01 

0.3852556 

0.12421020e  +  01 

0.3852556 

0.30402357e  +  00 

1.0000000 

0.30402357e  +  00 

1.0000000 

0.12000000e  +  00 

1.0000000 

0.12000000e  +  00 

1.0000000 

Pz 

0.05000000e  +  00 

1.0000000 

d 

0.30000000e  +  00 

1.0000000 

0.30000000e  +  00 

1.0000000 
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Table  II.  Gaussian  exponents  of  the  compact  charge  and  xc  basis  sets  for 
the  exterior  and  interior  layers  of  the  Al(  111)  films. 


Type 

Charge 

xc 

Exterior 

Interior 

Exterior 

Interior 

s 

900.00 

900.00 

80.00 

80.00 

240.00 

240.00 

16.00 

16.00 

80.00 

80.00 

3.80 

3.80 

27.00 

27.00 

1.00 

1.00 

9.00 

9.00 

0.27 

0.27 

3.00 

3.00 

0.08 

0.10 

0.90 

0.90 

0.27 

0.27 

0.08 

0.10 

d 

0.90 

0.90 

0.90 

0.90 

0.30 

0.30 

0.30 

0.30 

Pz 

1.00 

0.90 

0.30 

0.30 

the  charge  fit  basis  sets  were  reduced  to  a  9s2d2pz  basis  for  the  exterior  layers  and 
a  9s2d  basis  for  the  interior  layers.  The  xc  basis  set  was  reduced  in  a  similar  fashion 
to  a  6s2d2pz  basis  for  the  exterior  layers  and  a  6s2d  basis  set  for  the  interior  layers. 
These  compact  fitting  function  basis  sets  are  given  in  Table  II. 

Table  III  compares  cohesive  energies  per  atom  (Ec)  and  work  functions  (</>)  ob¬ 
tained  for  the  1-,  2-,  and  3-layer  Al(  111)  films  with  both  the  rich  and  compact  basis 
sets,  using  the  37  point  bz  mesh.  The  systematic  relative  error  (relative  with  respect 
to  variations  of  the  film  thickness)  in  both  quantities  should  be  much  larger  for 
these  three  films  than  for  all  subsequent  films  because  they  encompass  two  important 
thickness-dependent  changes  in  the  basis  sets:  1)  the  inclusion  of  pz-typc  fitting 
functions  for  n  >  2,  and  2)  the  use  of  interior  basis  sets  for  n  >  3.  For  films  with 
n  >l  3,  the  basis  set  induced  errors  in  Ec  and  <f>  will  both  converge  rapidly  to  constant 


Table  III.  Comparison  of  cohesive  energies  ( Ee )  and  work  functions  (0) 
obtained  with  the  rich  and  compact  basis  sets  (using  the  37  point  bz  mesh) 
for  the  1 -layer,  2-layer,  and  3-layer  A  1(1 1 1)  films;  in  eV. 


Ec 

Rich 

Compact 

Rich 

Compact 

1 - layer 

2- layer 

3- layer 


3.100 

3.626 

3.816 


3.078 

3.605 

3.787 


4.908 

4.685 

4.386 


4.873 

4.636 

4.372 
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offsets  and  should  have  virtually  no  influence  on  the  layer-dependent  oscillations 
which  are  the  focus  of  the  present  study.  The  cumulative  effect  of  all  of  the  basis 
set  reductions  is  a  lowering  of  Ec  by  2 1  to  29  meV  and  of  0  by  14  to  50  meV .  Since 
these  shifts  all  have  the  same  sign,  the  relative  imprecisions  are  only  8  meV  for  Ec 
and  35  meV  for  <f>.  For  films  with  n  >  3,  these  relative  imprecisions  in  Ec  and  <j> 
should  be  no  more  than  5  meV  and  20  meV,  respectively. 

For  all  of  the  calculations  presented  in  the  next  section,  an  additional  size  re¬ 
duction  was  achieved  by  constraining  each  of  the  six  most  local  charge  fitting  func¬ 
tions  to  have  the  same  coefficient  on  all  sites.  This  intersite  core  contraction  scheme 
forces  that  portion  of  the  density  nearest  to  the  nucleus  to  be  identical  for  all  of 
the  A1  atoms  in  the  film.  Similarly,  the  three  most  local  XC  fitting  functions  were 
also  site  contracted.  In  test  calculations  on  the  Al(  111)  4-layer  film,  this  intersite 
core  contraction  scheme  changed  the  total  energy  per  atom  and  the  work  function 
by  much  less  than  1  meV. 

The  calculations  reported  here  were  carried  out  on  a  SUN  SPARC  ELC  work¬ 
station  with  only  250  Mbytes  of  swap  space,  and  required  less  than  500  Mbytes  of 
disk  file  space.  Thus,  the  present  work  provides  a  demonstration  of  the  type  of 
calculations  of  which  FILMS  is  capable  in  a  small  scale  computing  environment. 

Results 

The  total  energies  and  work  functions  for  ideal  fee  Al(l  1 1)  films  ranging  from 
one  to  seven  layers  thick  were  calculated  using  the  19  point  BZ  mesh.  Those  cal¬ 
culations  were  repeated  for  the  one  to  six  layer  films  using  the  37  point  BZ  mesh. 
(The  7-layer  calculation  with  the  37  point  mesh  required  more  memory  than  was 
available).  The  cohesive  energies  per  atom  were  then  determined  relative  to  a  spin- 
polarized  LDA  atomic  energy  (—482.596608  Ry)  calculated  without  the  use  of  fitting 
functions.  The  cohesive  energies,  work  functions,  and  widths  (W)  of  the  occupied 
bands  found  here  for  the  Al(l  1 1)  films  are  listed  in  Table  IV. 

Comparing  the  results  in  Table  IV  obtained  with  the  two  bz  meshes  reveals 
several  interesting  trends.  First,  the  mesh  induced  shifts  in  Ec  all  have  the  same 


Table  IV.  Comparison  of  cohesive  energies  ( Ec ),  work  functions  ($),  and  widths  (W)  of  the  occupied 
bands  obtained  with  the  19  and  37  point  BZ  scans  for  the  1  -►  7  layer  thick  A  1(11 1)  films;  in  eV. 


1 - layer 

2- layer 

3- layer 

4- layer 

5 - layer 

6- layer 

7- layer 


Ec 


W 


19  pt. 

37  pt. 

19  pt. 

37  pt 

19  pt. 

37  pt. 

3.021 

3.078 

4.847 

4.873 

8.09 

8.10 

3.580 

3.605 

4.700 

4.636 

10.14 

10.09 

3.778 

3.787 

4.498 

4.372 

10.65 

10.76 

3.818 

3.831 

4.265 

4.323 

10.97 

10.94 

3.870 

3.880 

4.583 

4.553 

11.07 

11.05 

3.905 

3.915 

4.483 

4.482 

11.08 

11.08 

3.926 

4.487 

11.14 

682 


BOETTGER  ET  AL. 


sign  with  the  largest  change  being  57  meV  for  the  1 -layer.  Second,  for  the  films 
with  n  >  3,  the  change  in  Ec  is  nearly  constant  and  is  less  than  1 5  meV.  These 
results  suggest  that  for  the  37  point  mesh,  the  relative  imprecision  in  Ec  due  to  the 
BZ  integration  should  be  no  more  than  5  meV  for  ^-layers  with  n  >  3,  and  no  more 
than  50  meV  for  all  of  the  films.  Combining  this  result  with  the  basis  set  error 
suggests  a  total  relative  error  in  Ec  of  no  more  than  10  meV  for  n  >  3.  Again,  the 
absolute  error  should  approach  a  constant  offset  for  large  n. 

The  work  function  is  more  strongly  influenced  by  the  bz  mesh  density,  with  the 
largest  shift  being  126  meV  for  the  3-layer.  Furthermore,  the  sign  of  the  shift  in  <f> 
is  not  the  same  for  all  of  the  films.  Thus,  the  relative  error  in  4>  is  greater  than  the 
absolute  error.  To  better  assess  the  influence  of  the  BZ  mesh  on  the  workfunction, 
the  3-layer  calculation  (the  worst  case)  was  repeated  using  a  6 1  point  bz  mesh.  That 
calculation  produced  Ec  =  3.790  eV,  <f>  =  4.393  eV,  and  W  =  10.69  eV.  These 
results  indicate  that  </>  is  converged  to  within  30  meV  for  the  37  point  mesh.  As¬ 
suming  that  the  sign  of  the  error  in  </>  still  varies  and  that  the  error  is  greatest  for 
the  3-layer,  the  relative  error  in  <£  due  to  the  BZ  mesh  should  be  no  more  than 
about  50  meV.  Combining  this  result  with  the  basis  set  error  implies  a  total  relative 
imprecision  of  no  more  than  75  meY  over  the  full  range  of  film  thicknesses  con¬ 
sidered  here,  with  most  of  that  imprecision  being  due  to  the  bz  mesh.  Given  the 
fact  that  even  the  19  point  mesh  used  here  is  denser  than  those  used  in  the  previous 
studies,  the  present  results  should  be  more  reliable  than  the  earlier  results. 


Figure  1 .  Calculated  work  functions  vs.  the  number  of  layers  ( n )  for  the  lcgto-ff  cal¬ 
culations  with  37  bz  points  (solid  line)  and  19  bz  points  (dashed  line).  Also  shown  are  the 
results  from  Ref.  [3]  (dash-dot  line),  Ref.  [4]  (dash-triple  dot  line),  and  Ref.  [5]  (dotted 
line).  The  experimental  result  from  Ref.  [18]  is  indicated  by  a  horizontal  line. 
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Figure  1  compares  the  work  functions  listed  in  Table  IV  with  those  found  in  the 
previous  studies  [3-5].  The  experimental  work  function  for  the  Al(lll)  surface 
(4.24  eV)  [18]  is  indicated  by  a  horizontal  line.  The  five  curves  in  Figure  1  can  be 
divided  into  two  groups;  three  curves  tightly  clustered  above  the  measured  surface 
value  and  two  located  well  below.  The  latter  two  curves  (from  Refs.  [3]  and  [5]) 
give  no  indication  that  <f>  is  converging  with  increasing  n.  In  contrast,  the  current 
results  for  both  bz  meshes  and  the  slapw  results  from  Ref.  [4]  are  all  clearly 
converging  as  n  increases. 

The  present  calculations  predict  that  <f>  steadily  decreases  for  n  =  1  -►  4,  jumps 
up  between  n  =  4  and  5,  and  then  steadily  decreases  for  n  =  5  ->  7.  The  slapw 
calculations  [4]  predict  a  similar  behavior,  but  locate  the  jump  in  <j>  between  n  = 

3  and  4.  There  are  at  least  two  methodological  differences  between  the  slapw 
calculations  and  the  present  lcgto-ff  calculations  that  could  be  responsible  for 
this  rather  minor  disagreement.  First,  different  xc  models  were  used  in  the  two 
calculations  (hl  here  and  Wigner  in  Ref.  [4]).  Alternatively,  the  results  in  Table 
IV  indicate  that  the  values  of  0  at  n  =  3  and  4  are  particularly  sensitive  to  the  bz 
mesh  density.  Since  Ref.  [4]  only  used  a  10  point  mesh  vs.  the  19  and  37  point 
meshes  used  here,  it  may  be  that  the  order  of  the  two  points  was  reversed  in  that 
study. 

The  surface  energies  for  the  Al(  111)  films  were  determined  here  using  an  approach 
suggested  in  Ref.  [19].  First,  define  the  incremental  energy  for  an  TV-layer  film  as 

AE(N)  =  En-  £V-i  ,  (!) 

where  EN  is  the  total  energy  per  cell  of  the  TV-layer.  Then,  assuming  there  is  only 
one  atom  per  layer  in  the  film  unit  cell,  the  surface  energy  per  atom  of  any  «-layer 
with  n  <  TV  can  be  approximated  as, 

Es(n)  =  xh  X  [En  —  n  X  AE(N)]  .  (2) 

In  the  limit  of  large  TV,  AE(N)  will  converge  to  a  value  for  the  bulk  energy  per  atom 
that  is  consistent  with  the  film  calculation  and  ^(TV)  will  converge  to  the  semi¬ 
infinite  surface  energy.  Any  attempt  to  replace  AE  in  Eq.  (2)  with  an  independently 
calculated  bulk  energy  (Eb)  will  introduce  an  error  in  Es  that  increases  linearly 
with  n  [19]. 

Table  V  shows  the  incremental  energies  obtained  for  the  Al(  111)  films  using 
both  bz  meshes,  relative  to  the  free  atom  energy.  The  bulk  binding  energy  of  A1 
reported  by  Moruzzi,  Janak,  and  Williams  [20]  is  also  given.  By  the  7-layer,  the 
incremental  energy  has  converged  to  about  —4.05  ±  0.05  eV.  This  result  is  in 
reasonable  agreement  with  the  bulk  binding  energy  of  -3.88  eV,  given  the  signif¬ 
icantly  different  computational  method  used  in  Ref.  [20].  The  minimum  at  n  =  4 
is  again  striking. 

Table  VI  compares  the  surface  energies  obtained  for  the  37  point  bz  mesh  using 
TV  =  6  in  Eq.  (2)  with  those  obtained  for  the  19  point  mesh  using  TV  =  6  and  7  in 
Eq.  (2).  Also  shown  in  Table  VI  is  a  value  of  Es  (0.51  eV/atom)  deduced  from  the 
surface  tension  of  the  liquid  metal  [21].  Two  features  are  immediately  apparent  in 
Table  VI.  First,  the  change  in  the  BZ  scan  density  has  only  a  minor  effect  on  Es. 
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Table  V.  The  incremental  energies  (AE) 
obtained  with  the  1 9  and  37  point  BZ  scans, 
relative  to  the  atomic  energy;  in  eV.  The 
bulk  binding  energy  of  A1  from  Ref.  [20] 
is  also  listed. 


A E 

19  pt. 

37  pt. 

1 -layer 

-3.021 

-3.078 

2-layer 

-4.138 

-4.131 

3-layer 

-4.174 

-4.153 

4-layer 

-3.937 

-3.962 

5-layer 

-4.079 

-4.076 

6-layer 

-4.080 

-4.090 

7-layer 

-4.055 

Bulk 

-3.878 

Second,  the  relatively  small  difference  between  AZs(6)  and  AE(7)  produces  a  sub¬ 
stantial  change  in  Es  vs.  n.  This  sensitivity  to  small  changes  in  A E  is  due  to  the 
linear  increase  of  the  error  in  Es  as  a  function  of  n  [19].  For  the  remainder  of  this 
discussion  only  the  19  point  results  will  be  considered. 

Figure  2  compares  Es  values  obtained  here  using  TV  =  6  and  7  with  the  results 
from  Refs.  [3]  and  [5];  recall  that  no  values  of  Es  were  given  in  Ref.  [4].  Again,  the 
curves  from  Refs.  [3]  and  [5]  show  no  sign  of  converging  as  n  increases.  This  lack 
of  convergence  may  be  due  to  the  fact  that  Refs.  [3]  and  [5]  both  relied  on  inde- 


Table  VI.  Surface  energies  (Es)  obtained  with  the  1 9  point 
bz  scan  for  TV  =  6  and  7  compared  to  values  obtained  with 
the  37  point  bz  scan  for  N  =  6;  in  eV/surface-atom.  Also 
shown  is  the  experimental  value  deduced  by  Ref.  [21]. 


19  pt. 

37  pt. 

N=1  N=  6 

N=  6 

1 -layer 

0.517 

0.529 

0.506 

2-layer 

0.475 

0.501 

0.486 

3-layer 

0.415 

0.454 

0.454 

4-layer 

0.474 

0.525 

0.519 

5-layer 

0.462 

0.526 

0.526 

6-layer 

0.449 

0.526 

0.526 

7-layer 

0.449 

Expt. 

0.51 
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Figure  2.  Calculated  surface  energies  vs.  the  number  of  layers  n  are  shown  for  the  19  k- 
point  lcgto-ff  calculations  using  N  ~  7  (solid  line)  and  N  =  6  (dashed  line).  Also  shown 
are  the  results  from  Ref.  [3]  (dash-dot  line)  and  Ref.  [5]  (dotted  line).  The  experimental 
result  from  Ref.  [21]  is  indicated  by  a  horizontal  line. 


pendently  calculated  bulk  energies  to  find  Es.  In  contrast,  the  present  results  for  is, 
are  based  on  incremental  energies  and  both  curves  show  a  clear  convergence  with 
increasing  n ;  although  not  to  the  same  value. 

Comparison  of  the  two  curves  in  Figure  2  obtained  here  using  N  =  6  and  7 
clearly  illustrates  the  linear  divergence  caused  by  shifts  in  A E  [19].  However,  this 
divergence  does  not  alter  the  qualitative  behavior  of  Es.  There  is  an  initial  decrease 
in  Es  for  n  =  1  — ►  3,  followed  by  an  abrupt  increase  between  n  =  3  and  4,  and  then 
a  very  slow  decrease  for  n  =  4  -►  7.  Although  it  is  not  entirely  clear  what  the 
converged  value  of  Es  would  be  for  very  large  N  since  AE  has  not  yet  stabilized,  it 
is  reassuring  that  the  “experimental”  surface  energy  lies  between  the  two  results 
calculated  here. 


Conclusions 

A  significant  QSE  has  been  found  in  both  <j>  and  Es  for  Al(  111)  films  ranging  from 
one  to  seven  layers  thick.  The  present  results  for  (f>  vs.  n  are  in  excellent  qualitative 
agreement  with  the  previous  slapw  calculations  [4],  The  only  noteworthy  difference 
between  the  lcgto-ff  and  slapw  results  is  the  location  of  the  abrupt  increase  in 
<f>  predicted  in  both  calculations.  On  the  other  hand,  the  present  results  for  the  QSE 
in  both  <f)  and  Es  differ  significantly  from  the  lcao  results  of  Ref.  [3]  and  the 
pseudopotential  results  of  Ref.  [5].  The  current  results  should  be  more  reliable  than 
all  previous  results. 


686 


BOETTGER  ET  AL. 


In  spite  of  the  QSE  in  <f>  and  Es,  both  quantities  are  clearly  converging  as  n 
increases.  However,  it  is  not  clear  that  the  region  of  significant  QSE  has  been  passed 
even  for  n  =  7.  The  overall  impact  of  the  QSE  on  the  work  function  should  be  no 
more  than  0.3  eV  for  n  >  7.  The  QSE  in  Es  should  be  even  smaller,  no  more  than 
0.05  eV  for  n>l.  However,  even  for  the  7-layer  film,  the  calculated  surface  energy 
is  still  being  affected  by  small  fluctuations  in  the  incremental  energy,  AE(N).  If 
solid  surface  modelling  is  desired,  elimination  of  that  particular  variation  in  Es 
might  require  the  use  of  much  thicker  films. 
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Abstract 

High-Tc  superconductors  (htscs)  are  usually  obtained  by  doping  electron  donors  or  acceptors  into 
parent  materials.  The  actual  role  played  by  doping  is  still  uncertain  with  various  interpretations.  The 
present  electronic  structure  study  provides  some  hints  which  may  help  to  solve  the  mystery.  ©  1994  John 
Wiley  &  Sons,  Inc. 


Introduction 

Muller  and  Bednorz  [1]  synthesized  the  first  htsc  by  doping  some  Ba+2  to  replace 
some  of  the  La+3  in  the  parent  material  La2Cu04.  This  process  is  considered  to 
dope  holes  into  the  half-filled  band  structure  of  the  parent  material.  The  analogous 
electron  doping  can  also  lead  to  HTSC  [2],  but  for  simplicity,  hereafter  only  hole 
doping  will  be  considered. 

The  synthesis  of  the  known  HTSCs,  especially  the  simplest  SrCu02  based  HTSC 
[2],  reveal  that  the  Cu02  layer(s)  must  be  the  responsible  component.  Indeed,  elec¬ 
tronic  structure  studies  showed  that  the  active  band,  i.e.,  the  half-filled  band,  is  just 
the  in-plane  Cn-d  O-p  antibonding  band  [3].  However,  it  is  still  not  very  clear 
what  role  doping  really  plays. 

There  have  been  many  interpretations  and  speculations  about  the  effect(s)  that 
doping  causes.  A  typical  theory  [4]  starts  from  the  Mott  insulating  state  [5]  of  the 
parent  material  with  a  half-filled  electronic  band.  Due  to  the  strong  electronic  cor¬ 
relations  in  the  half-filled  band,  electrons  become  very  localized  with  one  electron 
per  site  and  with  only  magnetic  coupling  between  them.  The  parent  material  is  a 
poor  conductor,  although  it  should  not  be  so  with  a  partially  occupied  valence 
band,  according  to  traditional  band  theory.  As  holes  are  doped  in,  the  valence  band 
is  brought  away  from  half-filling,  and  the  material  becomes  conducting.  It  is  believed 
[4]  that  high-Tc  superconductivity  arises  from  the  Bose  condensation  of  the  holes. 

Such  theories  may  be  internally  consistent  within  a  model  itself,  but  they  cannot 
explain  a  large  number  of  experimental  facts.  However,  they  bring  out  very  inter- 
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esting  ideas,  which  has  resulted  in  many  studies  on  the  Hubbard  model  [6],  and 
variants  thereof. 

In  the  next  section,  the  method  and  the  system  studied  are  first  briefly  intro¬ 
duced,  and  the  results  are  presented.  Discussion  on  the  obtained  results  and  their 
validity  are  given  in  the  third  section.  Finally,  in  the  last  section,  concluding  remarks 
are  made. 


Method  and  Results 

In  order  to  find  out  what  the  effect  the  doped  holes  have  in  the  material,  a 
Hartree-Fock  (HF)  electronic  structure  study  with  CRYSTAL-88  [7]  was  carried 
out  for  a  two-sided  symmetric  slab,  which  mimics  the  Cu02  layer  of  the  La2Cu04 
crystal.  It  has  [8]  the  space  group  T>\h  (P4/mmm),  No.  123,  lattice  constants  a  = 
3.787  A  and  c  =  13.288  A,  and  atom  positions  Cu(0.0,  0.0,  0.0),  Op(0.0,  1.894, 
0.0),  Lad  1.894,  1.894,  1.852),  Oo(0.0,  0.0,  2.428),  La2(0.0,  0.0,  4.792),  where 
and  00  designate  in-plane  and  out-of-plane  oxygens,  respectively. 

The  slab  under  investigation  has  the  unit  cell  configuration  Cu02[Na+1]2.  Here 
[Na+1]  is  the  + 1  sodium  pseudopotential.  Each  is  placed  at  the  position  correspond¬ 
ing  to  Laj  in  the  La2Cu04  crystal  as  the  electron  donor,  and  each  contributes  one 
electron  to  the  Cu02  unit  cell  so  that  the  formal  electronic  configuration  is  Cu- 
3 d9  0-2 p6,  in  which  the  highest  occupied  band  is  the  half-filled  Cu -d  O-p  cr-an- 
tibonding  band,  i.e.,  band,  corresponding  to  the  previously  mentioned  one  for 
the  parent  material  in  the  nonsuperconducting  phase. 

For  the  present  study,  a  He-core  pseudopotential  for  O  [9],  an  Ar-core  pseudo¬ 
potential  for  Cu  [10],  and  the  Na+1  pseudopotential  [9],  are  used.  For  Cu-3d  valence 
orbitals,  triple  $  Gaussian  Orbitals  (TZ  GTOs)  are  used  with  double  f  (DZ)  4s  4p 
basis  [11]  also  included.  For  O,  a  DZ  gto  basis  for  the  2p  valence  orbitals  are 
employed  including  a  polarization  d-function  basis  with  an  exponent  equal  to  that 
of  the  outer  p  function  [9]. 

To  simulate  the  doping,  the  hole  concentration  can  be  modulated  by  taking 
electrons  out  of  the  Cu02  plane  and  putting  them  on  the  out-of-plane  Na+I  centers. 
In  this  way,  ab  initio  self-consistent  computations  for  an  arbitrary  in-plane  hole 
concentration  can  be  made  without  employing  super-cells  or  the  Rigid-Band  Ap¬ 
proximation  [12].  The  reason  the  Lai  position  was  considered  instead  of  the  out- 
of-plane  00  position  was  that  the  00  positions  are  further  away  from  the  Cu02 
plane  and  known  to  be  far  less  important  for  the  in-plane  electronic  structures  [13]. 
Moreover,  the  Na+1  centers  will  be  used  as  hole  modulators  as  described  below, 
which  is  the  function  of  the  La  atoms. 

The  electronic  structure  computation  was  carried  out  for  a  number  of  hole  con¬ 
centrations.  In  Figure  1 ,  the  total  density  of  states  (doss)  for  hole  concentrations 
0.0, 0.352, 0.425, 0.505,  and  0.745  holes-per-Cu02-unit-cell  are  given,  respectively. 
It  can  be  seen  clearly  that  there  is  a  bump  in  the  total  DOS  in  the  neighborhood  of 
the  Fermi  energy  (ej),  which  is  of  an  amplitude  twice  as  large  as  that  in  its  vicinity. 
It  moves  from  below  e/to  above  e/with  increasing  hole  concentration. 


Energy  (Ry,  Ef=0.0) 

Figure  1 .  The  total  doss  for  the  Cu02  plane  with  hole  dopings,  0.0,  0.352,  0.425,  0.505, 
and  0.745  holes/(unit  cell),  respectively.  The  doss  of  the  localized  nonbonding  Cu  -d  states 
have  not  been  included  because  they  change  little  with  the  different  dopings  and  are  also 
too  far  below  c/and  less  important  for  the  properties  considered  here.  However,  it  is  such 
states  that  differentiate  the  HF  computations  from  the  corresponding  lda  ones. 
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Discussion 

According  to  the  BCS  theory  [14],  once  there  is  an  attractive  potential  between 
two  electrons  (in  bcs  it  was  due  to  phonons)  around  ^electrons  may  pair  together, 
and  the  superconducting  state  occurs.  The  superconducting  temperature,  Tc,  can 
be  expressed  as 

Tc  =  a*e~b/n(e'\  (1) 

where  a  =  1.140#,  in  which  0#  =  ( h<x>)fkB  is  the  average  phonon  energy  scaled  in 
temperature  units,  n(e/)  is  the  average  total  DOS  around  the  Fermi  energy  (e/),  and 
b  is  proportional  to  the  inverse  of  V,  the  average  energy  of  the  electron-quasiparticle 
interaction,  which  resulted  in  the  pairing  of  electrons  that  in  turn  led  to  the  super¬ 
conductivity.  Tc  therefore  increases  with  increasing  n(c/).  A  schematic  representation 
of  changes  of  Tc  with  varying  n {ej)  is  given  in  Figure  2.  It  is  obvious  that  the  n(e/) 
changes  with  hole  doping  obtained  in  the  present  study  can  result  in  large  changes 
in  Tc. 

The  BCS  theory  may  not  fully  explain  high-Tc  superconductivity.  However,  the 
magnetic  flux  quantum  experiment  [15]  showed  that  electrons  in  the  new  htscs 
are  still  paired,  so  that  if  0#  is  simply  considered  as  an  average  energy  scaled  in 
temperature  units,  and  V  is  of  the  corresponding  origin,  Eq.  (1)  should  still  indicate 
the  right  trends. 
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Figure  2.  The  schematic  representation  of  Tc  =  a  *  e~b,n{cf)  changes  with  varying  n(ej ).  The 
parameters  are  chosen  to  be  within  the  range  of  the  realistic  values  of  conventional  su¬ 
perconductors  (a  =  1000,  b  =  35),  and  to  give  conventional  Te’s. 
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The  quantities  in  Eq.  (1)  are  of  the  orders  of  magnitude  Tc  ~  101,  ~  103, 

and  ~  10-2,  so  that  a  doubled  magnitude  of  the  DOS  in  the  neighborhood 

of  e/will  result  in  a  Tc  roughly  one  order  of  magnitude  larger,  according  to  the  BCS 
theory,  which  means  that  a  low  Tc  superconductor  will  be  turned  into  a  htsc! 

If  the  present  result  is  compared  with  experiment,  the  consistency  becomes  more 
obvious.  Recall  Figure  1.  of  Ref.  [3],  i.e.,  the  phase  diagram  of  the  htscs  under 
different  dopings.  Beyond  a  threshold  doping  level,  Tc  increases  with  increasing 
hole  concentration  until  a  maximum  is  reached,  and  then  decreases  as  the  hole 
concentration  continuously  increases  until  Tc  drops  to  zero. 

These  experimental  results  can  be  understood  from  the  point  of  view  of  the 
present  computations:  the  high  bump  in  the  total  DOS  shifts  with  the  hole  concen¬ 
tration  x.  As  x  increases,  the  bump  moves  towards  e/from  below.  At  a  value  of  x 
such  that  the  bump  is  sufficiently  close  to  e/9  there  is  the  onset  of  superconductivity. 
As  x  continues  to  increase,  the  bump  moves  further  until  it  is  right  at  the  e/,  which 
corresponds  to  the  maximum  Tc  in  Figure  1 .  of  Ref.  [3].  As  x  changes  further  from 
that  point,  the  bump  begins  to  move  away  from  efi  which  corresponds  to  the  drop 
of  Tc  after  the  maximum.  After  the  DOS  bump  moves  sufficiently  beyond  £/,  the 
superconductivity  fades  out  again. 

From  the  band  structure,  it  is  easily  seen  that  the  DOS  bump  near  ef  is  mainly 
from  the  weakly  dispersed  portion  of  the  partially  filled  band(s)  around  X/Y  points. 
So  when  the  bump  is  right  at  e/,  the  Fermi  surface  becomes  nesting  in  the  X-Y 
plane.  The  band,  which  is  completely  empty  in  the  zero  hole  case,  and  becomes 
increasingly  occupied  with  increasing  hole  concentration  is  the  hole  modula¬ 
tor  band. 

It  should  be  mentioned  that  a  peak  in  the  computed  DOS  of  La2Cu04  was  noticed 
earlier,  and  an  estimate  about  the  shift  of  the  peak  with  the  hole  concentration 
change  was  made  under  the  Rigid-Band  Approximation  [16].  However,  the  con¬ 
tinuous  movement  of  the  peak  driven  by  the  changing  hole  concentration  shown 
above  suggests  that  the  doping  induced  DOS  enhancement  at  the  Fermi  energy  is 
possibly  responsible  for  the  high-Tc  superconductivity. 

Some  comments  should  be  made  regarding  the  use  of  the  hf  method  [17].  Unlike 
the  local  density  approximated  density  functional  method  (lda)  [18,19],  it  does 
not  use  local  approximations  on  the  exchange(-correlation)  potential.  The  corre¬ 
sponding  single  determinantal  wavefunction  is  a  clearly  defined  quantity  which 
takes  account  of  the  different  nodal  structures  of  the  different  orbitals  via  its  nonlocal, 
exact  exchange  term.  In  principle,  it  is  known  what  is  missing  from  the  hf  state 
and  how  it  can  be  improved  by  introducing  correlations  either  through  perturbative 
or  variational  procedures  [20].  As  a  result,  HF  calculations  can  be  used  as  a  good 
reference  for  other  computations.  In  fact,  in  the  original  Kohn-Sham  paper  [18], 
a  variant  of  the  lda  scheme  was  proposed  with  the  hf  state  as  the  reference.  It  is 
also  known  that  the  hf  density  is  accurate  up  to  the  second  order  [21]. 

However,  it  is  well-known  that  due  to  the  uncompensated  nonlocal  exchange 
term  under  the  hf  approximation  (or  to  say  the  unscreened  long  range  Coulomb 
interaction)  calculated  hf  band  structures  for  metals  are  about  twice  as  wide  as 
those  from  experiment,  as  are  band  gaps  for  insulators  and  semiconductors.  What 
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is  more  alarming  is  that  for  a  metallic  system  with  partly  filled  band  at  e/the  derivative 
of  the  band,  de/dk,  becomes  logarithmically  infinite.  This  divergence  will  affect  the 
DOS  through 


n{c)  oc  f  (2) 

|  dk 

and  result  in  a  vanishing  n(ej)  [22].  However,  all  this  is  due  to  the  divergence  at 
k  =  0  of  47re2/&2,  which  is  the  Fourier  transform  of  the  Coulomb  potential  e2/r.  It 
is  just  a  reflection  of  the  unscreened  long  range  Coulomb  interaction  in  the  hf 
scheme.  The  part  of  the  interaction  integrals/lattice  summations  which  result  in 
the  so  called  hf  pathology  were  identified  in  the  works  given  in  Ref.  [23]. 

Nevertheless,  the  interaction  in  reality  is  screened,  and  with  the  form 

V(r)  oc  e2{f-pj  ,  (3) 

according  to  the  Random  Phase  Approximation  (rpa)  [24],  which  takes  care  of 
the  zero  point  plasmon  fluctuations,  or  the  long-range  correlations.  (This  form  of 
potential  is  often  called  the  Yukawa  potential.) 

From  the  form  of  the  screened  potential,  it  is  clear  that  it  results  in  a  rapid 
decrease  of  the  interaction  beyond  a  distance  l/ko  in  metallic  systems,  i.e.,  the 
interaction  becomes  short  range,  instead  of  long  range. 

In  the  numerical  scheme  [17]  of  solid  state  hf  computations,  the  interaction 
integral  and  lattice  summation  are  carried  out  only  up  to  a  cut-off  distance,  not  to 
infinity.  This  is  a  numerical  approximation,  but  it  has  partially  compensated  the 
error  from  the  hf  approximation,  eliminated  the  unphysical  divergence,  and  thus 
removed  the  vanishing  at  e/from  the  computed  DOS.  Essentially,  the  cut-off  of  the 
interaction  integral/lattice  summation  can  be  considered  as  equivalent  to  replacing 
the  rpa  screening  factor  e~^r  by  a  step  function  1  —  @(r  —  \/kf).  With  this  com¬ 
parison,  the  nonvanishing  DOS  at  ef  becomes  straightforward  to  understand,  and 
the  reliability  of  HF  calculations  is  increased,  too. 

Nevertheless,  another  important  fact  is  that  in  the  presence  of  strong  correlations, 
as  in  transition  metal  oxides  with  quite  localized  d  electrons,  which  includes  the 
parent  materials  of  htscs,  the  lack  of  correlation  in  the  hf  scheme  will  lead  to 
complete  failure  in  explaining  Mott  insulating  [5]  behavior.  Fortunately,  the  systems 
under  investigation  here  are  the  normal  state  of  the  htsc  phase  which  is  already 
out  of  the  Mott  insulating  range.  So  except  for  the  band  width  and  band  gap  over¬ 
estimation,  the  result  should  be  generally  correct  and  give  qualitatively  right 
predictions. 


Conclusions 

From  the  results  shown  above,  it  seems  that  the  initial  doping  causes  the  parent 
material  to  become  metallic.  Then  increased  doping  moves  ef  towards  the  DOS 
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maximum  and  the  onset  of  high-Tc  superconductivity.  Then  further  dopings  even¬ 
tually  destroy  HTSC  again  and  return  the  normal  state. 
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(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Solid-State 
Theory  and  Quantum  Statistics) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  3 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1977  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  1 1 

(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Solid-State 
Theory,  Collision  Phenomena,  and  Computational  Methods) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  4 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1978  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  12 

(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Solid-State 
Theory,  Collision  Phenomena,  and  Computational  Methods) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  5 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1979  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  13 

(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Solid-State 
Theory,  Collision  Phenomena,  Quantum  Statistics,  and  Computational  Methods) 
QUANTUM  BIOLOGY  SYMPOSIUM  NO.  6 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1980  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  14 

(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Solid-State 
Theory,  Collision  Phenomena,  Quantum  Statistics,  and  Computational  Methods) 
QUANTUM  BIOLOGY  SYMPOSIUM  NO.  7 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1981  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  1 5 

(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Solid-State 
Theory,  Collision  Phenomena,  and  Computational  Quantum  Chemistry) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  8 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1982  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  16 

(Proceedings  of  the  International  Symposium  on  Quantum  Chemistry,  Theory  of  Con¬ 
densed  Matter,  and  Propagator  Methods  in  the  Quantum  Theory  of  Matter) 
QUANTUM  BIOLOGY  SYMPOSIUM  NO.  9 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1983  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  17 

(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Solid-State 
Theory,  Collision  Phenomena,  and  Computational  Quantum  Chemistry) 


QUANTUM  BIOLOGY  SYMPOSIUM  NO.  10 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 


1984  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  18 

(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Solid-State 
Theory,  and  Computational  Quantum  Chemistry) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  1 1 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1985  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  19 

(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Solid-State 
Theory,  Scattering  Problems,  Many  Body  Phenomena,  and  Computational  Quantum 
Chemistry) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  12 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1986  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  20 

(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Solid-State 
Theory,  Scattering  Problems,  Many  Body  Phenomena,  and  Computational  Quantum 
Chemistry) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  13 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1987  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  21 

(Proceedings  of  the  International  Symposium  on  Quantum  Chemistry,  Solid-State  Theory, 
and  Computational  Methods) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  14 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1988  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  22 

(Proceedings  of  the  International  Symposium  on  Quantum  Chemistry,  Solid-State  Theory, 
and  Computational  Methods) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  15 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1989  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  23 

(Proceedings  of  the  International  Symposium  on  Quantum  Chemistry,  Solid-State  Theory, 
and  Molecular  Dynamics) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  16 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1990  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  24 

(Proceedings  of  the  International  Symposium  on  Quantum  Chemistry,  Solid  State  Physics, 
and  Computational  Methods) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  17 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 


1991  QUANTUM  CHEMISTRY  SYMPOSIUM  NQ.  25 

(Proceedings  of  the  International  Symposium  on  Quantum  Chemistry,  Solid  State  Physics, 
and  Computational  Methods) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  18 

(Proceedings  of  the  International  Symposium  on  Quantum  Biology  and  Quantum  Phar¬ 
macology) 

1992  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  26 

(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Condensed  Matter 
Theory  and  Computational  Methods) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  19 

(Proceedings  of  the  International  Symposium  on  the  Application  of  Fundamental  Theory 
to  Problems  of  Biology  and  Pharmacology) 

1993  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  27 

(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Condensed  Matter 
Theory  and  Computational  Methods) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  20 

(Proceedings  of  the  International  Symposium  on  the  Application  of  Fundamental  Theory 
to  Problems  of  Biology  and  Pharmacology) 

1994  QUANTUM  CHEMISTRY  SYMPOSIUM  NO.  28 

(Proceedings  of  the  International  Symposium  on  Atomic,  Molecular,  and  Condensed 
Matter  Theory  and  Computational  Methods) 

QUANTUM  BIOLOGY  SYMPOSIUM  NO.  21 

(Proceedings  of  the  International  Symposium  on  the  Application  of  Fundamental  Theory 
to  Problems  of  Biology  and  Pharmacology) 

All  of  the  above  symposia  can  be  individually  purchased  from  the  Subscription  Department,  John 
Wiley  &  Sons. 


